Server-side script for combining RSS feeds?
January 2, 2009 10:43 AM
Subscribe
Do you know of a server-side script for combining RSS feeds, removing duplicates, filtering out items that match keywords, and then generating a new RSS feed as a result?
I'm looking for something that can be run on a standard web server with a typical LAMP configuration. I have 189 feeds, 3500+ feed items, and 500+ keyword filters I want to combine.
I'd settle for a combination of services that successfully can each do one of the tasks of combining feeds, filtering out duplicates, acting on keywords matches, and then generating a clean, unified feed.
This feed will be loaded into a shared Google Reader account, where various people will look at the combined feed items and star some for later attention, so I'm not looking for a client-side single-user feed reader.
Newsgator and Bloglines do not allow the marking and sorting of posts that I want, nor are they particularly efficient at going through hundreds of posts in a single sitting. Google Reader's single-letter keyboard commands are hard to beat.
Google Reader, however, does not do filtering (its biggest drawback out of the many drawbacks it has). There is a Greasemonkey script for doing filtering, but the people who will be viewing this combined feed either do not use Firefox or do not have the technical aptitude to use and update Greasemonkey. Further, I add and revise the keyword filters dozens of times a day, so a client-side filter doesn't really work. It needs to be server side so my changes are reflected wherever the work is being done.
Yahoo Pipes chokes and fails on the feeds. It only sporadically pushes out XML, it only pushes out a small bit of it, and it does it infrequently and after much delay.
FeedRinse keeps failing to add all of my feeds, inexplicably not saving them when I add them to a channel. The ones that are added to a channel are not pushing out any aggregated XML. The individually rinsed channels do load, but I don't want to individually add all those keyword filters to all those feeds, which is the point of channels. I would spend days just entering in the keyword filters.
MySyndicaat does not seem to be permitting new registrations (hitting the submit button throws up a very stupid pop-up that tells you to go to another site, where they lead you right back to the same registration that doesn't work). RSSMix, which I might be able to use to at least combine the feeds, does accept and read the feed but times out when I try to read the aggregated feed it produces.
MyFeedz is shut down.
Google Reader's sharing will only show 20 items of a shared folder containing all the feeds; I need it to show ALL of those items in the shared folder. Otherwise, it might serve as a decent feed aggregator.
posted by Mo Nickels to computers & internet (13 comments total)
2 users marked this as a favorite
posted by shownomercy at 11:20 AM on January 2