need a library for text classification
July 20, 2008 4:17 PM Subscribe
I need to be able to automatically identify language (English, Japaneese, Russian, etc ... ) in which a particular blog-post has been written. (lang attribute might or might not be available).
Few years ago I came across a library for RSS feeds that was doing roughly what I need - can not find it anymore though.
posted by chexov to computers & internet (4 answers total) 1 user marked this as a favorite
Not a very sophisticated algorithm, but it's simple and might work perfectly well.
posted by hattifattener at 4:43 PM on July 20, 2008