I have several scripts for movies and TV shows which appear to be in a standard tab-delimited format. I would like to get each character's lines alone into a new separate text document (likely via a computer script). [more inside]
I was peripherally involved in a recent work project predicate languages parsing and some natural language. I am interested in learning about modern development environments in these areas. Very basic and on up - I know what predicates are and natural language processing at a beginner level, decent understanding of code and networks, but not computer scientist. I want to easily parse and database a string of predicates from another application, set a value, quickly evaluate only if relevant, evaluate them against a database and set a shared memory location. All thoughts appreciated!
I'm studying Japanese. I want to tag and track individual words and grammatical structures that I'm learning. What software will help me do this? [more inside]
Can you point me to resources that deal with the thresholds that computers use to overwrite authorial intent? I'm interested in the boundaries between automatic correction and untrammeled thumbs.
Help me find this data analysis tool, so I can process lots of cool data. [more inside]
Matlab textparsing filter: So for a project, I am working with over one hundred years of hourly data collected for water levels. I have all of this collected in a giant, ~35 Mb text file. Toward the last decade or so, there are double entries for water level thanks to semidiurnal tide measurement upgrades, and so there is an irregular number of columns that isn't easy to work with the usual data import strategies. I need your help, citizens of Metafilter, so that I can successfully rig the Matlab textscan command (or something else) to read in this formatted data, and make accommodations for empty value strings! More inside. [more inside]
How can I parse several largish (~6mb) text documents to produce a common index of keywords and phrases? I need something that will recognize phrases as well as key words, kind of like Amazon's Statistically Improbable Phrases. [more inside]
Should-be-simple Linux timestamped file parsing question. [more inside]
How do I parse a few lines from several hundred word documents into a spreadsheet? [more inside]
ParseFilter: I have a CSV file full of leads I need to parse into a more, er, concise format. What would the hive mind recommend? [more inside]
RSS to HTML: Why can't my PHP file open remote RSS files? [more inside]
Lexical analysis! What are some good resources for a beginner? [more inside]
Is there a way to represent algorithms in a form that in turn requires minimal or no knowldege of other algorithms? [more inside]
Can anyone suggest good PHP books or existing scripts that can help me successfully parse RSS 1.0, 2.0 and Atom 0.3 feeds, then save them to a MySQL database? [more inside]
A friend wants to include headlines from my website on his site. Is there a simple way for him to publish headlines as links using my RSS feed, preferably without the branding of a third party service? [more inside]
htaccess, SSI, and PHP parsing. Can one file get both php and ssi parsing? if yes, how, if not, help! [*]