What's the quickest way to code this data?
October 16, 2012 6:32 AM Subscribe
I have an Excel spreadsheet with just over 3000 sentences in column A. I need to go through looking at each one and entering some data into columns B-F. The first thing I have to enter is whether the sentence is in the past tense or present tense. If it is past tense, I can ignore it and not have to enter anything in columns C-F. I just move on to the next sentence. Is it going to be quicker if I go through the whole spreadsheet filling out column B first (past/present) and then do a second pass to fill out the rest of the details for the present tense sentences, or quicker to do it all at once for each sentence? Or is it a wash?
posted by lollusc to computers & internet (8 answers total)
The features in the other columns are each things that can take a second's thought to decide on, but no more than that. For example "Is the subject a pronoun?"; "Is the verb a form of the verb to be?", etc.
Any other tips for speeding this up would be appreciated. I've already arranged my columns in optimum order, I think, through a bit of trial and error.
(Incidentally, this needs to be extremely accurate, so I can't just program in an algorithm to reduce my sentences by weeding out anything with e.g. "he" followed by a word ending in "-ed". I tried various more complicated versions of this - they all had too many errors - both false positives and false negatives. And the language is a non-standard dialect, conversational and colloquial, so I'm not having luck with automating things like subject identification either. I'd appreciate it if this doesn't turn into a discussion about that sort of automation and just assumes I'm going to be doing it all by hand.)