July 18, 2014 12:29 PM
I grabbed a bunch of posts from a vBulletin forum to analyze for a research project (with permission, of course, from the forums and the users and my IRB), and I've got it all cleaned up and ready to go. I've got thousands of files in .xls (Microsoft Excel 97-2003) format. So far, so good. Now how do I get these files into a format that my analysis software will play nicely with?
Is there a macro that I can apply to all of the .xls files in a specific folder that will convert them into HTML, PDF, or Word files? If there is a macro, how do I go about applying it to all of the files in a specific folder?
I do not want to open every single file. There are thousands of files, and I've messed with all of them individually multiple times already.
If you could walk me through this like I am not very technologically savvy, that would be wonderful.
More details: I am using NVivo 10 for data analysis. For the last year I have been using the "Dataset" feature to pull in each spreadsheet as data, analyzing it that way - but I don't need to do all that. It's really time consuming and every file has to be done manually with a series of annoying clicks. It takes forever and imposes structure on my data that really seems unnecessary from my point of view. It's slowing me down and impeding my progress in a significant way. Other file formats - Word, PDF, text files, html files - can just be dragged-and-dropped into NVivo wholesale, which is just so, so much easier. I just need to be able to read the text, know which user typed which thing and in what order, and that's it - comments aren't threaded or anything like that.
I don't need the data to appear in spreadsheet form. An HTML file would work just as well as long as the structure of the text remains (e.g. one "chunk" of text for each row would be fine; it's really just usernames and the content of the posts that I'm interested in actually reading and analyzing). A Word file would be OK too. PDF is the last acceptable format on my list, because although I can drag-and-drop PDFs, it's slow as molasses for some reason.
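To make the "one chunk of text per row" idea concrete, here is a rough sketch of the kind of batch conversion I mean, written in Python rather than as an Excel macro. It is only an illustration under several assumptions that are not from my actual files: that the third-party xlrd package is installed (pip install xlrd), that each file's first sheet holds the data, and that column 1 is the username and column 2 is the post text. The function names are made up for this example.

```python
import html
from pathlib import Path


def read_xls_rows(path):
    """Return the first sheet's rows as lists of cell values.

    Assumption: the third-party xlrd package is installed.
    """
    import xlrd  # imported here so the rest of the sketch works without it

    sheet = xlrd.open_workbook(str(path)).sheet_by_index(0)
    return [sheet.row_values(i) for i in range(sheet.nrows)]


def rows_to_html(rows, title):
    """Build one HTML page with one paragraph ("chunk") per spreadsheet row."""
    parts = [
        '<html><head><meta charset="utf-8"><title>%s</title></head><body>'
        % html.escape(title)
    ]
    for row in rows:
        if len(row) < 2:
            continue  # skip rows that lack a username or a post
        # Assumed layout: column 0 = username, column 1 = post text.
        user = html.escape(str(row[0]))
        text = html.escape(str(row[1]))
        parts.append("<p><b>%s</b>: %s</p>" % (user, text))
    parts.append("</body></html>")
    return "\n".join(parts)


def convert_folder(folder, reader=read_xls_rows):
    """Write an .html file next to every .xls file in folder; return the count."""
    count = 0
    for xls_path in sorted(Path(folder).glob("*.xls")):
        page = rows_to_html(reader(xls_path), xls_path.stem)
        xls_path.with_suffix(".html").write_text(page, encoding="utf-8")
        count += 1
    return count
```

Calling convert_folder on the folder that holds the spreadsheets would leave the .xls files untouched and write a matching .html file beside each one, keeping rows in their original order, so the results could then be dragged into NVivo in one go. The column positions would need adjusting to match the real spreadsheets.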