How can I grab the text (not code) off of a bunch of .htm files?
February 13, 2007 6:25 PM
Subscribe
How can I automatically grab the text (not code) off of a bunch of .htm files?
I have a bunch of .htm files which are based on the same template, and I am looking for a way to grab all the text from these pages and collect it in a text file for a voice actor to read. I could copy each page's text through a browser but I thought there had to be an easier way, as I need to grab the text from over 100 pages. Any advice appreciated!
posted by pantufla to computers & internet (16 comments total)
3 users marked this as a favorite
posted by saraswati at 6:37 PM on February 13, 2007 [1 favorite has favorites]