Grab list of Amazon Prime video URLs?
January 8, 2018 12:31 PM Subscribe
Is there a quick way to grab all the Prime video URLs of a media property with the property's associated text title?
In particular, in working on my DS9 Duffer's Guide, it would be very convenient to have all the APV URLs for each episode of the series in a table so that when I build the posting text I can enable URLs for each episode without manually building the HTML tag.
Google searches appear to produce hits to only consumer-facing resources such as the Prime series home page or to IMDB.
I suppose I could scrape the Prime series home page down to a source page and manually strip out the garbage, but it seems likely there's a better way, unless not providing such a hook in public is a business goal intended to increase page impressions, much as APV did not allow autoplay for a long time.
TIA!
In particular, in working on my DS9 Duffer's Guide, it would be very convenient to have all the APV URLs for each episode of the series in a table so that when I build the posting text I can enable URLs for each episode without manually building the HTML tag.
Google searches appear to produce hits to only consumer-facing resources such as the Prime series home page or to IMDB.
I suppose I could scrape the Prime series home page down to a source page and manually strip out the garbage, but it seems likely there's a better way, unless not providing such a hook in public is a business goal intended to increase page impressions, much as APV did not allow autoplay for a long time.
TIA!
Response by poster: looks like something worth checking out, sure
posted by mwhybark at 1:28 PM on January 8, 2018
posted by mwhybark at 1:28 PM on January 8, 2018
Response by poster: adding 'scraper' to Google search is surfacing more useful results, such as this. 2014 seems like kind of a long time ago. The package does use Beautiful Soup.
posted by mwhybark at 2:56 PM on January 8, 2018
posted by mwhybark at 2:56 PM on January 8, 2018
Response by poster: I ended up just grabbing source for each season page and running some quick greps in bbedit, less than an hour's work to strip out the unneeded info.
posted by mwhybark at 10:31 PM on January 9, 2018
posted by mwhybark at 10:31 PM on January 9, 2018
« Older Two browsers talking to each other via a webserver... | Should I try to contact my ex-best friend again? Newer »
This thread is closed to new comments.
posted by bradbane at 1:09 PM on January 8, 2018 [1 favorite]