Extract reviews and ratings from IMDB

Zach picture Zach · Jul 18, 2014 · Viewed 7.9k times · Source

The Internet Movie Database does not allow scraping from their website, but does provide an FTP site with text files that can be downloaded and used for research: http://www.imdb.com/interfaces

How can I extract reviews and the corresponding ratings from this FTP server, preferably in Python?

Answer

Davide Alberani picture Davide Alberani · Jul 19, 2014

Reviews are not distributed by IMDb in their plain text data files.

For all the other data, you can parse them and store them in a SQL database using IMDbPY, or look at it to see how to parse only the information that are relevant to you.