Hi Davide, Thanks to all your help and suggestions, everything has gone perfect so far. But I guess there is one problem about the local IMDb database that has been created with the help of IMDbPY. So I wanted to contact you to ask for your opinion.
After some investigation, I have found out that imdbId is required in order to reach the thumbnail of a movie. There is even an explanation made by you in the help forum, which says get_movie procedure can be used for this purpose. But I see that in my current database, imdb_id column in title table is all blank, although all other fields are full. Please also note that I have downloaded almost all gz files from imdb interface. Could you please explain which file causes this field to be set or is there any other way to store the thumbnails in my local database during the run of imdbpy2sql? Otherwise, even if my imdb_ids become full, I will have to run other scripts for storing thumbnails for each movie in my local database, which will certainly cause great loss of time. Thank you very much for your support. Best Regards, Gozde 2009/5/12 Davide Alberani <[email protected]> > On May 12, Gozde Ozbal <[email protected]> wrote: > > > I just wanted to let you know that after increasing my RAM capacity, > > I am now able to import all movie data in only 190 minutes :) > > Much better. :-) > > > I still receive the incorrect string value warnings although I have > > changed mysql conf file to use utf8 as mentioned in the document > > that you have mentioned. > > That's not good; I'd try to fix these, too. > Do you have an example of the text of one of the warnings? > > > 1) downloading every file on imdb interface and running IMDbPY script > > at the end of each week (that will certainly be more accurate but > > time consuming) > > After that you have to identify which ones are new. > That's not easy as it seems, since movies can change titles (while > still in production but even after release). > Maybe you can check the release dates to see which movies are about > to be released, but I'm not sure it will suit your needs. > > > 2) Properly crawling http://www.imdb.com/nowplaying/ or > > http://italian.imdb.com/Recent/ > > >From there you can parse titles/movieID pairs, check if the > title is "new to you" and fetch its data if needed. > > > Please, let me know if you have any other idea than those two. > > Probably fetching a list on the web is the best solution: I don't > know if these lists are complete, but look exactly what you need. > > > Bye and good luck for you thesis! > > -- > Davide Alberani <[email protected]> [GPG KeyID: 0x465BFD47] > http://erlug.linux.it/~da/ <http://erlug.linux.it/%7Eda/> > > > ------------------------------------------------------------------------------ > The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your > production scanning environment may not be a perfect world - but thanks to > Kodak, there's a perfect scanner to get the job done! With the NEW KODAK > i700 > Series Scanner you'll get full speed at 300 dpi even with all image > processing features enabled. http://p.sf.net/sfu/kodak-com > _______________________________________________ > Imdbpy-help mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/imdbpy-help >
------------------------------------------------------------------------------ Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT is a gathering of tech-side developers & brand creativity professionals. Meet the minds behind Google Creative Lab, Visual Complexity, Processing, & iPhoneDevCamp asthey present alongside digital heavyweights like Barbarian Group, R/GA, & Big Spaceship. http://www.creativitycat.com
_______________________________________________ Imdbpy-help mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/imdbpy-help
