Hi, I'm parsing IMDB movie reviews (each movie is in its own text file). In my script, I'm trying to extract genre information. Movies have up to three categories of genres--but not all have a "genre" tag and that fact is making my script abort whenever it encounters a movie text file that doesn't have a "genre" tag.
I thought the following should solve it, but it doesn't. The basic question is how I say "if genre information doesn't at all, just make rg1=rg2=rg3="NA"? rgenre = re.split(r';', rf.info["genre"]) # When movies have genre information they store it as <genre>Drama;Western;Thriller</genre> if len(rgenre)>0: if len(rgenre)>2: rg1=rgenre[0] rg2=rgenre[1] rg3=rgenre[2] elif len(rgenre)==2: rg1=rgenre[0] rg2=rgenre[1] rg3="NA" elif len(rgenre)==1: rg1=rgenre[0] rg2="NA" rg3="NA" else len(rgenre)<1: # I was hoping this would take care of the "there is no genre information" scenario but it doesn't rg1=rg2=rg3="NA" This probably does a weird nesting thing, but even simpler version I have tried don't work. Thanks very much for any help! Tyler
_______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor