Hi Manuel,

Why don't you write a program that parses the HTML files, maybe using XSLT, and then submits the output to Solr?
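Here is a minimal sketch of that idea. It uses Python's standard-library `html.parser` instead of XSLT, because real-world HTML is often not well-formed XML. The field names `id` and `image_alt`, the helper names, and the update URL you would pass to `post_to_solr` are assumptions for illustration only; they have to match your own Solr schema and installation.

```python
import json
import urllib.request
from html.parser import HTMLParser


class AltTextParser(HTMLParser):
    """Collects the non-empty alt attribute of every <img> tag it sees."""

    def __init__(self):
        super().__init__()
        self.alt_texts = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names for us.
        if tag != "img":
            return
        alt = dict(attrs).get("alt")
        if alt and alt.strip():
            self.alt_texts.append(alt.strip())


def extract_alt_texts(html):
    """Return the alt texts of all images in an HTML string, in document order."""
    parser = AltTextParser()
    parser.feed(html)
    parser.close()
    return parser.alt_texts


def build_solr_doc(doc_id, html):
    """Build one document to index.

    "id" and "image_alt" are hypothetical field names; "image_alt" would
    need to be a multi-valued text field in your schema.
    """
    return {"id": doc_id, "image_alt": extract_alt_texts(html)}


def post_to_solr(docs, update_url):
    """POST a list of documents as JSON to a Solr update handler.

    update_url is an assumption, e.g. something like
    http://localhost:8983/solr/<core>/update?commit=true on your server.
    """
    payload = json.dumps(docs).encode("utf-8")
    request = urllib.request.Request(
        update_url,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

You would call `build_solr_doc` once per HTML file and pass the resulting list to `post_to_solr`. The exact path of the JSON update handler depends on your Solr version, so please check it against your setup.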
---
Marcelo

On Thursday, April 5, 2012, Manuel Antonio Novoa Proenza <mano...@estudiantes.uci.cu> wrote:
> Hello,
>
> I would like to know how to extract the alt attribute data from the images in HTML documents.

--
Marcelo Carvalho Fernandes
+55 21 8272-7970
+55 21 2205-2786