If you want to find individual subtitles, you have to index them separately. Maybe as parent child block.
Regards, Alex On 27/08/2014 11:43 am, "Rafael Merino García" < rmer...@paradigmatecnologico.com> wrote: > Hi all > > I have to index srt files belonged to videos so that the users can get not > only the video but also the time when their search takes place in it. For > the sake of clarity, you can find below an example of this kind of files: > > 1 > 00:00:08,580 --> 00:00:12,880 > Welcome back, and in this video we're > going to continue where we left off > > 2 > 00:00:12,880 --> 00:00:14,160 > in the previous video, and talk a > > 3 > 00:00:14,160 --> 00:00:16,840 > little bit more about the linear > programming problem. > > The easiest approach would be index so many documents as counters the file > is made up of (in the previous example it would be three), but then there > will be billion of tiny Solr documents in the index and also all the > documents belonged to the same srt file would have a lot of fields with the > same value (the ID, title of the video, etc) which means a lot of redundant > data. I know I could use new JOIN feature to get through it, but I'd like > to receive some inputs first from such a active community. > > Thanks in advance >