Often Solr documents are “semi-structured”. They have some structured fields and some free-text fields. e-mail messages are like that, with structured headers and an unstructured body.
wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Dec 9, 2015, at 4:13 AM, Alexandre Rafalovitch <arafa...@gmail.com> wrote: > > Don't think about indexing so much, think about searching. > > Say you are searching a video? What does that mean? Do you want to > match random sequence of binary values that represent inter-frame > change? Probably not. When you answer what you want to actually search > (title? length? subscripts?), you will discover that structure. What > do you want to return? A whole video, a segment, a description with a > link? > > So, you pre-process/index your data to give you the things you want to > search for and in the form you want them to receive. > > Regards, > Alex. > ---- > Newsletter and resources for Solr beginners and intermediates: > http://www.solr-start.com/ > > > On 9 December 2015 at 03:09, subinalex <alexkutt...@gmail.com> wrote: >> Hi, >> >> I am a solr newbie,just got a quick question. >> >> SOLR is designed for querying unstructured data,but then why we have to send >> it in a structured form(json,xml) for indexing?. >> >> Thanks & Regards,S >> Subin >> >> >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Unstructured-Structured-data-for-indexing-tp4244406.html >> Sent from the Solr - User mailing list archive at Nabble.com.