Often Solr documents are “semi-structured”. They have some structured fields 
and some free-text fields. e-mail messages are like that, with structured 
headers and an unstructured body.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)


> On Dec 9, 2015, at 4:13 AM, Alexandre Rafalovitch <arafa...@gmail.com> wrote:
> 
> Don't think about indexing so much, think about searching.
> 
> Say you are searching a video? What does that mean? Do you want to
> match random sequence of binary values that represent inter-frame
> change? Probably not. When you answer what you want to actually search
> (title? length? subscripts?), you will discover that structure. What
> do you want to return? A whole video, a segment, a description with a
> link?
> 
> So, you pre-process/index your data to give you the things you want to
> search for and in the form you want them to receive.
> 
> Regards,
>   Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
> 
> 
> On 9 December 2015 at 03:09, subinalex <alexkutt...@gmail.com> wrote:
>> Hi,
>> 
>> I am a solr newbie,just got a quick question.
>> 
>> SOLR is designed for querying unstructured data,but then why we have to send
>> it in a structured form(json,xml) for indexing?.
>> 
>> Thanks & Regards,S
>> Subin
>> 
>> 
>> 
>> --
>> View this message in context: 
>> http://lucene.472066.n3.nabble.com/Unstructured-Structured-data-for-indexing-tp4244406.html
>> Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to