Indexing and query of raw JSON would be a valuable addition to Solr, so maybe you could simply explain more precisely your data model and transformation rules. For example, when multi-level nesting occurs, what does your loader do?

Maybe if the fielld names were derived by concatenating the full path of JSON key names, like titles_json.FR, field_naming nesting could be handled in a fully automated manner.

I had been thinking of filing a Jira proposing exactly that, so that even the most deeply nested JSON maps could be supported, although combinations of arrays and maps would be problematic.

-- Jack Krupansky

-----Original Message----- From: Michael Pitsounis
Sent: Wednesday, August 20, 2014 7:14 PM
To: solr-user@lucene.apache.org
Subject: embedded documents

Hello everybody,

I had a requirement to store complicated json documents in solr.

i have modified the JsonLoader to accept complicated json documents with
arrays/objects as values.

It stores the object/array and then flatten it and  indexes the fields.

e.g  basic example document

 {
       "titles_json":{"FR":"This is the FR title" , "EN":"This is the EN
title"} ,
       "id": 1000003,
       "guid": "3b2f2998-85ac-4a4e-8867-beb551c0b3c6"
  }

It will store titles_json:{"FR":"This is the FR title" , "EN":"This is the
EN title"}
and then index fields

titles.FR:"This is the FR title"
titles.EN:"This is the EN title"


Do you see any problems with this approach?



Regards,
Michael Pitsounis

Reply via email to