Re: embedded documents

Jack Krupansky Mon, 25 Aug 2014 06:46:27 -0700

Thanks, Erik, but... I've read that Jira several times over the past month,it is is far too cryptic for me to make any sense out of what it is reallytrying to do. A simpler approach is clearly needed.

My perception of SOLR-6304 is not that it indexes a single JSON object as asingle Solr document, but that it generates a collection of separatedocuments, somewhat analogous to Lucene block/child documents, but... notquite.

I understood the request on this message thread to be the flattening of asingle nested JSON object to a single Solr document.

IMHO, we need to be trying to make Solr more automatic and moreapproachable, not an even more complicated "toolkit".


-- Jack Krupansky

-----Original Message-----From: Erik Hatcher

Sent: Monday, August 25, 2014 9:32 AM
To: solr-user@lucene.apache.org
Subject: Re: embedded documents

Jack et al - there’s now this, which is available in the any-minute releaseof Solr 4.10: https://issues.apache.org/jira/browse/SOLR-6304


Erik

On Aug 25, 2014, at 5:01 AM, Jack Krupansky <j...@basetechnology.com> wrote:

That's a completely different concept, I think - the ability to return asingle field value as a structured JSON object in the "writer", ratherthan simply "loading" from a nested JSON object and distributing the keyvalues to normal Solr fields.
-- Jack Krupansky

-----Original Message----- From: Bill Bell
Sent: Sunday, August 24, 2014 7:30 PM
To: solr-user@lucene.apache.org
Subject: Re: embedded documents

See my Jira. It supports it via json.fsuffix=_json&wt=json

http://mail-archives.apache.org/mod_mbox/lucene-dev/201304.mbox/%3CJIRA.12641293.1365394604231.125944.1365397875874@arcas%3E

Bill Bell
Sent from mobile
On Aug 24, 2014, at 6:43 AM, "Jack Krupansky" <j...@basetechnology.com>wrote:
Indexing and query of raw JSON would be a valuable addition to Solr, somaybe you could simply explain more precisely your data model andtransformation rules. For example, when multi-level nesting occurs, whatdoes your loader do?
Maybe if the fielld names were derived by concatenating the full path ofJSON key names, like titles_json.FR, field_naming nesting could behandled in a fully automated manner.
I had been thinking of filing a Jira proposing exactly that, so that eventhe most deeply nested JSON maps could be supported, althoughcombinations of arrays and maps would be problematic.
-- Jack Krupansky

-----Original Message----- From: Michael Pitsounis
Sent: Wednesday, August 20, 2014 7:14 PM
To: solr-user@lucene.apache.org
Subject: embedded documents

Hello everybody,

I had a requirement to store complicated json documents in solr.

i have modified the JsonLoader to accept complicated json documents with
arrays/objects as values.

It stores the object/array and then flatten it and  indexes the fields.

e.g  basic example document

{
     "titles_json":{"FR":"This is the FR title" , "EN":"This is the EN
title"} ,
     "id": 1000003,
     "guid": "3b2f2998-85ac-4a4e-8867-beb551c0b3c6"
}
It will store titles_json:{"FR":"This is the FR title" , "EN":"This isthe
EN title"}
and then index fields

titles.FR:"This is the FR title"
titles.EN:"This is the EN title"


Do you see any problems with this approach?



Regards,
Michael Pitsounis

Re: embedded documents

Reply via email to