Ah... sure, you can create a schema that has several different document types in it, with extra fields that are used in some but not all documents - books have the metadata fields but no page bodies while pages have page bodies but no metadata. And maybe even do a Solr join for the "block" of pages that are for the same book. Or, just two queries - the first to get the pages, grouped, and then take their book names/IDs and query the book-level metadata. You can also store the book-level metadata in a separate Solr collection.

But, having said that, you have to decide whether your content search is a pure content search or whether you also want to search by metadata as well. The searchable metadata should be present on each of the pages in addition to the book level. That may seem like repetition, but that's okay. The bulk of the storage will be the page bodies themselves.

-- Jack Krupansky

-----Original Message----- From: denl0
Sent: Thursday, November 15, 2012 5:23 AM
To: solr-user@lucene.apache.org
Subject: Re: Solr defining Schema structure trouble.

Yes this is what I'm trying to do. But stuff related to the document like
language/title/...(i got way more fields) are stored many times. Each page
has a part of data that's the same is it possible to seperate that data?



--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-defining-Schema-structure-trouble-tp4020305p4020471.html Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to