Hi, I want to ingest a collection of documents, along with full text extracted from PDFs, using Solr's 'update/extract' endpoint, storing the extracted text in a field called "fullText". I also want to relate some documents to others, so that when I query the "fullText" field with user terms, Solr returns the first matching document whose "contentType" field equals "overview", plus several related documents with different "contentType" values, like this:
{
  "id":"1",
  "contentType":"overview",
  "fullText":"Lorem ipsum dolor sit amet, consectetur adipiscing elit. Etiam consectetur ipsum libero, at egestas ante laoreet nec. Aliquam sem elit, rhoncus efficitur laoreet sodales, hendrerit eget mi. Nulla facilisis tincidunt tortor vel placerat. Phasellus blandit velit eget semper tristique. Maecenas convallis orci purus, ac scelerisque erat pulvinar id. Donec semper enim id justo cursus, vitae bibendum magna interdum. Maecenas eu laoreet nibh. Quisque magna massa, semper et lorem sed, volutpat pulvinar quam. Quisque a urna et risus feugiat fermentum nec et orci. Pellentesque ac neque sed tortor convallis finibus sit amet id purus. Sed blandit eget ante et semper. Vivamus.",
  "product":"paper & goods"
},
{
  "id":"2",
  "contentType":"support",
  "title":"The latest support boards",
  "points":["Nulla facilisis tincidunt tortor vel placerat."," Phasellus blandit velit eget semper tristique."],
  "product":"paper & goods",
  "parentID":"1"
},
{
  "id":"3",
  "contentType":"boards",
  "title":"",
  "points":["Nulla facilisis tincidunt tortor vel placerat."," Phasellus blandit velit eget semper tristique."],
  "product":"paper & goods",
  "parentID":"1"
}

I'm looking for recommendations on ingesting and querying these documents.

Can I ingest them by nesting the related documents as child documents inside the overview document, and still extract full text from a PDF for the overview? If so, how do I query for both the parent and its child documents?

Or should I skip nesting and instead store the overview's "id" value in a "parentID" field on each related document? If so, how do I form a query that matches documents whose "parentID" field equals the "id" of a matching overview document?
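To make the two options concrete, here is a minimal sketch of the request parameters I have in mind. It only builds URLs and parameter dictionaries and sends nothing. The host and collection name (localhost:8983, "docs") and the function names are hypothetical. It assumes the stock ExtractingRequestHandler parameters (literal.*, fmap.content), the {!join} query parser for the flat "parentID" layout, and the [child] document transformer for the nested layout. Whether parentFilter is needed depends on the Solr version and schema, and user terms are not escaped here.

```python
from urllib.parse import urlencode

SOLR = "http://localhost:8983/solr/docs"  # hypothetical host and collection


def extract_url(doc_id, content_type, product):
    """URL for posting one PDF to update/extract (the PDF is the request body)."""
    params = {
        "literal.id": doc_id,                # literal.* sets a field value directly
        "literal.contentType": content_type,
        "literal.product": product,
        "fmap.content": "fullText",          # map the extracted body text to fullText
        "commit": "true",
    }
    return f"{SOLR}/update/extract?{urlencode(params)}"


def overview_query(terms):
    """Query string matching overview documents on the user's terms (unescaped)."""
    return f"+contentType:overview +fullText:({terms})"


def flat_children_params(terms):
    """Flat layout: documents whose parentID equals the id of a matching overview."""
    return {
        "q": "{!join from=id to=parentID v=$overviewq}",
        "overviewq": overview_query(terms),  # dereferenced by v=$overviewq
    }


def nested_parent_params(terms):
    """Nested layout: matching overviews, with their children attached by [child]."""
    return {
        "q": overview_query(terms),
        "fl": "*,[child parentFilter=contentType:overview]",
        "rows": "1",                         # only the first matching overview
    }


if __name__ == "__main__":
    print(extract_url("1", "overview", "paper & goods"))
    print(f"{SOLR}/select?{urlencode(flat_children_params('lorem ipsum'))}")
    print(f"{SOLR}/select?{urlencode(nested_parent_params('lorem ipsum'))}")
```

With the flat layout this would take two requests, one with overview_query() as q for the overview itself and one with the join for its related documents. With the nested layout a single request would return the overview with its children attached.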
--
Stephon Harris
Enterprise Knowledge, LLC
Web: http://www.enterprise-knowledge.com/
E-mail: shar...@enterprise-knowledge.com
Cell: 832-628-8352