Thanks Alex,

> Do you just care about document content?
Content only. The documents (not necessarily coming from a DB) are pushed through SolrJ. That is at least the initial idea, mainly because of the dynamic nature of our index/search architecture.
I could of course push the filename(s) in a field, but this would require Solr to extract the content from the given file (triggered by the field type, e.g. "filecontent"). Is something like this possible when indexing with Solr?

> DataImportHandler
Would I need to write a custom DIH? Or can the DIH be used as is, i.e. configured purely through data-config.xml?

> nested entities design
Could you point me to a description of this concept?

-----Original Message-----
From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
Sent: Friday, 12 September 2014 14:12
To: solr-user
Subject: Re: SolrJ : fieldcontent from (multiple) file(s)

Do you just care about document content? Not metadata, such as file name, date, author, etc?

Does it have to be pushed into Solr, or can it be pulled? If pull, DataImportHandler should be able to do what you want with a nested entities design.

Regards,
   Alex.
Personal: http://www.outerthoughts.com/ and @arafalov
Solr resources and newsletter: http://www.solr-start.com/ and @solrstart
Solr popularizers community: https://www.linkedin.com/groups?gid=6713853

On 12 September 2014 06:53, Clemens Wyss DEV <clemens...@mysign.ch> wrote:
> Looks like I haven't finished "I know"
> I know I could extract the content on our server's side, but I'd really like
> to take that burden off it.
> That said:
> Can I hand in the path-to-the-file in a "specific field" which would yield an
> extraction in Solr?
>
> -----Original Message-----
> From: Clemens Wyss DEV [mailto:clemens...@mysign.ch]
> Sent: Friday, 12 September 2014 11:30
> To: 'solr-user@lucene.apache.org'
> Subject: SolrJ : fieldcontent from (multiple) file(s)
>
> First of all I'd like to say hello to the Solr world/community ;) So far we
> have been using Lucene as-is and now intend to go for Solr.
>
> Say I have a document which in one field should have the content of a
> file (indexed only, not stored), in order to make the document
> searchable due to the file's content. I know
>
> How is this achieved using SolrJ, i.e. how do I hand in this document?
>
> Thx
> Clemens
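
For reference, the "nested entities" idea mentioned for the pull approach can be sketched as a plain data-config.xml, with no custom DIH code. This is only a rough sketch under assumptions: the directory /path/to/files, the file name pattern, and the Solr field names "id" and "filecontent" are hypothetical and would need to match the actual schema, and the Tika-based processor is assumed to be available through the DIH extras contrib on the Solr classpath. The outer entity lists the files; the inner entity runs once per file and extracts its text.

```xml
<dataConfig>
  <!-- Binary file source used by the inner (Tika) entity -->
  <dataSource type="BinFileDataSource" name="bin"/>
  <document>
    <!-- Outer entity: walks a directory and emits one row per file.
         baseDir and fileName are placeholders for illustration. -->
    <entity name="files"
            processor="FileListEntityProcessor"
            baseDir="/path/to/files"
            fileName=".*\.(pdf|doc|txt)"
            recursive="true"
            rootEntity="false"
            dataSource="null">
      <!-- fileAbsolutePath is an implicit column of FileListEntityProcessor -->
      <field column="fileAbsolutePath" name="id"/>

      <!-- Inner (nested) entity: runs once per file row and extracts the text -->
      <entity name="content"
              processor="TikaEntityProcessor"
              url="${files.fileAbsolutePath}"
              format="text"
              dataSource="bin">
        <!-- "filecontent" is an assumed schema field (indexed, not stored) -->
        <field column="text" name="filecontent"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

With rootEntity="false" on the outer entity, each file row (rather than the file list as a whole) becomes one Solr document, so the extracted text ends up in the same document as the file path.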