I agree with all the ideas so far explained, but actually I would have suggested the DIH ( Data Import Handler) as a first plan.
It does already allow out of the box indexing from different datasources. It supports Jdbc datasources with extensive processors and it does support also a file system datasource with the possibility of using the TikaEntityProcessor. So actually the requirement of the user can be reached directly with a single configuration of the DIH and a proper schema design. Of course if the situation gets more complicated there will be the necessity of customising some DIH component or proceeding writing a custom Indexer. Cheers 2015-06-11 16:20 GMT+01:00 Erick Erickson <erickerick...@gmail.com>: > Here's a skeleton that uses Tika from a SolrJ client. It mixes in > a database too, but the parts are pretty separate. > > https://lucidworks.com/blog/indexing-with-solrj/ > > Best, > Erick > > On Thu, Jun 11, 2015 at 7:14 AM, Paden <rumsey...@gmail.com> wrote: > > You were very VERY helpful. Thank you very much. If I could bug you for > one > > last question. Do you know where the documentation is that would help me > > write my own indexer? > > > > > > > > -- > > View this message in context: > http://lucene.472066.n3.nabble.com/Merging-Sets-of-Data-from-Two-Different-Sources-tp4211166p4211180.html > > Sent from the Solr - User mailing list archive at Nabble.com. > -- -------------------------- Benedetti Alessandro Visiting card : http://about.me/alessandro_benedetti "Tyger, tyger burning bright In the forests of the night, What immortal hand or eye Could frame thy fearful symmetry?" William Blake - Songs of Experience -1794 England