Use SolrJ if you end up developing Indexer in Java to send documents to Solr. Its been a long i have used DIH but you can gave it a try first, otherwise as Walter suggested developing external indexer is best.
On Sun, Jul 9, 2017 at 6:46 PM, Walter Underwood <wun...@wunderwood.org> wrote: > 4. Write an external program that fetches the file, fetches the metadata, > combines them, and send them to Solr. > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ (my blog) > > > > On Jul 9, 2017, at 3:03 PM, Giovanni De Stefano <giova...@servisoft.be> > wrote: > > > > Hello all, > > > > I have to index (and search) data organised as followed: many files on > the filesystem and each file has extra metadata stored on a DB (the DB > table has a reference to the file path). > > > > I think I should have 1 Solr document per file with fields coming from > both the DB (through DIH) and from Tika. > > > > How do you suggest to proceed? > > > > 1. index into different cores and search across cores (I would rather > not do that but I would be able to reuse “standard” importers) > > 2. extend the DIH (which one?) > > 3. implement a custom import handler > > > > How would you do it? > > > > Developing in Java is not a problem, I would just need some ideas on > where to start (I have been away from Solr for many years…). > > > > Thanks! > > G. > >