Hi, I need to submit thousands of online PDF/html files to Solr. I can submit one file using SolrJ (StreamingUpdateSolrServer and ..solr.common.util.ContentStreamBase.URLStream), setting literal.idparameter to the url. I can't do the same with a batch of multiple files, as their 'id' should be unique (set to their urls).
I couldn't get this to work. Is there a way to somehow get the 'id' field set automatically to the url of the files posted to Solr (something like to 'stream_name')? How to set this in solrconfig.xml or schema.xml? or any other way? Thanks.