We gather all the data for a document, then send it as one update to Solr. More precisely, we create one JSON object per document, then build a JSONL feed (one JSON object per line) of everything we want to send. The feed is compressed and saved in Amazon S3. Then we break it into batches and send those to Solr.
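Here is a rough sketch of the feed-and-batch flow described above, in Python. It is not our actual code: the function names, document fields, and batch size are made up for illustration, and an in-memory buffer stands in for the S3 object. The upload to S3 and the POST of each batch to Solr are left out.

```python
import gzip
import io
import json
from itertools import islice


def write_feed(docs, fileobj):
    """Write docs as gzip-compressed JSONL (one JSON object per line)."""
    with gzip.GzipFile(fileobj=fileobj, mode="wb") as gz:
        for doc in docs:
            gz.write((json.dumps(doc) + "\n").encode("utf-8"))


def read_batches(fileobj, batch_size):
    """Yield lists of at most batch_size documents from a gzipped JSONL feed."""
    with gzip.GzipFile(fileobj=fileobj, mode="rb") as gz:
        lines = (line for line in gz if line.strip())
        while True:
            batch = [json.loads(line) for line in islice(lines, batch_size)]
            if not batch:
                return
            yield batch


# Hypothetical documents; each one is complete, with every field it needs.
docs = [{"id": str(i), "tags": ["a", "b"]} for i in range(5)]

buf = io.BytesIO()  # stands in for the compressed feed saved in S3
write_feed(docs, buf)

buf.seek(0)
batches = list(read_batches(buf, batch_size=2))
# Each batch would then be sent to Solr as a single update request.
```

Because every line of the feed is a whole document, the same file can be replayed against any cluster without depending on what is already in the index.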
Putting the entire feed in S3 allows us to analyze that feed, load it into a test cluster, load yesterday’s feed, load it into a different prod cluster for disaster recovery, etc.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)

> On May 4, 2019, at 3:57 AM, Jie Luo <j...@ebi.ac.uk> wrote:
>
> Dear solr user,
>
> I have processes, first process is to build the SolrDocuments and indexes,
> other processes try to update the other fields of the solrdocuments and their
> indexes. Then I noticed that when I search previous indexed multivalue
> fields (not stored), it returns wrong results (fewer data). I tested with
> five documents, it will only return one documents with (field:*) search.
> Before I run the other processes, the behaviour is fine return 5 documents.
> However, it seems to me that single valued fields worked fine
>
> Best Regards
>
> Jie