@Gora Thank you! I know that Solr accepts XML containing Solr-specific elements, i.e. commands that only it understands, such as <add/>, <commit/>, etc.
Question: Is there some way to ask Solr to dump out whatever it already has in its index, as a Solr XML document?

Plan: I intend to massage that XML dump (add the field + value that I need to every doc's XML element), and then I should be able to push the dump back to Solr to get the data indexed again, I hope.

Thanks!
- Pulkit

On Sun, Oct 9, 2011 at 2:57 PM, Gora Mohanty <g...@mimirtech.com> wrote:
> On Mon, Oct 10, 2011 at 1:17 AM, Pulkit Singhal <pulkitsing...@gmail.com> wrote:
> > Hello Folks,
> >
> > I'm a big DIH fan but I'm fairly sure that now I've run into a scenario
> > where it can't help me anymore ... but before I give up and roll my own
> > solution, I just wanted to check with everyone else.
> >
> > The scenario:
> > - already have 1M+ documents indexed
> > - the schema.xml needs to have one more field added to it ...
> > problem/do-able? yes? no? remove all the old data? or do the update per doc
> > (add/delete)?
>
> This is independent of DIH. If you want to add a new field to the schema,
> you should reindex. 1M documents should not take that long.
>
> > - need to populate data from a file that has a key and value per line and I
> > need to use the key to find the doc to update and then add the value to the
> > new schema field
>
> It is best just to reindex, but it should be possible to write a script to pull
> the docs from the existing Solr index, massage the returned format into
> Solr's XML format, adding a value for the new field in the process, and
> then post the new file to Solr for indexing.
>
> Regards,
> Gora
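
The script Gora describes (pull docs from the index, add the new field, post them back as Solr XML) could look roughly like the sketch below. It is only an illustration under several assumptions that are not from the thread: Solr is reachable at http://localhost:8983/solr, the unique key field is called "id", the new field is called "new_field", and the key/value file is tab-separated. The helper names are made up for this example. Note also that a query only returns stored fields, so anything indexed but not stored would be lost by this round trip.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

SOLR_URL = "http://localhost:8983/solr"  # assumption: adjust to your install


def load_key_values(lines, sep="\t"):
    """Parse 'key<sep>value' lines into a dict (the separator is an assumption)."""
    values = {}
    for line in lines:
        line = line.rstrip("\n")
        if line:
            key, value = line.split(sep, 1)
            values[key] = value
    return values


def fetch_docs(start, rows):
    """Fetch one page of stored documents through the standard /select handler."""
    params = urllib.parse.urlencode(
        {"q": "*:*", "wt": "json", "start": start, "rows": rows}
    )
    with urllib.request.urlopen(f"{SOLR_URL}/select?{params}") as resp:
        return json.load(resp)["response"]["docs"]


def to_text(value):
    """Render a field value as text; booleans are lowercased for Solr."""
    if isinstance(value, bool):
        return "true" if value else "false"
    return str(value)


def docs_to_add_xml(docs, new_values, key_field="id", new_field="new_field"):
    """Build an <add> message, adding new_field to docs whose key has a value."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            # Multi-valued fields come back as lists: one <field> per value.
            for v in value if isinstance(value, list) else [value]:
                ET.SubElement(doc_el, "field", name=name).text = to_text(v)
        key = str(doc[key_field])
        if key in new_values:
            ET.SubElement(doc_el, "field", name=new_field).text = new_values[key]
    return ET.tostring(add, encoding="unicode")


def post_xml(xml_text):
    """POST an XML update message (<add>...</add> or <commit/>) to /update."""
    req = urllib.request.Request(
        f"{SOLR_URL}/update",
        data=xml_text.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    urllib.request.urlopen(req).close()


def reindex(key_value_path, page_size=1000):
    """Page through the whole index, re-posting each doc with the new field."""
    with open(key_value_path, encoding="utf-8") as fh:
        new_values = load_key_values(fh)
    start = 0
    while True:
        docs = fetch_docs(start, page_size)
        if not docs:
            break
        post_xml(docs_to_add_xml(docs, new_values))
        start += page_size
    post_xml("<commit/>")
```

Calling reindex("values.tsv") (a hypothetical file name) after adding the field to schema.xml would rewrite every document in place, because posting a doc with an existing unique key replaces the old one. If other fields in your schema are copyField targets, you would probably want to skip them when rebuilding each doc.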