On Mon, Oct 10, 2011 at 1:17 AM, Pulkit Singhal <pulkitsing...@gmail.com> wrote:
> Hello Folks,
>
> I'm a big DIH fan but I'm fairly sure that now I've run into a scenario
> where it can't help me anymore ... but before I give up and roll my own
> solution, I jsut wanted to check with everyone else.
>
> The scenario:
> - already have 1M+ documents indexed
> - the schema.xml needs to have one more field added to it ...
> problem/do-able? yes? no? remove all the old data? or do the update per doc
> (add/delete)?

This is independent of DIH. If you want to add a new field to the schema,
you should reindex. 1M documents should not take that long.

> - need to populate data from a file that has a key and value per line and i
> need to use the key to find the doc to update and then add the value to the
> new schema field

It is best just to reindex, but it should be possible to write a script to pull
the doc from the existing Solr index, massage the return format into
Solr's XML format, adding a value for the new field in the process, and
then posting the new file to Solr for indexing.

Regards,
Gora

Reply via email to