Thanks Yonik and Walter,

Put that way, it does make good sense not to store the transient XML 
file, which is what it is in most use cases. (I was thinking differently because 
I do have XML files on the file system or over HTTP, e.g. from a WebDAV call.)

Anyway, thanks for all the answers, and again, sorry for my mails not being 
indented properly at the moment; it irritates me as well :-)

Regards Ard


> Thanks for the info. Would it be an improvement to be able to specify in 
> the schema.xml whether the URI should be stored in a field whose 
> name you can also specify in the schema? It may well be that 
> you do not "own" the XML documents you index over HTTP, and at the same time 
> you do not want to store their contents in the index. Since the URI is known 
> at indexing time, adding it to the index is trivial.


Think of it a different way... Solr isn't indexing XML documents, it's
simply using XML as a serialization format to pass the data to be
indexed.  Often, a program is written to read some other data source
(like a database), and send an XML message to Solr to index it (and
hence the XML document only exists for a very brief time).

-Yonik
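
A minimal sketch of the pattern Yonik describes, assuming Python and
Solr's XML update message format (`<add><doc><field name="...">`). The
row contents, the field names (including `source_uri`) and the update
URL are hypothetical and would have to match your own schema.xml and
setup. It also shows that a URI can simply be sent as one more field:

```python
import urllib.request
import xml.etree.ElementTree as ET


def build_add_message(rows):
    """Serialize rows (dicts of field name -> value) into a Solr <add> message."""
    add = ET.Element("add")
    for row in rows:
        doc = ET.SubElement(add, "doc")
        for name, value in row.items():
            field = ET.SubElement(doc, "field", name=name)
            field.text = str(value)
    # Returns bytes; the XML exists only in memory, never on disk.
    return ET.tostring(add, encoding="utf-8")


def post_to_solr(message, update_url):
    """POST the message to Solr. update_url is an assumption about your setup."""
    request = urllib.request.Request(
        update_url,
        data=message,
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Hypothetical rows, standing in for the result of a database query.
rows = [
    {
        "id": "doc-1",
        "title": "First record",
        "source_uri": "http://example.org/a.xml",
    },
]
message = build_add_message(rows)
```

Calling `post_to_solr(message, update_url)` with the update URL of your
own Solr instance would then send the message; a separate `<commit/>`
message is needed before the documents become searchable.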


