Thanks for your reply Erik. I will use your suggested approach.
IMHO this could be something to add for future versions of solr. The Terrier IR-framework for example and other IR solutions allow to specify different XML-elements, which should be indexed in only one (lucene) field.
As I said in my previous post, this approach is especially helpful, if you have heterogeneous documents with different XML-elements.
Erik Hatcher wrote:
Thomas - you will need to do this client-side if you don't want to use copyField. The client needs to gather up all the text you want indexed and send that as <field name="text">....</field>Erik On Mar 14, 2007, at 3:50 AM, thomas arni wrote:Hello I'm currently evaluate solr for our needs. In a first step I used your example and adapted the “schema.xml”. In contrast to the example docs provided I haven't homogeneous documents, which means I only want to index to two fields. This fields are the uniqueKey (docno) and a textfield (text). <fields> <field name="docno" type="string" indexed="true" stored="true"/> <field name="text" type="text" indexed="true" stored="true"/> </fields> Instead of using the copyField for other XML-elements, to copy (and duplicate) this fields to my “text”-field, I want to specify which fields should be indexed directly in the “text”-field without copying nor duplicating. I have no need for additional index-fields in my heterogeneous environment. This extra fields only need additional space in my index, which is a disadvantage for me. How can I specify arbitrary xml-elements, which should be indexed in my one and only field “text”. I have no need of additional fields in my index. Any help is appreciated. Thomas
