Thanks for your reply Erik. I will use your suggested approach.
IMHO this could be something to add for future versions of solr. The
Terrier IR-framework for example and other IR solutions allow to specify
different XML-elements, which should be indexed in only one (lucene) field.
As I said in my previous post, this approach is especially helpful, if
you have heterogeneous documents with different XML-elements.
Erik Hatcher wrote:
Thomas - you will need to do this client-side if you don't want to use
copyField. The client needs to gather up all the text you want
indexed and send that as <field name="text">....</field>
Erik
On Mar 14, 2007, at 3:50 AM, thomas arni wrote:
Hello
I'm currently evaluate solr for our needs. In a first step I used your
example and adapted the “schema.xml”.
In contrast to the example docs provided I haven't homogeneous
documents, which means I only want to index to two fields. This fields
are the uniqueKey (docno) and a textfield (text).
<fields>
<field name="docno" type="string" indexed="true" stored="true"/>
<field name="text" type="text" indexed="true" stored="true"/>
</fields>
Instead of using the copyField for other XML-elements, to copy (and
duplicate) this fields to my “text”-field, I want to specify which
fields should be indexed directly in the “text”-field without copying
nor duplicating. I have no need for additional index-fields in my
heterogeneous environment. This extra fields only need additional space
in my index, which is a disadvantage for me.
How can I specify arbitrary xml-elements, which should be indexed in my
one and only field “text”. I have no need of additional fields in my
index.
Any help is appreciated.
Thomas