: The schema is a copy of the techproducts sample.
: 
: Entire include here - and I take your point about the possibility of
: malformation - thanks.
: 
: I assumed (perhaps wrongly) that I could duplicate the <schema ...>
:  </schema> arrangement from the schema.xml file.

I really can't make heads or tails of what you're saying here -- what i 
asked you to provide was the full details on your schema.xml file and the 
file you are xincluding into it -- what you provided looks like a normal 
schema.xml, w/o any xinclude tags.  I also don't see any mention of any 
other files you are attempting to include in your schema.xml so i can see 
what it's structure looks like.

For comparison, here is an example of a schema.xml file that uses an 
xinclude elements...
https://github.com/apache/lucene-solr/blob/master/solr/core/src/test-files/solr/collection1/conf/schema-xinclude.xml

Here is a specific example of one xinclude element from that file...
<xi:include href="schema-snippet-type.xml"/>

here is the file that is included by that xinclude element...
https://github.com/apache/lucene-solr/blob/master/solr/core/src/test-files/solr/collection1/conf/schema-snippet-type.xml


...if you can provide the corrisponding specifics for your schema -- 
showing the xinclude elements, and the files refrenced by them -- we can 
try to help make sense of why it's not working for you.



: 
: I'm unfamiliar with xml entity includes, but I'll go look them up...
: 
: <?xml version="1.0" encoding="UTF-8" ?>
: <schema name="example" version="1.6">
: 
:    <!-- ngram field to support suggestions / lookahead search on title (and
: category, contentType)-->
:    <copyField source="foobar" dest="text"/>
:    <field name="suggestion_ngram_for_title" type="text_suggest_ngram"
: indexed="true" stored="false"/>
:    <field name="displayurl" type="text_general" indexed="true"
: stored="true" multiValued="false"/>
:    <field name="productVersionId" type="string" indexed="true"
: stored="true" multiValued="false"/>
:    <field name="caption" type="text_general" indexed="true" stored="true"
: multiValued="false"/>
:    <field name="documentId" type="string" indexed="true" stored="true"
: multiValued="false"/>
:    <!--<field name="category" type="string" indexed="true" stored="true"
: multiValued="true"/>-->
:    <field name="contentType" type="text_special_synonym" indexed="true"
: stored="true" multiValued="false"/>
:    <!-- Do NOT assume that much thought went into using int on the
: following field. This is testing only!-->
:    <field name="preference_" type="int" indexed="true" stored="true"
: multiValued="false"/>
: 
:    <field name="meta_doc_type" type="text_general" indexed="true"
: stored="true" multiValued="false"/>
:    <!--<field name="content" type="text_general" indexed="true"
: stored="true" multiValued="false"/>-->
: 
:    <!-- STATdx Weighting fields here. These are not part of the document,
: but are used to calculate relevancy scores -->
:    <field name="category_weight"  type="double" indexed="true"
:  stored="true"/>    <!-- used for rule one - weighting docs on general
: usefulness -->
: 
:    <!-- Main body of document extracted by SolrCell.
:         NOTE: This field is not indexed by default, since it is also copied
: to "text"
:         using copyField below. This is to save space. Use this field for
: returning and
:         highlighting document content. Use the "text" field to search the
: content. -->
:    <field name="content" type="text_en" indexed="false" stored="true"
: multiValued="true"/> *//HERE IS WHERE "CONTENT" IS DEFINED*
: 
: <!-- test for parsing statdx-provided html in content field. text_html has
: been modified to clean html -->
:    <field name="html_content" type="text_html" indexed="true" stored="true"
: multiValued="true"/>
: 
:    <!-- Text fields from SolrCell to search by default in our catch-all
: field -->
:    <copyField source="title" dest="text"/>
:    <copyField source="author" dest="text"/>
:    <copyField source="description" dest="text"/>
:    <copyField source="keywords" dest="text"/>
:    <copyField source="content" dest="text"/>  /*/THROWING ERROR ABOUT
: "CONTENT" NOT EXISTING HERE*
:    <copyField source="content_type" dest="text"/>
:    <copyField source="resourcename" dest="text"/>
:    <copyField source="url" dest="text"/>
: 
:    <!-- Create a string version of author for faceting -->
:    <copyField source="author" dest="author_s"/>
: 
:   <!-- Above, multiple source fields are copied to the [text] field.
:           Another way to map multiple source fields to the same
:           destination field is to use the dynamic field syntax.
:           copyField also supports a maxChars to copy setting.  -->
: 
:         <copyField source="*_en" dest="text"/>
: 
: 
:     <!-- a copy of text_general. Used to handle the rule that says that
: docs with "table"
:          and "tsm" in the contentType field should show at the top of
: results IF any of the
:          following terms are in the search term submitted by the user:
:          [TNM, AJCC, Stage, Staging, FIGO]   Note the special synonym file
: in the xml below.
:          Note to self: Expand this documentation if we end up adding more
: "special" synonyms -->
:     <fieldType name="text_special_synonym" class="solr.TextField"
: positionIncrementGap="100">
:       <analyzer type="index">
:         <tokenizer class="solr.StandardTokenizerFactory"/>
:         <filter class="solr.StopFilterFactory" ignoreCase="true"
: words="stopwords.txt" />
:                 <!-- in this example, we will only use synonyms at query
: time
:         <filter class="solr.SynonymFilterFactory"
: synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
:         -->
:         <filter class="solr.LowerCaseFilterFactory"/>
:       </analyzer>
:       <analyzer type="query">
:         <tokenizer class="solr.StandardTokenizerFactory"/>
:         <filter class="solr.StopFilterFactory" ignoreCase="true"
: words="stopwords.txt" />
:         <!-- Special synonym file here!!!!  -->
:         <filter class="solr.SynonymFilterFactory"
: synonyms="contentType_synonyms.txt" ignoreCase="true" expand="true"/>
:         <filter class="solr.LowerCaseFilterFactory"/>
:       </analyzer>
:     </fieldType>
: 
: </schema>
: 
: 
: 
: On Thu, Aug 4, 2016 at 3:55 PM, Chris Hostetter <hossman_luc...@fucit.org>
: wrote:
: 
: >
: > you mentioned that the problem only happens when you use xinclude, but you
: > havne't shown us hte details of your xinclude -- what exactly does your
: > schema.xml look like (with the xinclude call) and what exactly does the
: > file being included look like (entire contents)
: >
: > (I suspect the problem you are seeing is realted to the way xinclude
: > doens't really support "snippets" of malformed xml, and instead requires
: > some root tag -- i can't imagine what root tag you are using in the
: > included file that would play nicely with mixing/matching field
: > declarations. ... using xml entity includes may be a simpler/safer option)
: >
: >
: >
: > : Date: Thu, 4 Aug 2016 15:47:00 -0600
: > : From: John Bickerstaff <j...@johnbickerstaff.com>
: > : Reply-To: solr-user@lucene.apache.org
: > : To: solr-user@lucene.apache.org
: > : Subject: Re: Problems using fieldType text_general in copyField
: > :
: > : I would call this a bug...
: > :
: > : I'm going out on a limb and say that if you define a field in the
: > included
: > : XML file, you will get this error.
: > :
: > : As long as the field is defined first in schema.xml, you can "copyFIeld"
: > it
: > : or whatever in the include file, but apparently fields MUST be created in
: > : the schema.xml file.
: > :
: > : That makes use of the include for custom things somewhat moot - at least
: > in
: > : my situation.
: > :
: > : I'd love to be wrong by the way, but that's what my tests suggest right
: > : now...
: > :
: > : On Thu, Aug 4, 2016 at 1:37 PM, John Bickerstaff <
: > j...@johnbickerstaff.com>
: > : wrote:
: > :
: > : > Summary:
: > : >
: > : > Using xinclude to include an xml file into schema.xml
: > : >
: > : > The following line
: > : >
: > : > <copyField source="content" dest="text"/>
: > : >
: > : > generates an error:  about a field being "not a glob and not matching
: > an
: > : > explicit field" even though I declare the field in the line just above.
: > : >
: > : > This seems to happen only for for fieldType text_general?
: > : >
: > : > ============
: > : >
: > : > Explanation:
: > : >
: > : > I need a little help - keep getting an error when trying to use the
: > : > ability to include an additional XML file.  I may be overlooking
: > something,
: > : > but if so, I need help to see it.
: > : >
: > : > I have the following two lines which throw zero errors when part of
: > : > schema.xml:
: > : >
: > : > <field name="content" type="text_general" indexed="false" stored="true"
: > : > multiValued="true"/>
: > : >  <copyField source="content" dest="text"/>
: > : >
: > : > However, when I put this into an include file and use xinclude, then I
: > get
: > : > this error when starting Solr.
: > : >
: > : >
: > : >
: > : >    - *statdx_shard1_replica3:* org.apache.solr.common.
: > : >    SolrException:org.apache.solr.common.SolrException: Could not load
: > : >    conf for core statdx_shard1_replica3: Can't load schema schema.xml:
: > : >    copyField source :'content' is not a glob and doesn't match any
: > explicit
: > : >    field or dynamicField.
: > : >
: > : >
: > : > Given that I am defining the field in the line right above the
: > copyField
: > : > statement, I'm confused about why this works fine in schema.xml but
: > NOT in
: > : > an included file.
: > : >
: > : > I experimented and found that any field of type "text_general" will
: > throw
: > : > this same error if it is part of the included xml file.  Other
: > fieldTypes
: > : > that I tried (string, int, double) did not have this issue.
: > : >
: > : > I'm using Solr 5.4, although I'm pulling custom config into an included
: > : > file for purposes of moving to 6.1
: > : >
: > : > I have the following list of copyField commands in the included xml
: > file,
: > : > and get no errors on any but the "content" one.  It just so happens
: > that
: > : > "content" is the only field of type "text_general" in there.
: > : >
: > : >
: > : > Any hints greatly appreciated.
: > : >
: > : >   <copyField source="title" dest="text"/>
: > : >    <copyField source="author" dest="text"/>
: > : >    <copyField source="description" dest="text"/>
: > : >    <copyField source="keywords" dest="text"/>
: > : >    <copyField source="content" dest="text"/>
: > : >    <copyField source="content_type" dest="text"/>
: > : >    <copyField source="resourcename" dest="text"/>
: > : >    <copyField source="url" dest="text"/>
: > : >
: > : >
: > :
: >
: > -Hoss
: > http://www.lucidworks.com/
: >
: 

-Hoss
http://www.lucidworks.com/

Reply via email to