You can unmarshall the xml docs using jaxb and use the pojo adding
capabilities of solr to index the doc. You may need to create the classes
from the schema using xjc tool.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Using-EmbeddedSolrServer-with-static-documents-tp2767614
OK, you're still not quite on the right track. You can't just
index XML documents without transforming them into
valid Solr XML documents. Ditto for HTML.
Take a look at the ExtractingRequestHandler documentation at:
http://wiki.apache.org/solr/ExtractingRequestHandler
Here's some more documentat
Hi Erick,
thanx for getting back to me.
"Well, what is "a document on the filesystem"? Solr deals
with well-formed XML documents of a specific format."
I would like to index all kinds of documents. For a start I'll be happy to
be able to work with xml and html documents.
--
View this message in
Well, what is "a document on the filesystem"? Solr deals
with well-formed XML documents of a specific format. You
can't just stream a random file to Solr. Specifically
documents look like:
value for field
.
.
.
perhaps with an .
There are ways for structured documents to be added using the
T