Re: Using EmbeddedSolrServer with static documents

2011-04-04 Thread vinodreddyr17
You can unmarshall the xml docs using jaxb and use the pojo adding capabilities of solr to index the doc. You may need to create the classes from the schema using xjc tool. -- View this message in context: http://lucene.472066.n3.nabble.com/Using-EmbeddedSolrServer-with-static-documents-tp2767614

Re: Using EmbeddedSolrServer with static documents

2011-04-03 Thread Erick Erickson
OK, you're still not quite on the right track. You can't just index XML documents without transforming them into valid Solr XML documents. Ditto for HTML. Take a look at the ExtractingRequestHandler documentation at: http://wiki.apache.org/solr/ExtractingRequestHandler Here's some more documentat

Re: Using EmbeddedSolrServer with static documents

2011-04-03 Thread michael.i
Hi Erick, thanx for getting back to me. "Well, what is "a document on the filesystem"? Solr deals with well-formed XML documents of a specific format." I would like to index all kinds of documents. For a start I'll be happy to be able to work with xml and html documents. -- View this message in

Re: Using EmbeddedSolrServer with static documents

2011-04-03 Thread Erick Erickson
Well, what is "a document on the filesystem"? Solr deals with well-formed XML documents of a specific format. You can't just stream a random file to Solr. Specifically documents look like: value for field . . . perhaps with an . There are ways for structured documents to be added using the T