Re: a way to marshall xml doc into a SolrInputDocument

2012-01-13 Thread jmuguruza
Chris Hostetter-3 wrote > > but you're the first person i've ever seen ask about > serializng to Solr's XML format on the client, then parse it again, then > send the SolrInputDocument to Solr (seems like a lot of > gratuitious serialize/desrialze/serialise/etc...) > -Hoss > Yes, , but I am

Re: a way to marshall xml doc into a SolrInputDocument

2012-01-12 Thread jmuguruza
even if they could (not sure if they could be done there, as they involve properly formatting some fields so dates are in correct format etc, and maybe the format is checked first) I would prefer to do it in the solrj side as the code will be much simpler for me. thanks -- View this message in co

a way to marshall xml doc into a SolrInputDocument

2012-01-12 Thread jmuguruza
If I have individual files in the expected Solr format (having just ONE doc per file): GB18030TEST Test with some GB18030 encoded characters No accents here ÕâÊÇÒ»¸ö¹¦ÄÜ 0 Is not there a way to easily marshal that file into a SolrInputDocument? Do I have to do the pars

Re: Large data set or data corpus

2012-01-12 Thread jmuguruza
http://www.data.gov/ has lots of datasets available for free -- View this message in context: http://lucene.472066.n3.nabble.com/Large-data-set-or-data-corpus-tp3650316p3653154.html Sent from the Solr - User mailing list archive at Nabble.com.