Re: Fastest way to use solrj

2010-01-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
The binary format just reduces overhead. in your case , all the data is in the big text field which is not compressed. But overall, the parsing is a lot faster for the binary format. So you see a perf boost 2010/1/27 Tim Terlegård : > I have 6 fields. The text field is the biggest, it contains alm

Re: Fastest way to use solrj

2010-01-27 Thread Tim Terlegård
I have 6 fields. The text field is the biggest, it contains almost all of the 5000 chars. /Tim 2010/1/27 Noble Paul നോബിള്‍ नोब्ळ् : > how many fields are there in each doc? the binary format just reduces > overhead. it does not touch/compress the payload > > 2010/1/27 Tim Terlegård : >> I have

Re: Fastest way to use solrj

2010-01-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
how many fields are there in each doc? the binary format just reduces overhead. it does not touch/compress the payload 2010/1/27 Tim Terlegård : > I have 3 millon documents, each having 5000 chars. The xml file is > about 15GB. The binary file is also about 15GB. > > I was a bit surprised about th

Re: Fastest way to use solrj

2010-01-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
if you write only a few docs you may not observe much difference in size. if you write large no:of docs you may observe a big difference. 2010/1/27 Tim Terlegård : > I got the binary format to work perfectly now. Performance is better > than with xml. Thanks! > > Although, it doesn't look like a b

Re: Fastest way to use solrj

2010-01-20 Thread Tim Terlegård
Yes, it worked! Thank you very much. But do I need to use curl or can I use CommonsHttpSolrServer or StreamingUpdateSolrServer? If I can't use BinaryWriter then I don't know how to do this. /Tim 2010/1/20 Noble Paul നോബിള്‍ नोब्ळ् : > 2010/1/20 Tim Terlegård : > BinaryRequestWriter does not

Re: Fastest way to use solrj

2010-01-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
2010/1/20 Tim Terlegård : BinaryRequestWriter does not read from a file and post it >>> >>> Is there any other way or is this use case not supported? I tried this: >>> >>> $ curl /solr/update/javabin -F stream.file=/tmp/data.bin >>> $ curl /solr/update -F stream.body=' ' >>> >>> Solr did read

Re: Fastest way to use solrj

2010-01-19 Thread Tim Terlegård
2010/1/19 Noble Paul നോബിള്‍ नोब्ळ् : > 2010/1/19 Tim Terlegård : >>    server = new CommonsHttpSolrServer("http://localhost:8983/solr";) >>    server.setRequestWriter(new BinaryRequestWriter()) >>    request = new UpdateRequest() >>    request.setAction(UpdateRequest.ACTION.COMMIT, true, true); >

Re: Fastest way to use solrj

2010-01-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
2010/1/19 Tim Terlegård : > There are a few ways to use solrj. I just learned that I can use the > javabin format to get some performance gain. But when I try the binary > format nothing is added to the index. This is how I try to use this: > >    server = new CommonsHttpSolrServer("http://localhos