The binary format just reduces overhead. in your case , all the data
is in the big text field which is not compressed. But overall, the
parsing is a lot faster for the binary format. So you see a perf boost
2010/1/27 Tim Terlegård :
> I have 6 fields. The text field is the biggest, it contains alm
I have 6 fields. The text field is the biggest, it contains almost all
of the 5000 chars.
/Tim
2010/1/27 Noble Paul നോബിള് नोब्ळ् :
> how many fields are there in each doc? the binary format just reduces
> overhead. it does not touch/compress the payload
>
> 2010/1/27 Tim Terlegård :
>> I have
how many fields are there in each doc? the binary format just reduces
overhead. it does not touch/compress the payload
2010/1/27 Tim Terlegård :
> I have 3 millon documents, each having 5000 chars. The xml file is
> about 15GB. The binary file is also about 15GB.
>
> I was a bit surprised about th
if you write only a few docs you may not observe much difference in
size. if you write large no:of docs you may observe a big difference.
2010/1/27 Tim Terlegård :
> I got the binary format to work perfectly now. Performance is better
> than with xml. Thanks!
>
> Although, it doesn't look like a b
Yes, it worked! Thank you very much. But do I need to use curl or can
I use CommonsHttpSolrServer or StreamingUpdateSolrServer? If I can't
use BinaryWriter then I don't know how to do this.
/Tim
2010/1/20 Noble Paul നോബിള് नोब्ळ् :
> 2010/1/20 Tim Terlegård :
> BinaryRequestWriter does not
2010/1/20 Tim Terlegård :
BinaryRequestWriter does not read from a file and post it
>>>
>>> Is there any other way or is this use case not supported? I tried this:
>>>
>>> $ curl /solr/update/javabin -F stream.file=/tmp/data.bin
>>> $ curl /solr/update -F stream.body=' '
>>>
>>> Solr did read
2010/1/19 Noble Paul നോബിള് नोब्ळ् :
> 2010/1/19 Tim Terlegård :
>> server = new CommonsHttpSolrServer("http://localhost:8983/solr";)
>> server.setRequestWriter(new BinaryRequestWriter())
>> request = new UpdateRequest()
>> request.setAction(UpdateRequest.ACTION.COMMIT, true, true);
>
2010/1/19 Tim Terlegård :
> There are a few ways to use solrj. I just learned that I can use the
> javabin format to get some performance gain. But when I try the binary
> format nothing is added to the index. This is how I try to use this:
>
> server = new CommonsHttpSolrServer("http://localhos