Big Data Developer for Arrow Electronics, Inc.

2016-05-16 Thread Bin Wang
ented backend developer who really knows and loves Solr and API design. Here <https://www.linkedin.com/pulse/join-arrow-big-data-team-bin-wang?trk=pulse_spock-articles> is a post that I shared on my LinkedIn and here <https://career4.successfactors.com/career?career_ns=job_listing

Re: Solrcloud Batch Indexing

2016-03-09 Thread Bin Wang
k isn't Solr at all, it's the data acquisition, > so be > careful when measuring that the Solr CPUs are pegged... otherwise > you're bottlenecking upstream of Solr. A super-simple way to figure that > out is to comment out the solrServer.add(list, 1) line in

Re: Solrcloud Batch Indexing

2016-03-07 Thread Bin Wang
inevitably adds complexity, what are you trying > to gain here that you can't get with several threads in a SolrJ client? > > Best, > Erick > > On Mon, Mar 7, 2016 at 12:28 PM, Bin Wang wrote: > > Hi there, > > > > I have a fairly big data set that I n

Re: Solr Deserialize/Read .fdt file

2016-03-07 Thread Bin Wang
ar with the Lucene (not > Solr) source code. > > If you want to trace through the code from Solr through Lucene, I suggest > you start with Solr unit tests in Eclipse. > > But none of that will be an appropriate topic for users on this (Solr) > list. > > > > -- Jack Kru

Solrcloud Batch Indexing

2016-03-07 Thread Bin Wang
Hi there, I have a fairly big data set that I need to quickly index into Solrcloud. I have done some research, and none of the options looked really good to me. (1) Kite Morphline: I managed to get it working, and the MapReduce job finished in a few minutes, which is good; however, it took a really long time, like

Solr Deserialize/Read .fdt file

2016-03-06 Thread Bin Wang
Hi there, I am interested in understanding all the files in the index folder. Here is a Stack Overflow question that I tried to follow, without success. Can anyone provide some sample code to help me get started? Best regards, Bin