[job-post] Looking for developers and managers - OpenSource @AWS

2020-04-21 Thread Anirudha Jadhav
Hi Folks, I have worked with Lucene and Solr since 1.4 and have lately been involved in building on core Lucene for ML, SQL and other query engines. Want to work on 100% open source software while working at Amazon? Let’s chat! Backend SQL engine development (https://lnkd.in/dGA96vv) Frontend

Re: Spread SolrCloud across two locations

2017-05-24 Thread Anirudha Jadhav
The latest zk supports auto reconfigure. Keep one DC as quorum and the other as observers. When a DC goes down, initiate a zk reconfigure action to flip quorum and observers. When I tested this, Solr survived just fine, but it has been a while. Ani On Wed, May 24, 2017 at 6:35 PM Pushkar Raste wrote: >
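
A minimal sketch of the "flip quorum and observers" step, assuming ZooKeeper 3.5+ dynamic-configuration syntax for member lines. The hostnames, server ids, DC labels and default ports are made up for illustration, and the sketch only formats the new membership list; whether a reconfig can actually be committed while the old quorum DC is down depends on the setup and is not shown here.

```python
def server_spec(server_id, host, role,
                quorum_port=2888, election_port=3888, client_port=2181):
    """Format one member line in ZooKeeper dynamic-config syntax."""
    return f"server.{server_id}={host}:{quorum_port}:{election_port}:{role};{client_port}"


def flipped_membership(servers, failed_dc):
    """Demote servers in the failed DC to observers, promote all others."""
    specs = []
    for server_id, host, dc in servers:
        role = "observer" if dc == failed_dc else "participant"
        specs.append(server_spec(server_id, host, role))
    return specs


# Hypothetical ensemble: (server id, hostname, data center).
SERVERS = [
    (1, "zk1.dc-a.example", "dc-a"),
    (2, "zk2.dc-a.example", "dc-a"),
    (3, "zk3.dc-a.example", "dc-a"),
    (4, "zk4.dc-b.example", "dc-b"),
    (5, "zk5.dc-b.example", "dc-b"),
    (6, "zk6.dc-b.example", "dc-b"),
]

new_members = flipped_membership(SERVERS, failed_dc="dc-a")
print(",".join(new_members))
```

The comma-separated list is in the shape that the zkCli `reconfig -members` command accepts.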

Re: Separate network interfaces for inter-node communication and update/search requests?

2015-06-11 Thread Anirudha Jadhav
Modern network interfaces are pretty capable. I doubt this optimization would yield any performance improvement, though I would love to see test results that prove me wrong. Is performance the primary reason for this, or do you have other reasons? -Ani On Thu, Jun 11, 2015 at 9:04 AM,

Re: Solr Cloud

2015-05-04 Thread Anirudha Jadhav
The JMX metrics are good; you can start there. Let's talk offline for more. -Ani On Mon, May 4, 2015 at 10:51 PM, Jilani Shaik wrote: > Thanks Shawn, It has provided the pointers of open source, I am really > interested to look for open source solution, I have basic knowledge of > Ganglia and Nag

WordDelimiterFilter issue and suggested fix

2014-04-09 Thread Anirudha Jadhav
indexed term: bdeut_NullableValue; query term 1: bdeut_nullablevalue (lowercase); query term 2: nullablevalue (lowercase). Current field type definition, in order: Whitespace Tokenizer, Word Delimiter Filter, Lowercase Filter. Current analysis output: [INDEX]schema_field WT text bdeut_NullableValue
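
To make the mismatch concrete, here is a simplified model of the analysis chain described above (whitespace tokenizer, word delimiter, lowercase). It is not Lucene's WordDelimiterFilter: it only splits on non-alphanumeric characters and lower-to-upper case changes, and it ignores options such as catenateWords or preserveOriginal that would emit extra tokens.

```python
import re


def word_delimiter(token):
    """Split on non-alphanumeric delimiters, then on lower-to-upper case changes."""
    parts = []
    for chunk in re.split(r"[^A-Za-z0-9]+", token):
        if chunk:
            # Insert a space at each lower->upper boundary, then split on it.
            parts.extend(re.sub(r"(?<=[a-z])(?=[A-Z])", " ", chunk).split())
    return parts


def analyze(text):
    """Whitespace tokenizer -> word delimiter -> lowercase, in that order."""
    terms = []
    for token in text.split():
        terms.extend(part.lower() for part in word_delimiter(token))
    return terms


indexed = analyze("bdeut_NullableValue")   # ['bdeut', 'nullable', 'value']
query_1 = analyze("bdeut_nullablevalue")   # ['bdeut', 'nullablevalue']
query_2 = analyze("nullablevalue")         # ['nullablevalue']
print(indexed, query_1, query_2)
```

Under this model the lowercase queries never see a case change, so they produce the term `nullablevalue`, which the index side never emits.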

[ JOB ] - Search Specialist, Bloomberg LP [ NY and London ]

2014-04-04 Thread Anirudha Jadhav
http://jobs.bloomberg.com/job/New-York-Search-Technology-Specialist-Job-NY/45497500/ http://jobs.bloomberg.com/job/London-R&D-News-Search-Backend-Developer-Job/50463600/ Keeping it short here; feel free to contact me with more questions. -- Anirudha P. Jadhav

Re: Listing Collections in Solr Cloud

2013-11-18 Thread Anirudha Jadhav
I'll create one, and I'll reach out if I need anything. Thanks. On Mon, Nov 18, 2013 at 2:13 PM, Mark Miller wrote: > We should have a list command in the collections api. I can help if > someone wants to make a JIRA issue. > > - Mark > > On Nov 18, 2013, at 2:1

Re: Listing Collections in Solr Cloud

2013-11-18 Thread Anirudha Jadhav
You can do this in two ways. 1. With the ZK client API, do a get_children on the zk node /collections/ to get all collections. 2. Without the ZK client API, point this URL at your SolrCloud install: http://host:port/solr/zookeeper?detail=true&path=%2Fcollections&_=1384801522456
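
Both options can be sketched in a few lines. This assumes the third-party `kazoo` client for the ZooKeeper route (any ZK client with a get-children call would do) and leaves out the `_` cache-busting parameter from the URL; the function names are illustrative only.

```python
from urllib.parse import urlencode


def collections_url(host, port):
    """Build the /solr/zookeeper URL that lists the /collections node."""
    params = urlencode({"detail": "true", "path": "/collections"})
    return f"http://{host}:{port}/solr/zookeeper?{params}"


def list_collections_via_zk(zk_hosts):
    """Return the children of /collections using a ZooKeeper client."""
    # Imported lazily so the URL helper works without kazoo installed.
    from kazoo.client import KazooClient

    zk = KazooClient(hosts=zk_hosts)
    zk.start()
    try:
        return sorted(zk.get_children("/collections"))
    finally:
        zk.stop()


print(collections_url("localhost", 8983))
```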

Re: coordination factor in between query terms

2013-08-28 Thread Anirudha Jadhav
My bad, typo there: q=termA termB. I know omitNorms is an index-time field option; can it be applied to the query as well? Are there other solutions to this kind of problem? Curious. On Wed, Aug 28, 2013 at 4:52 PM, Chris Hostetter wrote: > > 1) Coordination factor is controlled by the Similarity you

Re: coordination factor in between query terms

2013-08-28 Thread Anirudha Jadhav
show up higher in your results. > > http://wiki.apache.org/solr/SolrRelevancyCookbook#Boosting_Ranking_Terms > > - Greg > > -Original Message- > From: anirudh...@gmail.com [mailto:anirudh...@gmail.com] On Behalf Of > Anirudha Jadhav > Sent: Wednesday, Augu

coordination factor in between query terms

2013-08-28 Thread Anirudha Jadhav
How can I specify the coordination factor between query terms? E.g. q="termA termB", doc1 = {field: termA}, doc2 = {field: termA termB termC termD}. I want doc2 scored higher than doc1. -- Anirudha P. Jadhav
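
As a rough illustration of the two factors in play, the sketch below computes the coordination factor (matched query terms over total query terms) and the length norm (1 / sqrt(number of terms)) in the style of Lucene's classic TF-IDF similarity. It ignores tf, idf, boosts and the lossy byte encoding of norms, so it is a toy model rather than the real scoring code.

```python
import math


def coord(query_terms, doc_terms):
    """Fraction of distinct query terms that appear in the document."""
    unique_query = set(query_terms)
    overlap = sum(1 for term in unique_query if term in doc_terms)
    return overlap / len(unique_query)


def length_norm(doc_terms):
    """Classic length normalization: shorter fields get a larger norm."""
    return 1.0 / math.sqrt(len(doc_terms))


query = ["termA", "termB"]
doc1 = ["termA"]
doc2 = ["termA", "termB", "termC", "termD"]

for name, doc in (("doc1", doc1), ("doc2", doc2)):
    print(name, "coord =", coord(query, doc), "lengthNorm =", length_norm(doc))
```

Here coord favors doc2 (1.0 versus 0.5) while the length norm favors doc1 (1.0 versus 0.5), which is why norms come up when trying to rank doc2 first.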

Re: Of tlogs and atomic updates

2013-08-13 Thread Anirudha Jadhav
Quick question on a similar topic: an NRT call to index a doc returns a success code if and only if all available servers have successfully written the doc to their tlog, correct? On Tue, Aug 13, 2013 at 10:35 AM, Yonik Seeley wrote: > On Tue, Aug 13, 2013 at 10:11 AM, Erick Erickso

Re: SolrCloud: Programmatically create multiple collections?

2013-08-13 Thread Anirudha Jadhav
At this point you would need a higher-level service sitting on top of the Solr clusters, which also talks to your zk setup, in order to create custom collections on the fly. It's not super difficult, but it seems out of scope for SolrCloud for now. Let me know if others have a different opinion. thanks, Ani
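
One small piece of such a service might be generating Collections API CREATE requests in a loop. The sketch below only builds the URLs; the base URL, collection names and config name are hypothetical, and it assumes the named config set has already been uploaded to ZooKeeper by some other step.

```python
from urllib.parse import urlencode


def create_collection_url(base_url, name, config_name,
                          num_shards=1, replication_factor=1):
    """Build a Collections API CREATE URL for one collection."""
    params = urlencode({
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
        "collection.configName": config_name,
    })
    return f"{base_url}/admin/collections?{params}"


# Hypothetical tenants sharing one config set named "shared_conf".
urls = [
    create_collection_url("http://localhost:8983/solr", tenant, "shared_conf", num_shards=2)
    for tenant in ("tenant_a", "tenant_b")
]
for url in urls:
    print(url)
```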

Re: index merge question

2013-06-11 Thread Anirudha Jadhav
From my experience, the lucene mergeTool and the one invoked by coreAdmin are pure Lucene implementations and do not understand the concept of a unique key (a Solr-land concept). http://wiki.apache.org/solr/MergingSolrIndexes has a cautionary note at the end. We do frequent index merges for which

hostname -> ipaddress change in solr4.0 to solr4.1+

2013-05-22 Thread Anirudha Jadhav
Logging/UI used to show hostnames in 4.0; in 4.1+ it switched to IP addresses. Is this by design or a bug/side effect? It's pretty painful to look at IP addresses, so I am planning to change it. Let me know if you have any concerns. -- Anirudha

opinion: Stats over the faceting component

2013-03-20 Thread Anirudha Jadhav
I want to get an opinion here. Instead of having statistics as an independent component that is always limited in faceting features (e.g. it does not support date ranges, custom ranges, pivots, etc.), why not have a parameter on the facet component to compute and return stats? E.g. facet.stats=true,f

Re: Get page number of searchresult of a pdf in solr

2013-03-02 Thread Anirudha Jadhav
If you increase the granularity of your documents in the index to a single page instead of an entire PDF, it becomes an easy problem. Your description states that you are not searching for a term in a PDF; instead, you are searching for a term in a page of a PDF. I assume you load the pdf externa
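
A minimal sketch of page-level documents, assuming the per-page text has already been extracted by some external tool. The field names (`id`, `pdf_id`, `page`, `text`) are hypothetical and would need to match the actual schema.

```python
def page_documents(pdf_id, page_texts):
    """Turn a list of per-page strings into one index document per page."""
    return [
        {
            "id": f"{pdf_id}_p{number}",  # unique key per page
            "pdf_id": pdf_id,             # lets results be grouped back by PDF
            "page": number,               # the page number to show in results
            "text": text,
        }
        for number, text in enumerate(page_texts, start=1)
    ]


docs = page_documents("manual", ["intro text", "install steps", "faq"])
print(docs[1])
```

A hit on such a document carries its page number directly, so no post-processing of the PDF is needed at query time.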

Re: solr 4.1 - trying to create 2 collection with 2 different sets of configurations

2013-02-28 Thread Anirudha Jadhav
1. empty Zookeeper 2. empty index directories for solr 3. empty solr.xml 3.1 upload / link cfg in zookeeper for test collection 4. start 4 solr servers on different machines 5. Access server: i see that's ok 6. CREATE collection http://hostname:15000/solr/admin/collections?

Re: update fails if one doc is wrong

2013-02-26 Thread Anirudha Jadhav
Ideally you would want to use SolrJ or another interface which can catch exceptions/errors and retry them. On Tue, Feb 26, 2013 at 3:45 PM, Walter Underwood wrote: > I've done exactly the same thing. On error, set the batch size to one and > try again. > > wunder > > On Feb 26, 2013, at 12:27 PM,
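
The "on error, set the batch size to one and try again" approach can be sketched independently of any client library. Here `send` is a stand-in for whatever call posts a batch to Solr, and the sketch assumes that re-sending a document is harmless (for example because documents overwrite by unique key).

```python
def index_with_fallback(docs, send, batch_size=100):
    """Send docs in batches; if a batch fails, resend its docs one by one.

    Returns the documents that still failed when sent individually.
    """
    rejected = []
    for start in range(0, len(docs), batch_size):
        batch = docs[start:start + batch_size]
        try:
            send(batch)
        except Exception:
            # Isolate the bad document(s) by retrying with batch size one.
            for doc in batch:
                try:
                    send([doc])
                except Exception:
                    rejected.append(doc)
    return rejected


# Fake sender for demonstration: rejects any batch containing a "bad" doc.
accepted = []


def fake_send(batch):
    if any(doc.get("bad") for doc in batch):
        raise ValueError("malformed document in batch")
    accepted.extend(batch)


docs = [{"id": 1}, {"id": 2, "bad": True}, {"id": 3}]
rejected = index_with_fallback(docs, fake_send, batch_size=3)
print("accepted:", accepted, "rejected:", rejected)
```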

Re: zk Config URL?

2013-02-25 Thread Anirudha Jadhav
SolrCloud reads Solr cfg files from ZooKeeper. You need to push the cfg to ZooKeeper and link the collection to the cfg. This is exactly what Mark suggested earlier in the thread. It is also explained in the SolrCloud wiki. On Monday, February 25, 2013, Darren Govoni wrote: > Hi Mark, > >I download la

suggestions w.r.t Issue with Collections API in 4.1

2013-02-14 Thread Anirudha Jadhav
1. empty Zookeeper 2. empty index directories for solr 3. empty solr.xml 3.1 upload / link cfg in zookeeper for test collection 4. start 4 solr servers on different machines 5. Access server: i see that's ok 6. CREATE collection http://hostname:15000/solr/admin/collections?a

SolrCloud : $SOLR_HOME/solr.xml

2013-02-13 Thread Anirudha Jadhav
Is there a strong reason why we still need solr.xml on disk, and why it cannot be persisted in and used from ZooKeeper? thanks, -- Anirudha P. Jadhav

Re: Difference between IntField and TrieIntField in Lucene 4.0

2013-01-12 Thread Anirudha Jadhav
Here is a good post listing the details: http://searchhub.org/2009/05/13/exploring-lucene-and-solrs-trierange-capabilities/ You can also read up on the trie data structure on Wikipedia. You should almost always use the TrieIntField; the default int field in Solr is solr.TrieIntField. On Sat, J

Re: Restoring SolrCloud state in ZooKeeper?

2013-01-12 Thread Anirudha Jadhav
We don't back up clusterstate.json, but regenerate it (the failed parts) based on ping requests to all servers in our farm, and use the ZKClient C++ binding to update it. The actual implementation and logic are a bit more complex, and for all normal apps this is not necessary. We are just more paranoid about fai

Re: ZooKeeper ensemble behind load balancer

2012-12-29 Thread Anirudha Jadhav
A ZooKeeper ensemble should be fairly reliable, with a large enough number of machines (3+, typically 5, 7 or 9) for a quorum. So adding a load balancer on top will just add a hop and decrease performance, and also add a failure point to the system. That being said, there needs to be a way to provide solr with a
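
The odd ensemble sizes follow from majority-quorum arithmetic, which a short sketch can show. This is plain arithmetic about majorities, not a ZooKeeper API.

```python
def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    """How many servers can be lost while a majority remains."""
    return ensemble_size - quorum_size(ensemble_size)


for size in (3, 4, 5, 7, 9):
    print(size, "servers: quorum", quorum_size(size),
          "tolerates", tolerated_failures(size), "failures")
```

An even-sized ensemble tolerates no more failures than the odd size just below it (4 servers tolerate 1 failure, the same as 3), which is why odd sizes are the usual choice.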

Re: Question on WordDelimiterFilterFactory use

2012-12-26 Thread Anirudha Jadhav
try this to see the results yourself http://lucidworks.lucidimagination.com/display/solr/Running+Your+Analyzer On Wed, Dec 26, 2012 at 3:36 PM, Dmitry Kan wrote: > Hi, > > Have you tried looking at admin analysis page? You can see how i-pod gets > indexed and highlight query results there too.

Re: facet query

2012-12-25 Thread Anirudha Jadhav
Please see http://wiki.apache.org/solr/SimpleFacetParameters for more details On Friday, December 21, 2012, hank williams wrote: > Great, thank you. > > > Date: Fri, 21 Dec 2012 14:42:13 +0100 > > From: r@solr.pl > > To: solr-user@lucene.apache.org > > Subject: Re: facet query > > > > Hello

Re: Solr Cloud 4.0 Production Ready?

2012-12-19 Thread Anirudha Jadhav
I am curious to know what issues you are facing with 3.5 and large indices. We use 3.5 with 200g indices with no issues until now -Ani On Tuesday, December 18, 2012, Luis Cappa Banda wrote: > Any idea about when Solr 4.1 will be released? > > 2012/12/18 Otis Gospodnetic > > > > Hi, > > > > If yo

surround parser not working for span queries

2012-11-29 Thread Anirudha Jadhav
I was trying to port the surround parser in 4.0 to 3.5. After getting the plugin to work, I am not able to get the following results: http://localhost:8983/solr/collection1/select?q=_query_:{!surround}features:(document3w shiny) This works on 4.0 but not on 3.5 with the plugin installed. 3.5 query http:

Re: SPAN queries in solr

2012-11-25 Thread Anirudha Jadhav
thanks Erik. Nested Queries help. I'll keep working on the XML Query parser on the side. -Ani On Sun, Nov 25, 2012 at 5:18 PM, Erik Hatcher wrote: > _query_: -- Anirudha P. Jadhav

Re: SPAN queries in solr

2012-11-25 Thread Anirudha Jadhav
ng it to 3.6 and 4.0. and implement all suggestions from solr-839. On Fri, Nov 23, 2012 at 6:14 PM, Anirudha Jadhav wrote: > Can this be made to work with solr 3.5 ? > i will give it a try. > > Thanks > > On Nov 23, 2012, at 17:28, simon wrote: > > > take a look a

Re: SPAN queries in solr

2012-11-23 Thread Anirudha Jadhav
2012 at 3:38 PM, Anirudha Jadhav wrote: > >> What is the best way to use span queries in solr ? >> >> I see https://issues.apache.org/jira/browse/SOLR-839 which enables the XML >> Query parser that supports span queries. >> >> >> >> -- >> Anirudha P. Jadhav >>

SPAN queries in solr

2012-11-23 Thread Anirudha Jadhav
What is the best way to use span queries in solr ? I see https://issues.apache.org/jira/browse/SOLR-839 which enables the XML Query parser that supports span queries. -- Anirudha P. Jadhav

Re: Internal Vs. External ZooKeeper

2012-11-14 Thread Anirudha Jadhav
just fine - we simply recommend the other way to save > headaches. If you know what you are getting into, it's certainly a valid > choice. > > - Mark > > > On 11/11/2012 05:11 PM, Anirudha Jadhav wrote: > >> let me see if i get this correctly, >> >> g

Re: Internal Vs. External ZooKeeper

2012-11-11 Thread Anirudha Jadhav
Let me see if I get this correctly: the greater the number of ZooKeeper nodes, the more time it takes to come to a consensus. During an indexing operation, how many times does a Solr client need to contact ZooKeeper for consensus? Per doc? Per commit? thanks, Ani On Sun, Nov 11, 2012 at 11:1

Re: Questions about HttpSolrServer

2012-10-23 Thread Anirudha Jadhav
Try reading up on LBHttpSolrServer. We have a layer on top of Solr to manage such scenarios. On Tue, Oct 23, 2012 at 4:52 PM, Benjamin, Roy wrote: > Assuming one has hundreds of Solr nodes should an indexing application > pool HttpSolrServer instances ala a connection pool ? > > Thanks > Roy

Looking for a Senior Search Architect/Developer - Bloomberg LP (NYC)

2012-05-03 Thread Anirudha Jadhav
reference. Feel free to contact me if you have any more questions. C/C++ advanced knowledge is good to have but not a requirement. http://careers.bloomberg.com/hire/jobs/job32519.html thanks, Anirudha Jadhav Sr. Developer Bloomberg LP

where do i find xsd schemas for MultiCore Solr responses

2011-09-08 Thread Anirudha Jadhav
Hi, I am using a separate task to monitor Solr instances. Where do I find XSD schemas for MultiCore Solr responses in order to parse them correctly? thanks, Ani eg. XML " 016 core0multicore\core0\multicore\core0\data\2011-09-07T21:36:53.864Z 67193861001313764498766 falsetruefalse
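
Absent a schema, a monitoring task can still parse the response by relying on Solr's generic XML response shape (nested `lst` elements with `name` attributes). The sketch below uses a hand-written sample, not the response from the message above, and assumes the usual CoreAdmin STATUS layout; the values in it are made up.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the usual CoreAdmin STATUS layout (values are made up).
SAMPLE = """<response>
  <lst name="responseHeader">
    <int name="status">0</int>
    <int name="QTime">5</int>
  </lst>
  <lst name="status">
    <lst name="core0">
      <str name="name">core0</str>
      <str name="instanceDir">multicore/core0/</str>
      <date name="startTime">2011-01-01T00:00:00Z</date>
    </lst>
  </lst>
</response>"""


def core_status(xml_text):
    """Map each core name to its top-level scalar fields."""
    root = ET.fromstring(xml_text)
    cores = {}
    for core in root.findall("./lst[@name='status']/lst"):
        cores[core.get("name")] = {
            child.get("name"): child.text
            for child in core
            if child.tag != "lst"  # skip nested sections such as index stats
        }
    return cores


status = core_status(SAMPLE)
print(status)
```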

Lazy loading of Cores

2011-07-14 Thread Anirudha Jadhav
Is this implemented? I was just looking at the doc-wiki and was not sure if it was up to date. Are there any specific resources for someone new to start working on and contributing to the Solr code, viz. current architecture, steps to compile and build from source? thanks a lot -- Anirudha