Query results change

2016-01-15 Thread Brian Narsi
We have an index of 25 fields. Currently number of records in index is about 120,000. We are using parser: edismax qf: contains 8 fields fq: 1 field mm = 1 qs = 6 pf: containing g 3 fields bf: containing 1 field We have noticed that sometimes results change between two searches even if ever

Re: Query results change

2016-01-15 Thread Brian Narsi
inoy Dalal > wrote: > > You should try debugging such queries to see how exactly they're being > > executed. > > That will give you an idea as to why you're seeing the results you see. > > > > On Fri, 15 Jan 2016, 19:05 Brian Narsi wrote: > > >

Scaling DataImportHandler

2016-01-22 Thread Brian Narsi
What are the various ways DataImportHandler can be scaled? Thanks

Re: Query results change

2016-01-25 Thread Brian Narsi
xactly the same)? Thanks On Fri, Jan 15, 2016 at 3:12 PM, Brian Narsi wrote: > Data is indexed using Data Import Handler with clean=true, commit=true and > optimize=true. After that there are no updates or delete. > > The setup is SolrCloud with 2 shards and 2 replicas each. > > I

Re: SolrCloud replicas out of sync

2016-01-27 Thread Brian Narsi
This on the surface appears to be similar to an earlier thread by me: "Query results change" On Tue, Jan 26, 2016 at 4:32 PM, Jeff Wartes wrote: > > Ah, perhaps you fell into something like this then? > https://issues.apache.org/jira/browse/SOLR-7844 > > That says it’s fixed in 5.4, but that wou

Select distinct records

2016-02-11 Thread Brian Narsi
I am trying to select distinct records from a collection. (I need distinct name and corresponding id) I have tried using grouping and group format of simple but that takes a long time to execute and sometimes runs into out of memory exception. Another limitation seems to be that total number of gr

Re: Select distinct records

2016-02-11 Thread Brian Narsi
e: > > https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results > > On Thu, Feb 11, 2016 at 8:44 PM Brian Narsi wrote: > > > I am trying to select distinct records from a collection. (I need > distinct > > name and corresponding id) > > >

Re: Select distinct records

2016-02-11 Thread Brian Narsi
I have tried to use the Collapsing feature but it appears that it leaves duplicated records in the result set. Is that expected? Or any suggestions on working around it? Thanks On Thu, Feb 11, 2016 at 9:30 AM, Brian Narsi wrote: > I am using > > Solr 5.1.0 > > On Thu, Feb 11,

Re: Select distinct records

2016-02-11 Thread Brian Narsi
provide the details? > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Thu, Feb 11, 2016 at 12:02 PM, Brian Narsi wrote: > > > I have tried to use the Collapsing feature but it appears that it leaves > > duplicated records in the result set. > > >

Re: Select distinct records

2016-02-11 Thread Brian Narsi
gspot.com/ > > On Thu, Feb 11, 2016 at 12:28 PM, Brian Narsi wrote: > > > Ok I see that Collapsing features requires documents to be co-located in > > the same shard in SolrCloud. > > > > Could that be a reason for duplication? > > > > On Thu,

Document Routing based on clientid (and null clientid)

2016-02-14 Thread Brian Narsi
My current design: All clients data in a 2 shard 2 replica each 2 node solr cluster. The data contains records with both clientid having value and clientid=null (the clientid=null is used for search across all clients) When searching I use fq: clientid = null or clientid = searchParameterClientI

DIH error - Bad Request

2016-04-14 Thread Brian Narsi
We have solr 5.1.0 running for several months retrieving about 10.5 millions records with no issues and no errors or warning in logs. I checked several times and the number of records reported as processed in DIH was exactly the same number in the collection. Recently I reviewed logs and found out

Solr relevant results

2015-08-14 Thread Brian Narsi
In my documents there are several fields, but for example say there are three fields: Description - text - this variable text Code - string - always a single character Prefer - boolean User searches on Description. When returning results I have to order results as following: Code = C Code = B

Re: Solr relevant results

2015-08-14 Thread Brian Narsi
; such as buckets and top-n items within those. > > Regards, > Alex. > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: > http://www.solr-start.com/ > > > On 14 August 2015 at 15:10, Brian Narsi wrote: > > In my documents there are several

Re: Solr relevant results

2015-08-14 Thread Brian Narsi
the document is indexed. > > > > Regards, > >Alex. > > > > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: > > http://www.solr-start.com/ > > > > > > On 14 August 2015 at 23:52, Brian Narsi wrote: > >> Sear

Solr Query + vs AND

2015-08-21 Thread Brian Narsi
CustomerID is a string CustomerName is text I have a query like this (i.e. (CustomerID is NULL or CustomerID = 56789) and (CustomerName like smith)): ((-(CustomerID: [* TO *])) OR (CustomerID: 56789))AND(CustomerName:"smith") Number of results: 150 (Not correct) But when I change the query to

Using join vs flattening structure

2015-08-30 Thread Brian Narsi
I have read a lot about using flattened structures in solr (instead of relational). Looks like it is preferable to use flattened structure. But in our case we have to consider using (sort of) relational structure to keep index maintenance cost low. Does anyone have deeper insight into this? 1) W

Re: Using join vs flattening structure

2015-08-31 Thread Brian Narsi
alized etc., so there's > not much guidance we can give. > > I'll skip 4 > > 5 probably another month or two in Solr 5.4 > > Best, > Erick > > On Sun, Aug 30, 2015 at 6:59 PM, Brian Narsi wrote: > > I have read a lot about using flattened structures in

partial search EdgeNGramFilterFactory

2015-10-14 Thread Brian Narsi
I have the following fieldtype in my schema: and the following field: With the following data: SellerName:CARDINAL HEALTH When I do the following search q:SellerName:cardinal I get back the results with SellerName: CARDINAL HEALTH (correct) or I do the search q:SellerName:

Re: partial search EdgeNGramFilterFactory

2015-10-14 Thread Brian Narsi
ake some care to make sure you escaped spaces to get > the whole thing through the query parser though. > > Best, > Erick > > On Wed, Oct 14, 2015 at 11:03 AM, Brian Narsi wrote: > > I have the following fieldtype in my schema: > > > > > positionIncreme

Query differently or change fieldtype

2015-10-26 Thread Brian Narsi
I have the following field type on a field ClientName: For data where ClientName = st jude medical inc When querying I get the following: 1) st --> result = st jude medical inc (works correctly) 2) st j --> No results are returned (NOT correct) - Expect to find st jude medical i

Re: Query differently or change fieldtype

2015-10-26 Thread Brian Narsi
ry will tokenize st j to st and j > > 2015年10月26日星期一,Brian Narsi 写道: > > > I have the following field type on a field ClientName: > > > > > positionIncrementGap="100"> > > > > > > > > > maxGramSize="25"/&g

Sort not working as expected

2015-10-30 Thread Brian Narsi
I have a fieldtype setup as < tokenizer class="solr.StandardTokenizerFactory"/> When I sort on this field type in ascending order I am not getting results sorted alphabetically as expected. Why is that? What should I do to get the sort on? Thanks

Re: Sort not working as expected

2015-10-30 Thread Brian Narsi
Is there no way that the existing field can be used? On Fri, Oct 30, 2015 at 1:42 PM, Ray Niu wrote: > you should use string type instead of text if you want to sort > alphabetically > > 2015-10-30 11:12 GMT-07:00 Brian Narsi : > > > I have a fieldtype setup as > >

phrase query

2015-11-04 Thread Brian Narsi
I have the following field type: I am trying to use dismax query parser (because it seems to have better phrase query support compared with standard query parser?) I have mm = 1 ps = 4 I have the following data indexed: 1) acute care pharmaceuticals 2) carefusion llc When q=acute

Solr results relevancy / scoring

2015-11-06 Thread Brian Narsi
I have a situation where. User search query q=15% Solr results contain several documents that are 15% 15% 15% 15% 15 (why?) 15% 15% I have debugged the query and can see that the score for 15 is higher than the ones below it. Why is that? Where can I read in detail about how the scoring is be

Re: Solr results relevancy / scoring

2015-11-10 Thread Brian Narsi
the question your asking is. You say > >> that you have debugged the query and the score for 15 is > >> higher than the ones below it. What's surprising about that? > >> > >> Are you saying you don't understand how the score is > >> calculated? Or

Data Import Handler / Backup indexes

2015-11-17 Thread Brian Narsi
I am using Data Import Handler to retrieve data from a database with full-import, clean = true, commit = true and optimize = true This has always worked correctly without any errors. But just to be on the safe side, I am thinking that we should do a backup before initiating Data Import Handler.

Re: Data Import Handler / Backup indexes

2015-11-17 Thread Brian Narsi
out : https://github.com/bloomreach/solrcloud-haft. This > helps backup solr indices across clusters. > > On Tue, Nov 17, 2015 at 7:08 AM, Brian Narsi wrote: > > > I am using Data Import Handler to retrieve data from a database with > > > > full-import, clean

Re: Data Import Handler / Backup indexes

2015-11-21 Thread Brian Narsi
e contents of ZK. I’m currently working on some parts of that. > > > Making a copy of a collection is supported too, with some caveats. > > > On 11/17/15, 10:20 AM, "Brian Narsi" wrote: > > >Sorry I forgot to mention that we are using SolrCloud 5.1.0. >

Re: Solrcloud with Zookeeper in production

2015-11-27 Thread Brian Narsi
solrconfig.xml, schema.xml are all loaded into the zookeeper. You can view the files loaded into zookeeper by clicking on Cloud on the home page of SolrCloud and then navigating to config folder. Below is a summary of steps to creating a collection: 1) Load the configuration files into zookeeper

Re: Solrcloud with Zookeeper in production

2015-11-28 Thread Brian Narsi
I suggest the documentation Apache Solr Reference Guide for general learning. https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files https://cwiki.apache.org/confluence/display/solr/Solr+Field+Types https://wiki.apache.org/solr/SolrConfigXml And if you run

Synonyms in Search Results and More Accurate Matches

2015-11-30 Thread Brian Narsi
I am using edismax with mm=1 and qs=6 I have a field type with synonyms attached to it. A sample synonym is: toothbrush tbrush For the following data: 1) Phillips toothbrush 2) Oral-B tbrush 3) Phillips Sonicare toothbrush If a user searches for q = tbrush I am getting 1), 3), 2) i.e. t

Re: Synonyms in Search Results and More Accurate Matches

2015-12-01 Thread Brian Narsi
I do not have synonyms enabled at query time. Below is my fieldtype On Tue, Dec 1, 2015 at 4:18 AM, Markus Jelsma wrote: > Hello - it looks like you have synonyms enabled at query time, which is > fine, but also means TF*IDF stats are different for tbrush and toothbrush, > causing t

Data Import Handler - Multivalued fields - splitBy

2015-12-04 Thread Brian Narsi
I have the following: I believe I had the following working (splitting on pipe delimited) But it does not work now. In-fact now I have even tried But I cannot get the values to split into an array. Any thoughts/suggestions what may be wrong? Thanks,

Re: Data Import Handler - Multivalued fields - splitBy

2015-12-04 Thread Brian Narsi
apache.org/solr/DataImportHandler#RegexTransformer for > more information. > > James Dyer > Ingram Content Group > > > -Original Message- > From: Brian Narsi [mailto:bnars...@gmail.com] > Sent: Friday, December 04, 2015 3:10 PM > To: solr-user@lucene.apache.

Long Running Data Import Handler - Notifications

2015-12-08 Thread Brian Narsi
Is there a way to receive notifications when a Data Import Handler finishes up and whether it succeeded or failed. (typically runs about an hour) Thanks

NullPointerException - Data Import Handler - TimeZone

2016-05-12 Thread Brian Narsi
We are getting the following error: Full Import failed:java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.NullPointerException at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) at org.apache.solr.handler.dataimport.DataI

Data Import Handler - maximum?

2016-12-11 Thread Brian Narsi
We are using Solr 5.1.0 and DIH to build index. We are using DIH with clean=true and commit=true and optimize=true. Currently retrieving about 10.5 million records in about an hour. I will like to find from other member's experiences as to how long can DIH run with no issues? What is the maximum

Stop Solr Node (in distress)?

2016-12-19 Thread Brian Narsi
We have had a situation where Solr node was in distress due to hard drive being full and the queries became very slow. Since our Solr cluster has two nodes with indexes being fully available on both the nodes, we think that one good solution would be to just stop the Solr instance on a distressed n