We have an index of 25 fields. Currently number of records in index is
about 120,000. We are using
parser: edismax
qf: contains 8 fields
fq: 1 field
mm = 1
qs = 6
pf: containing g 3 fields
bf: containing 1 field
We have noticed that sometimes results change between two searches even if
ever
inoy Dalal
> wrote:
> > You should try debugging such queries to see how exactly they're being
> > executed.
> > That will give you an idea as to why you're seeing the results you see.
> >
> > On Fri, 15 Jan 2016, 19:05 Brian Narsi wrote:
> >
>
What are the various ways DataImportHandler can be scaled?
Thanks
xactly the same)?
Thanks
On Fri, Jan 15, 2016 at 3:12 PM, Brian Narsi wrote:
> Data is indexed using Data Import Handler with clean=true, commit=true and
> optimize=true. After that there are no updates or delete.
>
> The setup is SolrCloud with 2 shards and 2 replicas each.
>
> I
This on the surface appears to be similar to an earlier thread by me: "Query
results change"
On Tue, Jan 26, 2016 at 4:32 PM, Jeff Wartes wrote:
>
> Ah, perhaps you fell into something like this then?
> https://issues.apache.org/jira/browse/SOLR-7844
>
> That says it’s fixed in 5.4, but that wou
I am trying to select distinct records from a collection. (I need distinct
name and corresponding id)
I have tried using grouping and group format of simple but that takes a
long time to execute and sometimes runs into out of memory exception.
Another limitation seems to be that total number of gr
e:
>
> https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results
>
> On Thu, Feb 11, 2016 at 8:44 PM Brian Narsi wrote:
>
> > I am trying to select distinct records from a collection. (I need
> distinct
> > name and corresponding id)
> >
>
I have tried to use the Collapsing feature but it appears that it leaves
duplicated records in the result set.
Is that expected? Or any suggestions on working around it?
Thanks
On Thu, Feb 11, 2016 at 9:30 AM, Brian Narsi wrote:
> I am using
>
> Solr 5.1.0
>
> On Thu, Feb 11,
provide the details?
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Thu, Feb 11, 2016 at 12:02 PM, Brian Narsi wrote:
>
> > I have tried to use the Collapsing feature but it appears that it leaves
> > duplicated records in the result set.
> >
>
gspot.com/
>
> On Thu, Feb 11, 2016 at 12:28 PM, Brian Narsi wrote:
>
> > Ok I see that Collapsing features requires documents to be co-located in
> > the same shard in SolrCloud.
> >
> > Could that be a reason for duplication?
> >
> > On Thu,
My current design:
All clients data in a 2 shard 2 replica each 2 node solr cluster.
The data contains records with both clientid having value and clientid=null
(the clientid=null is used for search across all clients)
When searching I use fq: clientid = null or clientid =
searchParameterClientI
We have solr 5.1.0 running for several months retrieving about 10.5
millions records with no issues and no errors or warning in logs. I checked
several times and the number of records reported as processed in DIH was
exactly the same number in the collection.
Recently I reviewed logs and found out
In my documents there are several fields, but for example say there are
three fields:
Description - text - this variable text
Code - string - always a single character
Prefer - boolean
User searches on Description.
When returning results I have to order results as following:
Code = C
Code = B
; such as buckets and top-n items within those.
>
> Regards,
> Alex.
>
> Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> http://www.solr-start.com/
>
>
> On 14 August 2015 at 15:10, Brian Narsi wrote:
> > In my documents there are several
the document is indexed.
> >
> > Regards,
> >Alex.
> >
> > Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> > http://www.solr-start.com/
> >
> >
> > On 14 August 2015 at 23:52, Brian Narsi wrote:
> >> Sear
CustomerID is a string
CustomerName is text
I have a query like this (i.e. (CustomerID is NULL or CustomerID = 56789)
and (CustomerName like smith)):
((-(CustomerID: [* TO *])) OR (CustomerID: 56789))AND(CustomerName:"smith")
Number of results: 150 (Not correct)
But when I change the query to
I have read a lot about using flattened structures in solr (instead of
relational). Looks like it is preferable to use flattened structure. But in
our case we have to consider using (sort of) relational structure to keep
index maintenance cost low.
Does anyone have deeper insight into this?
1) W
alized etc., so there's
> not much guidance we can give.
>
> I'll skip 4
>
> 5 probably another month or two in Solr 5.4
>
> Best,
> Erick
>
> On Sun, Aug 30, 2015 at 6:59 PM, Brian Narsi wrote:
> > I have read a lot about using flattened structures in
I have the following fieldtype in my schema:
and the following field:
With the following data:
SellerName:CARDINAL HEALTH
When I do the following search
q:SellerName:cardinal
I get back the results with SellerName: CARDINAL HEALTH (correct)
or I do the search
q:SellerName:
ake some care to make sure you escaped spaces to get
> the whole thing through the query parser though.
>
> Best,
> Erick
>
> On Wed, Oct 14, 2015 at 11:03 AM, Brian Narsi wrote:
> > I have the following fieldtype in my schema:
> >
> > > positionIncreme
I have the following field type on a field ClientName:
For data where
ClientName = st jude medical inc
When querying I get the following:
1) st --> result = st jude medical inc (works correctly)
2) st j --> No results are returned (NOT correct) - Expect to find st jude
medical i
ry will tokenize st j to st and j
>
> 2015年10月26日星期一,Brian Narsi 写道:
>
> > I have the following field type on a field ClientName:
> >
> > > positionIncrementGap="100">
> >
> >
> >
> > > maxGramSize="25"/&g
I have a fieldtype setup as
<
tokenizer class="solr.StandardTokenizerFactory"/>
When I sort on this field type in ascending order I am not getting results
sorted alphabetically as expected.
Why is that?
What should I do to get the sort on?
Thanks
Is there no way that the existing field can be used?
On Fri, Oct 30, 2015 at 1:42 PM, Ray Niu wrote:
> you should use string type instead of text if you want to sort
> alphabetically
>
> 2015-10-30 11:12 GMT-07:00 Brian Narsi :
>
> > I have a fieldtype setup as
> >
I have the following field type:
I am trying to use dismax query parser (because it seems to have better
phrase query support compared with standard query parser?)
I have mm = 1
ps = 4
I have the following data indexed:
1) acute care pharmaceuticals
2) carefusion llc
When
q=acute
I have a situation where.
User search query
q=15%
Solr results contain several documents that are
15%
15%
15%
15%
15 (why?)
15%
15%
I have debugged the query and can see that the score for 15 is higher than
the ones below it.
Why is that? Where can I read in detail about how the scoring is be
the question your asking is. You say
> >> that you have debugged the query and the score for 15 is
> >> higher than the ones below it. What's surprising about that?
> >>
> >> Are you saying you don't understand how the score is
> >> calculated? Or
I am using Data Import Handler to retrieve data from a database with
full-import, clean = true, commit = true and optimize = true
This has always worked correctly without any errors.
But just to be on the safe side, I am thinking that we should do a backup
before initiating Data Import Handler.
out : https://github.com/bloomreach/solrcloud-haft. This
> helps backup solr indices across clusters.
>
> On Tue, Nov 17, 2015 at 7:08 AM, Brian Narsi wrote:
>
> > I am using Data Import Handler to retrieve data from a database with
> >
> > full-import, clean
e contents of ZK. I’m currently working on some parts of that.
>
>
> Making a copy of a collection is supported too, with some caveats.
>
>
> On 11/17/15, 10:20 AM, "Brian Narsi" wrote:
>
> >Sorry I forgot to mention that we are using SolrCloud 5.1.0.
>
solrconfig.xml, schema.xml are all loaded into the zookeeper.
You can view the files loaded into zookeeper by clicking on Cloud on the
home page of SolrCloud and then navigating to config folder.
Below is a summary of steps to creating a collection:
1) Load the configuration files into zookeeper
I suggest the documentation Apache Solr Reference Guide for general
learning.
https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files
https://cwiki.apache.org/confluence/display/solr/Solr+Field+Types
https://wiki.apache.org/solr/SolrConfigXml
And if you run
I am using edismax with mm=1 and qs=6
I have a field type with synonyms attached to it.
A sample synonym is:
toothbrush tbrush
For the following data:
1) Phillips toothbrush
2) Oral-B tbrush
3) Phillips Sonicare toothbrush
If a user searches for
q = tbrush
I am getting
1), 3), 2)
i.e. t
I do not have synonyms enabled at query time. Below is my fieldtype
On Tue, Dec 1, 2015 at 4:18 AM, Markus Jelsma
wrote:
> Hello - it looks like you have synonyms enabled at query time, which is
> fine, but also means TF*IDF stats are different for tbrush and toothbrush,
> causing t
I have the following:
I believe I had the following working (splitting on pipe delimited)
But it does not work now.
In-fact now I have even tried
But I cannot get the values to split into an array.
Any thoughts/suggestions what may be wrong?
Thanks,
apache.org/solr/DataImportHandler#RegexTransformer for
> more information.
>
> James Dyer
> Ingram Content Group
>
>
> -Original Message-
> From: Brian Narsi [mailto:bnars...@gmail.com]
> Sent: Friday, December 04, 2015 3:10 PM
> To: solr-user@lucene.apache.
Is there a way to receive notifications when a Data Import Handler finishes
up and whether it succeeded or failed. (typically runs about an hour)
Thanks
We are getting the following error:
Full Import failed:java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException:
java.lang.NullPointerException
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)
at
org.apache.solr.handler.dataimport.DataI
We are using Solr 5.1.0 and DIH to build index.
We are using DIH with clean=true and commit=true and optimize=true.
Currently retrieving about 10.5 million records in about an hour.
I will like to find from other member's experiences as to how long can DIH
run with no issues? What is the maximum
We have had a situation where Solr node was in distress due to hard drive
being full and the queries became very slow. Since our Solr cluster has two
nodes with indexes being fully available on both the nodes, we think that
one good solution would be to just stop the Solr instance on a distressed
n
40 matches
Mail list logo