Hello,
I am using solr-4.10.4 with mlt. I noticed that mlt constructs query which is
missing some words. For example, for doc with
title: Jennnifer Lopez
keywords: Jennifer, concert, Hollywood
the parsedquery generated by mlt for this doc is title:lopez
keywords:jennifer keywords:concert
I have used snapshotter api and modified snapinstaller script, so that it
successfully grabs the snapshot folder and updates index folder in slave.
However, it fails to open newSearcher.
It simple, sends a commit command to slave, but hasUncommittedChanges
function returns false.
That is the reas
Hello,
We cannot use replication with the current architecture, so decided to use
snapshotter with snapinstaller.
Here is the full stack trace
8937 [coreLoadExecutor-5-thread-3] INFO
org.apache.solr.core.CachingDirectoryFactory – Closing directory:
/home/solr/solr-4.10.1/solr/example/solr/p
Hello,
I am using latest solr (solr trunk) . I run snapinstaller, and see that it
copies snapshot to index folder but changes are not picked up and
logs in slave after running snapinstaller are
44302 [qtp1312571113-14] INFO org.apache.solr.update.UpdateHandler – start
commit{,optimize=false
Hello,
We need to order solr search results according to specific rules.
I will explain with an example. Let say solr returns 1000 results for query
"sport".
These results must be divided into three buckets according to rules that come
from database.
Then one doc must be chosen from each b
Hi,
From the discussion it is not clear if this is a fixable bug in the case of
documents being in different shards. If this is fixable could someone please
direct me to the part of the code so that I could investigate.
Thanks.
Alex.
-Original Message-
From: Andrew Shumway
To
Hello,
I try to construct wikipedia page url from page title using regexTransformer
with
This does not work for titles that have no space, so title_underscore for them
is empty.
Any ideas what is wrong here?
This is with solr-4.8.1
Thanks. Alex.
Hi,
I see similar problem in our solr application. Sometime it gives number in a
group as number of all documents. This starting to happen after upgrade from
4.6.1 to 4.8.1
Thanks.
Alex.
-Original Message-
From: 海老澤 志信
To: solr-user
Sent: Tue, Jun 17, 2014 5:24 am
Subject: RE: gr
It seems to me that, you are missing this line
under
Alex.
-Original Message-
From: solr-user
To: solr-user
Sent: Tue, Apr 1, 2014 5:01 pm
Subject: Re: how do I get search for "fort st john" to match "ft saint john"
Hi Eric.
Sorry, been away.
The city_index_synonyms
Hello,
For queries in solrcloud and in distributed mode solr-4.6.1 spellcheck does not
return any suggestions, but in non-distrubited mode.
Is this a know bug?
Thanks.
Alex.
I found out that generated files are the same. I think this is because that
these lines inside build file
Note that the gennorm2 and icupkg tools must be on your PATH. These
tools
are part of the ICU4C package. See http://site.icu-
I found out that generated files are the same. I think this is because that
these lines inside build file
Note that the gennorm2 and icupkg tools must be on your PATH. These
tools
are part of the ICU4C package. See http://site.icu-
Hello,
I use
icu4j-49.1.jar,
lucene-analyzers-icu-4.6-SNAPSHOT.jar
for one of the fields in the form
I need to change one of the accent char's corresponding letter. I made changes
to this file
lucene/analysis/icu/src/data/utr30/DiacriticFolding.txt
recompiled solr and lucene and replaced t
Hi,
Could someone please confirm that this must me so or this is a bug in SOLR.
In short, I see three logs in SOLR for one request
http://server1:8983/solr/mycollection/select?q=alex&wt=xml&defType=edismax&facet.field=school&facet.field=company&facet=true&facet.limit=10&facet.mincount=1&qf=schoo
ny__terms have all values from company facet field.
>
> I have added terms=false to the original query sent to solr, but it did
> not help.
>
> Does anyone has any idea how to suppress these queries.
>
> Thanks.
> Alex.
>
>
>
>
>
>
>
>
> -Original
the original query sent to solr, but it did not
help.
Does anyone has any idea how to suppress these queries.
Thanks.
Alex.
-Original Message-
From: alxsss
To: solr-user
Sent: Fri, Jul 19, 2013 5:00 am
Subject: additional requests sent to solr
Hello,
I send to solr( to ser
Hello,
I send to solr( to server1 in the cluster of two servers) the folowing request
http://server1:8983/solr/mycollection/select?q=alex&wt=xml&defType=edismax&facet.field=school&facet.field=company&facet=true&facet.limit=10&facet.mincount=1&qf=school_txt+company_txt+name&shards=server1:8983/sol
Another way of overriding nutch fields is to modify solrindex-mapping.xml file.
hth
Alex.
-Original Message-
From: Jack Krupansky
To: solr-user
Sent: Sun, Jun 23, 2013 12:04 pm
Subject: Re: document id in nutch/solr
Add the "passthrough" dynamic field to your Solr schema, and
Hello,
I have a solr index of size 5GB. I am thinking of increasing cache size to 5
GB, expecting Solr will put whole index into memory.
1. Will Solr indeed put whole index into memory?
2. What are drawbacks of this approach?
Thanks in advance.
Alex.
Hi,
I did not find any descriptions, except constructor and method names.
Thanks.
Alex.
-Original Message-
From: Markus Jelsma
To: solr-user
Sent: Tue, Apr 23, 2013 12:08 pm
Subject: RE: EdgeGram filter
Always check the javadocs. There's a lot of info to be found there:
http
Hi,
I was unable to find more info about
LimitTokenCountFilterFactory
in solr wiki. Is there any other place to get thorough description of what it
does?
Thanks.
Alex.
-Original Message-
From: Jack Krupansky
To: solr-user
Sent: Tue, Apr 23, 2013 11:36 am
Subject: Re: EdgeGr
How many segments each shard has and what is the reason of running multiple
shards in one machine?
Alex.
-Original Message-
From: qibaoyuan
To: solr-user
Sent: Fri, Apr 19, 2013 12:26 am
Subject: Re: solr-cloud performance decrease day by day
there are 6 shards and they are
inside your request handler try to put spellcheck true and name of the
spellcheck dictionary
hth
Alex.
-Original Message-
From: davers
To: solr-user
Sent: Thu, Apr 11, 2013 6:24 pm
Subject: Spellchecker not working for Solr 4.1
This is almost the same exact setup I was usin
Did index size increase after turning on termPositions and termOffsets?
Thanks.
Alex.
-Original Message-
From: Ravi Solr
To: solr-user
Sent: Mon, Mar 25, 2013 8:27 am
Subject: Query slow with termVectors termPositions termOffsets
Hello,
We re-indexed our entire core o
1. group=true, no spellcheck results
2. group=false , there are spellcheck results
case when matches>0
1. group=true, there are spellcheck results
2. group =false, there are spellcheck results
Do these constitute a failing test case?
Thanks.
Alex.
-Original Message-
From:
. group=false , there are spellcheck results
case when matches>0
1. group=true, there are spellcheck results
2. group =false, there are spellcheck results
Do these constitute a failing test case?
Thanks.
Alex.
-Original Message-
From: alxsss
To: solr-user
Sent: Thu, Mar
Thanks in advance.
Alex.
-Original Message-
From: alxsss
To: solr-user
Sent: Tue, Mar 19, 2013 11:34 am
Subject: Re: strange behaviour of wordbreak spellchecker in solr cloud
-- distributed environment. But to nail it down, we probably need to see both
-- the app
Hello,
We need this feature be fixed ASAP. So, please let me know which class is
responsible for combining spellcheck results from all shards. I will try to
debug the code.
Thanks in advance.
Alex.
-Original Message-
From: alxsss
To: solr-user
Sent: Tue, Mar 19, 2013 11:34
-- distributed environment. But to nail it down, we probably need to see both
-- the applicable
Not sure what this is?
I have
spell
direct
spell
solr.DirectSolrSpellChecker
internal
0.5
2
1
Hello,
I was testing my custom testhandler. Direct spellchecker also was not working
in cloud. After I added
spellcheck
to /select requestHandler it worked but the wordbreak spellchecker. I have
added shards.qt=testhanlder to curl request but it did not solve the issue.
Thanks.
A
Hello,
I try to use wordbreak spellchecker in solr-4.2 with cloud feature. We have two
server with one shard in each of them.
curl 'server1:8983/solr/test/testhandler?q=paulusoles&indent=true&rows=10'
curl 'server2:8983/solr/test/testhandler?q=paulusoles&indent=true&rows=10'
does not return any
---So,"search" time is in no way impacting by the existence or non-existence of
stored values,
What about memory? Would it require to increase memeory in order to have the
same Qtime as in the case of indexed only fields?
For example in the case of indexed fields only index size is 5GB, a
Hi,
So, will search time be the same for the case when fields are indexed only vs
the case when they are indexed and stored?
Thanks.
Alex.
-Original Message-
From: Otis Gospodnetic
To: solr-user
Sent: Fri, Mar 15, 2013 8:09 pm
Subject: Re: structure of solr index
Hi,
I thin
Hi,
I wondered if solr searches on indexed fields only or on entire index? In more
detail, let say I have fields id, title and content, all indexed, stored.
Will a search send all these fields to memory or only indexed part of these
fields?
Thanks.
Alex.
Hello,
Recently we noticed that solr and its spellchecker do not return results for
keywords typed with non-whitespace delimiter.
A user accidentally typed u instead of white space. For example, paulusoles
instead of paul soles. Solr does not return any results or spellcheck
suggestion for key
Hi,
It is the index folder. tlog is only a few MB.
I have analysed all changed and found out that only one field in schema was
changed.
This field in non cloud
was changed to
in cloud to use fastVectorHighlighting.
Is it possible that this change could double index size?
Thanks.
Alex.
Hello,
I was unable to change pre and post tags for highlighting when
usefastVectorHighlighter is set to true. Changing default tags in
solrconfig.xml works for standard highlighter though. I searched mailing list
and the net with no success.
I use solr-4.1.0.
Thanks.
Alex.
Hi,
What if you add new collection to solr.xml file?
Alex.
-Original Message-
From: Shankar Sundararaju
To: solr-user
Sent: Thu, Feb 21, 2013 8:51 pm
Subject: How do I create two collections on the same cluster?
I am using Solr 4.1.
I created collection1 consisting of 2 lea
Hi,
It is curious to know how many linux boxes do you have and how many cores in
each of them. It was my understanding that solr puts in the memory all
documents found for a keyword, not the whole index. So, why it must be faster
with more cores, when number of selected documents from many sepa
Depending on your architecture, why not index the same data into two machines?
One will be your prod another your backup?
Thanks.
Alex.
-Original Message-
From: Upayavira
To: solr-user
Sent: Thu, Dec 20, 2012 11:51 am
Subject: Re: Pause and resume indexing on SolR 4 for backup
This is strange. We have data folder size 24Gb, RAM for java 2GB. We query
with grouping, ngroups and highlighting, do not query all fields and query
time mostly is less than 1 sec it rarely goes up to 2 sec. We use solr 3.6 and
tuned off all kind of caching.
Maybe your problem is with caching
What are the RAM of your server and size of the data folder?
-Original Message-
From: Agnieszka Kukałowicz
To: solr-user
Sent: Mon, Jul 16, 2012 6:16 am
Subject: Re: Grouping performance problem
Hi Pavel,
I tried with group.ngroups=false but didn't notice a big improvement.
The time
I had the same problem with jetty. It turned out that broken pipe happens when
application disconnects from jetty. In my case I was using php client and it
had 10 sec restriction in curl request. When solr takes more than 10 sec to
respond, curl automatically disconnected from jetty.
Hope this
I use jetty that comes with solr.
I use solr's dedupe
true
id
true
url
solr.processor.Lookup3Signature
and because of this id is not url itself but its encoded signature.
I see solrclean uses url to delete
all caching is disabled and I restarted jetty. The same results.
Thanks.
Alex.
-Original Message-
From: Lance Norskog
To: solr-user
Sent: Tue, May 1, 2012 2:57 pm
Subject: Re: Removing old documents
Maybe this is the HTTP caching feature? Solr comes with HTTP caching
turned on
Hello,
I did bin/nutch solrclean crawl/crawldb http://127.0.0.1:8983/solr/
without and with -noCommit and restarted solr server
Log shows that 5 documents were removed but they are still in the search
results.
Is this a bug or something is missing?
I use nutch-1.4 and solr 3.5
Thanks.
Alex.
Hello Hoss,
Here are the explain tags for two doc
0.021646015 = (MATCH) sum of:
0.021646015 = (MATCH) sum of:
0.02141003 = (MATCH) max plus 0.01 times others of:
2.84194E-4 = (MATCH) weight(content:apache^0.5 in 3578), product of:
0.0029881175 = queryWeight(content:apache^0.5
In that case documents 1 and 2 will not be in the results. We need them also be
shown in the results but be ranked after those docs with exact match.
I think omitting term frequency in calculating ranking in phrase queries will
solve this issue, but I do not see that such a parameter in configs.
Hello,
I use solr 3.5 with edismax. I have the following issue with phrase search. For
example if I have three documents with content like
1.apache apache
2. solr solr
3.apache solr
then search for apache solr displays documents in the order 1,.2,3 instead of
3, 2, 1 because term frequency in
Hello,
I have copied solr's data folder from dev linux box to prod one. When starting
solr I get this error in prod server. In dev solr starts sucessfully.
Caused by: java.io.FileNotFoundException:
/home/apache-solr-3.5.0/example/solr/data/index/segments_u (No such file or
directory)
take a look to
I think you must use dedup to solve this issue
-Original Message-
From: Thomas Dowling
To: solr-user
Cc: Mikhail Khludnev
Sent: Fri, Mar 2, 2012 1:10 pm
Subject: Re: Help with duplicate unique IDs
Thanks. In fact, the behavior I want is overwrite=true. I wan
you have put this
true
Maybe you need to put
true
Alex.
-Original Message-
From: Dyer, James
To: solr-user
Sent: Mon, Feb 13, 2012 12:43 pm
Subject: RE: spellcheck configuration not providing suggestions or corrections
That would be it, I tbinkl. Your request is to "/sele
Hi Chantal,
In the readme file at solr/contrib/analysis-extras/README.txt it says to add
the ICU library (in lib/)
Do I need also add ... and where?
Thanks.
Alex.
-Original Message-
From: Chantal Ackermann
To: solr-user
Sent: Fri, Jan 13, 2012 1:52 am
Subject: Re: can solr a
Hello,
I would like to know if solr has a functionality to automatically search for a
different punctuation of a word.
For example if I if a user searches for a word Uber, and stemmer is german
lang, then solr looks for both Uber and Über, like in synonyms.
Is it possible to give a file with
Hi Erick,
Why querying would be wrong?
It is my understanding that if I have let say 3 docs and each of them has been
indexed with its own language stemmer, then sending a query will search all
docs and return matching results? Let say if a query is "driving" and one of
the docs has drive a
Hi Eric,
After reading more about pf param I increased them a few times and this solved
options 2, 3, 4 but 1. As an example, for phrase "newspaper latimes"
latimes.com is not even in the results to boost it to the first place and
changing mm param to 1<-1 5<-2 6<90% solves
only 1,4 but 2
Hello,
Here is my request handler
edismax
explicit
0.01
site^1.5 content^0.5 title^1.2
site^1.5 content^0.5 title^1.2
id,title, site
2<-1 5<-2 6<90%
300
true
*:*
content
0
165
title
0
url
regex
I have made a few tests with debugQuery and realised that for two word phrases,
solr takes the fi
Hello,
I have build solr-3.4.0 data folder in dev server and copied it to prod server.
Made a search for a keyword, then modified qf and pf params in solrconfig.xml.
Made search for the same keywords, then restored qf and pf params to their
original value. Now, solr returns very less number of
It seem you forget this
true
-Original Message-
From: Ruixiang Zhang
To: solr-user
Sent: Tue, Nov 22, 2011 11:54 am
Subject: spellcheck in dismax
I put the following into dismax requestHandler, but no suggestion field is
returned.
true
false
1
I found out that curl timeout was set to 10 and for queries taking longer than
10 sec it was closing connection to jetty.
I noticed that when number of docs found is large solr returns results for
about 20 sec. This is too long. I set caching to off but it did not help.
I think solr spends too mu
Hello,
I use solr 3.4 with jetty that is included in it. Periodically, I see this
error in the jetty output
SEVERE: org.mortbay.jetty.EofException
at org.mortbay.jetty.HttpGenerator.flush(HttpGenerator.java:791)
at
org.mortbay.jetty.AbstractGenerator$Output.flush(AbstractGenerat
Hello,
Thanks for your letter. I investigated further and found out that we have title
scored more than content in qf field and those docs in the first places have
one of the words in title but not both of them.
The doc in the first place has only one of the words in the content.
Docs with both
Solr also can query link(url) text and rank them higher if we specify url in qf
field. Only problem is that why it does not rank pages with both words higher
when mm is set as
1<-1. It seems to me that this is a bug.
Thanks.
Alex.
-Original Message-
From: Ted Dunning
To: solr-
Hi Erick,
The term "newspaper latimes" is not found in latimes.com. However, google
places it in the first place. My guess is that mm parameter must not be set as
2<-1 in order to achieve google.com like ranking for two word phrase queries.
My goal is to set mm parameter in such a way that la
Hello,
I use nutch-1.3 crawled results in solr-3.4. I noticed that for two word
phrases like newspaper latimes, latimes.com is not in results at all.
This may be due to the dismax def type that I use in request handler
dismax
url^1.5 id^1.5 content^ title^1.2
url^1.5 id^1.5 content^0.5 title^1
Hello,
I have implemented spellchecker in two ways.
1. Adding a textspell type to schema.xml and making a copy field from original
content field, which is type text.
2. without adding new type and copy field. Simple adding name of spell field,
content to solrconfig.xml
I have an issue in
Is case #2 planned to be coded in the future releases?
Thanks.
Alex.
-Original Message-
From: Bill Bell
To: solr-user
Sent: Thu, Sep 8, 2011 10:17 pm
Subject: Re: pagination with grouping
There are 2 use cases:
1. rows=10 means 10 groups.
2. rows=10 means to results (irr
Hello,
When trying to implement pagination as in the case without grouping I see two
issues.
1. with rows=10 solr feed displays 10 groups not 10 results
2. there is no total number of results with grouping to show the last page.
In detail:
1. I need to display only 10 results in one page.
Hello,
I try to group by a field with type string. In the results I see groupValues as
parts of the group field.
Any ideas how to fix this.
Thanks.
Alex.
Hello,
I have tried to implement spellchecker based on index in nutch-solr by adding
spell field to schema.xml and making it a copy from content field. However,
this increased data folder size twice and spell filed as a copy of content
field appears in xml feed which is not necessary. Is it po
I forget to say that when I do
curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
--data-binary ''
0453
and search for added keywords gives 0 results. Does status 0 mean that addition
was successful?
Thanks.
Alex.
-Original Message-
From: Erik Hatcher
T
Tried to add a new record using
curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
--data-binary '
20090512170318
86937aaee8e748ac3007ed8b66477624
0.21189615
test.com
test test
20090513003210909
'
I get
071
and added records are not found in the search.
Any ideas w
Hello,
I had? Nutch -1.0 to crawl fetch and index a lot of files. Then I needed to?
index a few files also. But I know keywords for those files and their?
locations. I need to add them manually. I took a look to two tutorials on the
wiki, but did not find any info about this issue.
Is there a tu
74 matches
Mail list logo