Hi,
Smaller merge factor will make things worse - it will cause Lucene to merge
index segments more often (than the default merge factor of 10), thus resulting
in more new files being created on the master, thus resulting in more network
IO, more disk IO on the slaves, more OS cache evicted on
You may want to try the RPM tool, it will show you what inside of that
QueryComponent is really slow.
http://blog.sematext.com/2010/05/11/solr-performance-monitoring-announcement/
Or you can run Solr under your own profiler.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lu
> first step is to do an &debugQuery=true and see where the time is
> going on the server-side. If you're doing highlighting of a stored
> field, that can be a biggie. The timings will be in the debug output
> - be sure to look at both sections of the timings.
>
Looks like the majori
Blargy - Please try to quote the mail you're responding to, at least
> the relevant piece. It's nice to see some context to the discussion.
No problem ;)
Depends - if you optimize the index on the master, then the entire index is
replicated. If you simply commit and let Lucene take care of
You, sir, are on my Christmas card list.
I'll fire it up tomorrow morning & let you know how it goes.
-- Ken
On Jun 17, 2010, at 8:34pm, Erik Hatcher wrote:
Your wish is my command. Check out trunk, fire up Solr (ant run-
example), index example data, hit http://localhost:8983/solr/browse
Your wish is my command. Check out trunk, fire up Solr (ant run-
example), index example data, hit http://localhost:8983/solr/browse -
type in search box.
Just used jQuery's autocomplete plugin and the terms component for
now, on the name field. Quite simple to plug in, actually. Check th
Blargy - Please try to quote the mail you're responding to, at least
the relevant piece. It's nice to see some context to the discussion.
On Jun 17, 2010, at 10:23 PM, Blargy wrote:
Is there an alternative for highlighting on a large stored field?
Not currently.
I thought
for highlighting
Is there an alternative for highlighting on a large stored field? I thought
for highlighting you needed the field stored? I really just need the
excerpting feature for highlighting relevant portions of our item
descriptions.
Not sure if this is because of the index size (17.5G) or because of
high
first step is to do an &debugQuery=true and see where the time is
going on the server-side. If you're doing highlighting of a stored
field, that can be a biggie. The timings will be in the debug output
- be sure to look at both sections of the timings.
Erik
On Jun 17, 2010, at 9
After indexing our item descriptions our index grew from around 3gigs to now
17.5 and I can see our search has deteriorated from sub 50ms searches to
over 500ms now. The sick thing is I'm not even searching across that field
at the moment but I plan to in the near future as well as include
highlig
Hmmm, maybe I'm wrong and it does support AND. Looking at the code I
don't see why it wouldn't, actually. Though I believe I've seen it
documented that it isn't supported (or at least not advertised to
support). Ok, from the dismax wiki page it says: "This query handler
supports an extre
Thanks, Erik. that does work. I misunderstood the documentation, i
thought "clause" meant "field" rather than the terms in the query.
If dismax doesn't support the operator AND, why would the query
"solr/select?q=fuel+cell" and "solr/select?q=fuel+AND+cell" get parsed
differently(it adds the
dismax does not support the operator AND. It uses +/- only.
set mm=100% (not 1), as Hoss said, and try your query again.
Erik
On Jun 17, 2010, at 8:08 PM, Tommy Chheng wrote:
I don't think setting the mm helps.
I have mm to 1 which means the query terms should be in at least one
fi
On Jun 17, 2010, at 7:44 PM, Mark Diggory wrote:
when I saw what was done with the "templating" of the Maven pom
work that was originally donated to solr, I just cringed at it.
Most of us Solr committers are fairly anti-Maven or ambivalent about
it at best, so it hasn't gotten much TLC, ad
I don't think setting the mm helps.
I have mm to 1 which means the query terms should be in at least one
field. Both query strings satisfy this condition.
The query "solr/select?q=fuel+cell" is parsed as
"querystring":"fuel cell",
"parsedquery":"+((DisjunctionMaxQuery((text:fuel |
organi
I don't believe Solritas supports autocompletion out of the box.
So I'm wondering if anybody has experience using the LucidWorks distro
& Solritas, plus the AJAX Solr auto-complete widget.
I realize that AJAX Solr's autocomplete support is mostly just
leveraging the jQuery Autocomplete plug
Erik,
I try not to be exclusionary of others development tool choices in the
selection of my own. However, just to surely stir up a nest of hornets in true
Apache fashion... when I saw what was done with the "templating" of the Maven
pom work that was originally donated to solr, I just cringed
: I want to reorder the results as per function like
: sum(w0*score, w1*field1, w2*field2, w3*filed3,..)
:
: I am using solr1.4 and it seems it does not support sort by function.
:
: How can this be achieved
:
: I tried using
: q=(query)^w0 (_val_:field1)^w1 (_val_:field2...)^w2
try fq=(query
: Because waitFlush doesn't work currently, your client
i didn't realize waitFlush is currently ignored ... is that an open bug
in Jira, or was it a neccessary change because of something else? do we
at least log an warning if someone tries to use waitFlush=false?
-Hoss
: I'm using the dismax request handler and want to set the default operator to
: AND.
: Using the standard handler, i could just use the q.op or defaultOperator in
: the schema, but this doesn't work using the dismax request handler.
:
: For example, if I call "solr/select/?q=fuel+cell", I want
I'm using the dismax request handler and want to set the default
operator to AND.
Using the standard handler, i could just use the q.op or defaultOperator
in the schema, but this doesn't work using the dismax request handler.
For example, if I call "solr/select/?q=fuel+cell", I want solr to ha
yes, you can use distributed search across shards with different
schemas as long as the query only references overlapping fields, i
usually test adding new fields or tokenizers on one shard and deploy
only after i verified its working properly
On Thu, Jun 17, 2010 at 1:10 PM, Markus Jelsma wrote:
: We use Solr along with Drupal for our content management needs. The
: solrconfig.xml that we have from Drupal mentions that "we do not
: spellcheck by default" and here is our request handler from
: solrconfig.xml.
:
: First question - why is it recommended that we do not spellcheck by
: defau
Wonderful,
Thank you both.
Pete.
-Original Message-
From: Erik Hatcher [mailto:erik.hatc...@gmail.com]
Sent: Thursday, June 17, 2010 3:33 PM
To: solr-user@lucene.apache.org
Subject: Re: Exact match on a filter
And when you do that, a best practice for fq'ing on a string field is:
f
And when you do that, a best practice for fq'ing on a string field is:
fq={!raw f=field_name}value
That avoids query parsing and the hassles associated with escaping
special characters.
Erik
On Jun 17, 2010, at 6:22 PM, Joe Calderon wrote:
use a copyField and index the copy as
see yonik's post on nested queries
http://www.lucidimagination.com/blog/2009/03/31/nested-queries-in-solr/
so for example i thought you could possibly do a dismax query across
the main fields (in this case just title) and OR that with
_query_:"{!description:'oil spill'~4}"
On Thu, Jun 17, 2010 at
use a copyField and index the copy as type string, exact matches on
that field should then work as the text wont be tokenized
On Thu, Jun 17, 2010 at 3:13 PM, Pete Chudykowski
wrote:
> Hi,
>
> I'm trying with no luck to filter on the exact-match value of a field.
> Speciffically:
> fq=brand:appl
Hi,
I'm trying with no luck to filter on the exact-match value of a field.
Speciffically:
fq=brand:apple
returns document's whose 'brand' field contains values like "apple bottoms".
Is there a way to formulate the fq expression to match precisely and only
"apple" ?
Thanks in advance for your
Barani,
without more background on dynamic fields, I would say that the most easiest
way would be to define a suffix for each of the fields you want to index
into the mentioned dynamic field and to redefine your dynamic field -
condition.
If suffix does not work, because of other dynamic-field d
Otis,
And again I wished I were registred.
I will check the JIRA and when I feel comfortable with it, I will open it.
Kind regards
- Mitch
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-and-Nutch-Droids-to-use-or-not-to-use-tp900069p904145.html
Sent from the Solr - U
Joe,
please, can you provide an example of what you are thinking of?
Subqueries with Solr... I've never seen something like that before.
Thank you!
Kind regards
- Mitch
--
View this message in context:
http://lucene.472066.n3.nabble.com/DismaxRequestHandler-tp903641p904142.html
Sent from th
Hi,
> One problem down, two left! =) bf ==> bq did the trick, thanks. Now at
> least if I can't get the DIH solution working I don't have to tack that on
> every query string.
>
I would really recommend to use a boost function. If your rank will change
in future implementations, you do not
(cross posted announcement, please keep any replies to gene...@lucene)
On behalf of Lucid Imagination, I'd like to invite folks to a free Webinar
we're hosting on June 24th...
How Cisco’s Pulse uses Lucene/Solr to put Social Networks to Work
Thursday, June 24, 2010
9am
Pulled this out of another thread of mine as it's the only bit left that I
haven't been able to figure out.
Can someone show me briefly how one would include a docBoost inside a DIH?
I've got something like this...
var rank = row.get('rank');
switch (rank) {
Sorry for the repost but I posted under DismaxRequestHandler when I should
have listed it as DismaxQueryParser.. ie im using defType=dismax
I have a title field and a description filed. I am searching across both
fields but I don't want description matches unless they are within some slop
of each
Hi,
I am facing some issue with dynamic fields. I have 2 fields (UID and ID) on
which I want to do whole word search only.. I made those 2 fields to be of
type 'string'.
I also have a dynamic field with textgen field type as below
This dynamic field seems to capture all the data including
I didn't open the issue, Mitch, but feel free to do it.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
> From: MitchK
> To: solr-user@lucene.apache.org
> Sent: Thu, June 17, 2010 12:07:13 PM
I created LUCENE-2503 to address this.
On Thu, Jun 17, 2010 at 12:56 PM, Rachel Arbit wrote:
> Hi all,
> I'm having trouble finding a stemmer that's less aggressive than the
> porter-stemmer, ideally, one that does only plural stemming.
> I've been trying to get KStem to work by copying the luci
Mitch, thank you very much for your help, I'll read all the links you gave
me.
2010/6/17 MitchK
>
> Antonello,
>
> here are a few links to the Solr Wiki:
> http://wiki.apache.org/solr/SolrReplication Solr Replication
> http://wiki.apache.org/solr/DistributedSearchDesign Distributed Search
> Des
Hi,
Check out Solr sharding [1] capabilities. I never tested it with different
schema's but if each node is queried with fields that it supports, it should
return useful results.
[1]: http://wiki.apache.org/solr/DistributedSearch
Cheers.
-Original message-
From: Sascha Szott
I knew it wasn't me! :)
I found the patch just before I read this and applied it to the trunk
and it works!
Thanks Mark and martijn for all your help!
- Moazzam
On Thu, Jun 17, 2010 at 2:16 PM, Martijn v Groningen
wrote:
> I've added a new patch to the issue, so building the trunk (rev
> 95561
the qs parameter affects matching , but you have to wrap your query in
double quotes,ex
q="oil spill"&qf=title description&qs=4&defType=dismax
im not sure how to formulate such a query to apply that rule just to
description, maybe with nested queries ...
On Thu, Jun 17, 2010 at 12:01 PM, Blargy
I've added a new patch to the issue, so building the trunk (rev
955615) with the latest patch should not be a problem. Due to recent
changes in the Lucene trunk the patch was not compatible.
On 17 June 2010 20:20, Erik Hatcher wrote:
>
> On Jun 16, 2010, at 7:31 PM, Mark Diggory wrote:
>>
>> p.s.
I have a title field and a description filed. I am searching across both
fields but I don't want description matches unless they are within some slop
of each other. How can I query for this? It seems that im getting back crazy
results when there are matches that are nowhere each other
--
View th
One problem down, two left! =) bf ==> bq did the trick, thanks. Now at
least if I can't get the DIH solution working I don't have to tack that on
every query string.
Taking the quotes away from $docBoost results in a syntax error. Needs to
be quoted.
Changed it up to this and still no luck
On Jun 16, 2010, at 7:31 PM, Mark Diggory wrote:
p.s. I'd be glad to contribute our Maven build re-organization back
to the community to get Solr properly Mavenized so that it can be
distributed and released more often. For us the benefit of this
structure is that we will be able to overla
Correct, it uses maven and just constructs the War executable, its upto you to
configure the location of your solr home directory still.
svn co https://scm.dspace.org/svn/repo/modules/dspace-solr/trunk solr
cd solr
mvn package
then you can go into the webapp/target directory and get the generate
Hi folks,
if I'm seeing it right Solr currently does not provide any support for
federated / meta searching. Therefore, I'd like to know if anyone has
already put efforts into this direction? Moreover, is federated / meta
search considered a scenario Solr should be able to deal with at all or
> I'm having trouble finding a stemmer that's less aggressive
> than the
> porter-stemmer, ideally, one that does only plural
> stemming.
Looks like PlingStemmer does this.
http://www.mpi-inf.mpg.de/yago-naga/javatools/doc/javatools/parsers/PlingStemmer.html
Ok that makes perfect sense.
"What I did was use a combination of the two running the indexed terms
through " - I initially read this as you used your current index and use
the terms from that to buildup your dictionary.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Au
Hi all,
I'm having trouble finding a stemmer that's less aggressive than the
porter-stemmer, ideally, one that does only plural stemming.
I've been trying to get KStem to work by copying the lucid-kstem and
lucid-solr-kstem jars from the lucid distribution into solr/lib, but I get a
classNotFound E
Hi Mark,
Thanks for posting those links. I know this is probably a dumb
question, but how do I make Solr work through your repository? I ask
this because I don't see a build xml file and the folder structure is
a bit different (I'm guessing I am not supposed to use ant on that :D)
Thanks,
Moazza
We base the auto-suggest on popular searches. Our site logs the search
terms in a database and a simple query can give us a summary counting
the number of times the search was entered and the number of results
it returned, similar to the criteria used in the lucid imagination
article you cite. Each
What is the usecase for such an architecture?
Do you send requests to two different masters for indexing and that's why
they need to be synchronized?
Kind regards
- Mitch
--
View this message in context:
http://lucene.472066.n3.nabble.com/Master-master-tp884253p903233.html
Sent from the Solr -
Antonello,
here are a few links to the Solr Wiki:
http://wiki.apache.org/solr/SolrReplication Solr Replication
http://wiki.apache.org/solr/DistributedSearchDesign Distributed Search
Design
http://wiki.apache.org/solr/DistributedSearch Distributed Search
http://wiki.apache.org/solr/SolrCloud So
Thanks for the reply Michael. Ill definitely try that out and let you know
how it goes. Your solution sounds similar to the one I've read here:
http://www.lucidimagination.com/blog/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/
There are some good comments in there too.
I think
Sorry, I've overlooked your other question.
>
> rank:1^10.0 rank:2^9.0 rank:3^8.0 rank:4^7.0 rank:5^6.0 rank:6^5.0
> rank:7^4.0 rank:8^3.0 rank:9^2.0
>
>
This is wrong.
You need to change "bf" to "bq".
Bf -> boosting function
Bq -> boosting query.
--
View this message
Hi,
first of all, are you sure that row.put('$docBoost',docBoostVal) is correct?
I think it should be row.put($docBoost,docBoostVal); - unfortunately I am
not sure.
Hm, I think, until you can solve the problem with the docBoosts itself, you
should use a functionQuery.
Use "div(1, rank)" as boo
Hi to every one I have a question and I hope someone can help me.
I know that mission critical reliability can be implemented with Lucene/Solr
by using multi-node configurations, and redundant architectures, but I
haven't found documentation on how to do it.
Can someone help me to find a link to re
Otis,
you are right. I wasn't aware of this. At least not with such a large
dataList (let's think of an index with 4mio docs, this would mean we got an
ExternalFile with 4mio records). But from what I've read at
search-lucene.com it seems to perform very well. Thanks for the idea!
Btw: Otis, di
Blargy,
I've been experimenting with this myself for a work project. What I
did was use a combination of the two running the indexed terms through
the Shingle factory and then through the edge n-gram filter. I did
this in order to be able to match terms like :
.net asp c#
asp .net c#
c# asp .net
Brand new to this sort of thing so bear with me.
For sake of simplicity, I've got a two field document, title and rank.
Title gets searched on, rank has values from 1 to 10. 1 being highest.
What I'd like to do is boost results of searches on title based on the
documents rank.
Because it's fa
How can I preserve phrases for either autosuggest/autocomplete/spellcheck?
For example we have a bunch of product listings and I want if someone types:
"louis" for it to common up with "Louis Vuitton". "World" ... "World cup".
Would I need n-grams? Shingling? Thanks
--
View this message in con
Thank you so much for the reply...The link suggested by you is helpful but
they have explain everything with use of curl command which I don't want to
use.
I was more interested in uploading the .html documents using HTTP web
request.
So I have stored all .html files at one location & then create
Hi,
I think that "+" there is just a "space" (like %20).
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
> From: Chia Hao Lo
> To: solr-user@lucene.apache.org
> Sent: Thu, June 17, 2010 2:44:3
Mitch,
Yes, one day. But it sounds like you are not aware of ExternalFieldFile, which
you can use today:
http://search-lucene.com/?q=ExternalFileField&fc_project=Solr
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
---
As you can see both versions don't appear to be working. I tried building
each but neither would compile. Which version/tag should be used when
applying this patch?
In general, a patch is written against the latest trunk branch
as of then. For the SOLR-1316.patch, it was posted 2010-5-31,
Hi,
I am using solr 1.3 and when indexing i am getting RejectedExecutionException
after processing the last batch of update records from the database.
happening when coreContainer.shutdown() is called after processing the last
record.
i have autocommits enabled based on maxTime which is 10 minut
Hi Folks,
Is there any way to get or estimate the total number of results when using
field collapsing (SOLR-236) without using faceting or a second query?
Kind Regards,
Adrian Pemsel
--
http://www.jusmeum.de
Hi Moazzam,
Yup i hv encountered the same thing.
Build errors after applying the patch.
Rakhi
On Thu, Jun 17, 2010 at 3:33 AM, Moazzam Khan wrote:
> I got the code from trunk again and now I get this error:
>
>[javac] symbol : class StringIndex
>[javac] location: interfa
> Solr doesn't know anything about OPIC, but I suppose you can feed the OPIC
> score computed by Nutch into a Solr field and use it during scoring, if
> you want, say with a function query.
>
Oh! Yes, that makes more sense than using the OPIC as doc-boost-value. :-)
Anywhere at the Lucene Mail
71 matches
Mail list logo