Hi,
I need to know what are the top (most frequently searched and their
frequencies) 'n' (say 100) search queries that users tried. Does Solr keep
this information and can return, or else what options do i have here?
Thanks,
Praveen
Hi everybody,
We got a problem with parenthesis in a lucene/solr request (Solr 1.4) :
- {!lucene q.op=AND}( ville:"Moscou" -periodicite:"annuel") give
254documents
with parsedquery>+ville:Moscou -periodicite:annuel< in debug mode.
Thas'ts correct.
- {!lucene q.op=AND} (ville:"Moscou" AND NOT
I would like to be able to elevate documents if the query matches part of a
string.
For example, I would like to elevate the document FOO in case the query
contains the word 'archive'. So when executing the queries
"packet archive"
"archive failure"
"archive"
All leads to the document FOO b
Please explain a range query?
tia :-)
Dennis Gearon
Signature Warning
EARTH has a Right To Life,
otherwise we all die.
Read 'Hot, Flat, and Crowded'
Laugh at http://www.yert.com/film.php
--- On Thu, 4/29/10, Yonik Seeley wrote:
> From: Yonik Seeley
> Subject: Re: benefi
Thanks much Koji,
Let me have look on this,
Regards
Prakash
-Original Message-
From: Koji Sekiguchi [mailto:k...@r.email.ne.jp]
Sent: Thursday, April 29, 2010 8:25 PM
To: solr-user@lucene.apache.org
Subject: Re: How to make documents low priority
Doddamani, Prakash wrote:
> Thanks Jon
Hi,
I have my config something like "clubbed_text" of type "text" and
"clubbed_string" of type "string". :
BLOCK-1...
BLOCK-2...
BLOCK-3...
BLOCK-4...
Is the copyField valid specified in BLOCK-4? It seems it is not populating the
clubbed_string with the values
Pablo, Ubuntu Lucid is *brand* new :)
try:
find / -name \*solr\*
or
locate solr.war
Or simply try http://localhost:8080/solr/admin/
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
> From:
Hi
I've installed solr-tomcat package on ubuntu lucid (10.04 latest).
It automatically install java and tomcat and hopefully all other
dependencies.
I can access tomcat at http://localhost:8080 but not sure where to find the
solr web admin
http://localhost:8180 gives me nothing.
Is this package k
On 4/29/10 8:50 PM, Marco Martinez wrote:
Hi Ranveer,
If you don't specify a field type in the q parameter, the search will be
done searching in your default search field defined in the solrconfig.xml,
its your default field a text_sync field?
Regards,
Marco Martínez Bautista
http://www.paradig
I'm trying to reproduce now... single thread adding documents to a
multithreaded client, StreamingUpdateSolrServer(addr,32,4)
I'm currently at the 2.5 hour mark and 100M documents - no issues so far.
-Yonik
Apache Lucene Eurocon 2010
18-21 May 2010 | Prague
On Thu, Apr 29, 2010 at 5:12 PM, Yon
What is the garbage collection status when this happens?
What are the open sockets in the OS when this happens? Run 'netstat
-an | fgrep 8983' where 8983 is the Solr incoming port number.
A side note on sockets:
SUSS uses the MultiThreadedHttpConnectionManager but never calls
MultiThreadedHttpCo
On Thu, Apr 29, 2010 at 6:04 PM, Lance Norskog wrote:
> In solrconfig.xml, there is a parameter controlling remote streaming:
>
>
> multipartUploadLimitInKB="2048000" />
>
> 1) Is this relevant with the SUSS?
No, this relates to solr pulling data from another source (via stream.url
DollarDays.com is currently using it and we display the powered by logo
as at least a gesture of giving back to the community.
Ryan T. Grange, IT Manager
DollarDays International, Inc.
rgra...@dollardays.com (480)922-8155 x106
On 4/29/2010 11:10 AM, Daniel Baughman wrote:
Hi I'm new to the li
In solrconfig.xml, there is a parameter controlling remote streaming:
1) Is this relevant with the SUSS?
2) It seems to be 'true' in the example default, which may not be a good idea.
On Thu, Apr 29, 2010 at 2:12 PM, Yonik Seeley
wrote:
> On Fri, Apr 16, 2010 at 1:34 PM, Sascha
It sounds like you want a data warehouse, not a text search engine.
Splunk and Pentaho are good things to try.
On Thu, Apr 29, 2010 at 12:03 PM, Jon Baer wrote:
> To follow up it ... it seems dumping to Solr is common ...
>
> http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-
Do you really need the *:* stuff in the date range subqueries? That
may add to the execution time.
On Thu, Apr 29, 2010 at 9:52 AM, Erick Erickson wrote:
> Hmmm, what does the rest of your query look like? And does adding
> &debugQuery=on show anything interesting?
>
> Best
> Erick
>
> On Thu, Ap
What you want is:
All results within the area and whatever results the
QueryElevateComponent adds, sorted by some relevance function.
If this is it, you can get the results, with the elevated output, and
do a second query with all of the ids, sorted by distance. This second
query would not use
I think the problems one has to solve are depending on the usecases one has
to deal with.
It makes a difference whether I got much documents that are bloody similar
but with different contexts and I have to determine what query applies to
what context in what probability for which document - or if
On Fri, Apr 16, 2010 at 1:34 PM, Sascha Szott wrote:
> In my case the whole application hangs and never recovers (CPU utilization
> goes down to near 0%). Interestingly, the problem reproducibly occurs only
> if SUSS is created with *more than 2* threads.
Is your application also using multiple t
Thanks Koji,
That was the information I was looking for. I'll be sure to post the test
results to the list. It may be a few weeks before we can schedule the tests
for our test server.
Tom
>>I've never tried it but NoMergePolicy and NoMergeScheduler
>>can be specified in solrconfig.xml:
>>
Folks,
Greetings.
Using dismax query parser is there a way to perform prefix match. For
example: If I have a field called 'booktitle' with the actual values as
'Code Complete', 'Coding standard 101', then I'd like to search for the
query string 'cod' and have the dismax match against both the book
Thanks for this Peter. I have managed to get this working with Tomcat.
Andrew
On 29 April 2010 12:11, Peter Sturge wrote:
> Hi Andrew,
>
> Today, authentication is handled by the container (e.g. Tomcat, Jetty etc.).
>
>
> There's a thread I found to be very useful on this topic here:
>
> http:/
On Wed, Apr 28, 2010 at 11:22 AM, Nagelberg, Kallin
wrote:
> Does anyone have an idea about the performance benefits of searching across
> floats compared to strings? I have one multi-valued field that contains about
> 3000 distinct IDs across 5 million documents. I am going to be a lot of
> qu
Floats are Trie types and are stored in a compressed format. They will
search faster. They will also sort with much less space.
One thing to point out is that doing bitwise comparison on floats is
to live in a state of sin. Your string representations must parse
exactly right.
On Wed, Apr 28, 201
ColdFusion 9 is now shipping with it, as well.
Thanks everyone for the inputs.
-Original Message-
From: Grant Ingersoll [mailto:gsi...@gmail.com] On Behalf Of Grant Ingersoll
Sent: Thursday, April 29, 2010 1:35 PM
To: solr-user@lucene.apache.org
Subject: Re: Evangelism
Hi Daniel,
There
Hi Daniel,
There are lots of sites running Solr ranging from very large to very small.
Because it is open source, people aren't required to report, but there are
several places where people have reported:
http://wiki.apache.org/solr/PublicServers
http://www.lucidimagination.com/developer/Commu
To follow up it ... it seems dumping to Solr is common ...
http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
- Jon
On Apr 29, 2010, at 1:58 PM, Jon Baer wrote:
> Good question, +1 on finding answer, my take ...
>
> Depending on how large of log files y
Forgot the link.
http://www.lucidimagination.com/Community/Marketplace/Application-Showca
se-Wiki/Netflix
-Original Message-
From: Jason Chaffee [mailto:jchaf...@ebates.com]
Sent: Thursday, April 29, 2010 11:52 AM
To: solr-user@lucene.apache.org
Subject: RE: Evangelism
Netflix search i
Netflix search is built with Solr. That seems like a fairly big and
recognizable company.
-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Thursday, April 29, 2010 11:44 AM
To: solr-user@lucene.apache.org
Subject: Re: Evangelism
This is a Lucene story, but
This is a Lucene story, but may well apply... By the time I'd sent a request
for assistance
to the vendor of one of our search tools and received the reply "you didn't
give us the
right license number", I'd found Lucene, indexed part of my corpus and run
successful
searches against it. And had answ
A lot of high performing websites use MySQL, Oracle and Microsoft SQL Server
for data storage and other RDBMS needs without necessarily putting the
"powered by" logo on the sites.
If you need the certified version of Apache Solr, you can contact Lucid
Imagination.
Just like MySQL, Apache Solr and
I had a very hard time selling Solr to business folks. Most are of the mind
that if you're not paying for something it can't be any good. That might also
be why they refrain from posting 'powered by solr' on their website, as if it
might show them to be cheap. They are also fearful of lack of su
Their main search page has the "Powered by Solr" logo
http://www.lucidimagination.com/search/
On Thu, Apr 29, 2010 at 2:18 PM, Israel Ekpo wrote:
> Checkout Lucid Imagination
>
> http://www.lucidimagination.com/About-Search
>
> This should convince you.
>
>
> On Thu, Apr 29, 2010 at 2:10 PM,
Checkout Lucid Imagination
http://www.lucidimagination.com/About-Search
This should convince you.
On Thu, Apr 29, 2010 at 2:10 PM, Daniel Baughman wrote:
> Hi I'm new to the list here,
>
>
>
> I'd like to steer someone in the direction of Solr, and I see the list of
> companies using solr, but
A very abbreviated list of sites using Apache Solr + Drupal here:
http://drupal.org/node/447564
-Peter
On Thu, Apr 29, 2010 at 2:10 PM, Daniel Baughman wrote:
> Hi I'm new to the list here,
>
>
>
> I'd like to steer someone in the direction of Solr, and I see the list of
> companies using solr,
Hi I'm new to the list here,
I'd like to steer someone in the direction of Solr, and I see the list of
companies using solr, but none have a "power by solr" logo or anything.
Does anyone have any great links with evidence to majorly successful solr
projects?
Thanks in advance,
Dan B
Good question, +1 on finding answer, my take ...
Depending on how large of log files you are talking about it might be better
off to do this w/ HDFS / Hadoop (and a script language like Pig) (or Amazon EMR)
http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873
Theoretically y
Thanks, Im looking @ the atomic broadcast messaging protocol of Zookeeper and
think I have found what I was looking for ...
- Jon
On Apr 28, 2010, at 11:27 PM, Yonik Seeley wrote:
> On Wed, Apr 28, 2010 at 2:23 PM, Jon Baer wrote:
>> From what I understand Cassandra uses a generic gossip proto
I thought i remembered seeing some information about this, but have been
unable to find it
Does anyone know if there is a configuration / module that would allow us to
setup Solr to take in the (large) log files generated by our web/app
servers, so that we can query for things like peak time requ
Hmmm, what does the rest of your query look like? And does adding
&debugQuery=on show anything interesting?
Best
Erick
On Thu, Apr 29, 2010 at 6:54 AM, Jan Simon Winkelmann <
winkelm...@newsfactory.de> wrote:
> > > ((valid_from:[* TO 2010-04-29T10:34:12Z]) AND
> > > (valid_till:[2010-04-29T10:34
On 4/29/10 8:50 PM, Marco Martinez wrote:
Hi Ranveer,
If you don't specify a field type in the q parameter, the search will be
done searching in your default search field defined in the solrconfig.xml,
its your default field a text_sync field?
Regards,
Marco Martínez Bautista
http://www.paradi
Jason Rutherglen wrote:
Tom,
Interesting, can you post your findings after you've found them? :)
Jason
On Tue, Apr 27, 2010 at 2:33 PM, Burton-West, Tom wrote:
Is it possible to use the NoOpMergePolicy (
https://issues.apache.org/jira/browse/LUCENE-2331 ) from Solr?
We have very large
Hi Ranveer,
If you don't specify a field type in the q parameter, the search will be
done searching in your default search field defined in the solrconfig.xml,
its your default field a text_sync field?
Regards,
Marco Martínez Bautista
http://www.paradigmatecnologico.com
Avenida de Europa, 26. Át
Doddamani, Prakash wrote:
Thanks Jon,
Its very nice idea I dint thought about it, But I am already using order
for one more field,
"sort=field1+desc"
Can I have order for 2 fields something like
"sort=field1+desc&field5+desc"
Yes, you can:
sort=field1+desc,field5+desc
http://wiki.apache.
Thanks Jon,
Its very nice idea I dint thought about it, But I am already using order
for one more field,
"sort=field1+desc"
Can I have order for 2 fields something like
"sort=field1+desc&field5+desc"
Or is there something else I should do.
Thanks
Prakash
-Original Message-
From: Jon Ba
All that stuff happens in the JDBC driver associated w/ the DataSource so
probably not unless there is something which can be set in the Oracle driver
itself.
One thing that might have helped in this case might have been if
readFieldNames() in the JDBCDataSource dumped its return to debug log f
I'm putting on a talk at Lucene Eurocon
(http://lucene-eurocon.org/sessions-track1-day2.html#1) on "Practical
Relevance" and I'm curious as to what people put in practice for testing and
improving relevance. I have my own inclinations, but I don't want to muddy the
water just yet. So, if you
You might want to look at DateMath,
http://lucene.apache.org/solr/api/org/apache/solr/util/DateMathParser.html. I
believe the default precision is to the millisecond, so if you afford to round
to the nearest second or even minute you might see some performance gains.
-Kallin Nagelberg
-Ori
Does a "sort=field5+desc" on the query param not work?
- Jon
On Apr 29, 2010, at 9:32 AM, Doddamani, Prakash wrote:
> Hi,
>
>
>
> I am using the boost factor as below
>
>
>
> field1^20.0 field2^5 field3^2.5 field4^.5
>
>
>
>
>
> Where it searches first in field1 then field1 and
Hi,
I did a debugger session and found that the column names are case sensitive
(at least with Oracle).
The column names are retreived from the JDBC metadatas and I found that my
objectid is in fact OBJECTID.
So now, I'm able to do an update with the following config (pay attention to
the OBJECT
You should do this -
http://localhost:8080/solr/select/?q=*:*&fq=pubdate:[2010-03-25T00:00:00Z
%20TO%202010-03-25T23:59:59Z]
Ankit
-Original Message-
From: Hamid Vahedi [mailto:hvb...@yahoo.com]
Sent: Thursday, April 29, 2010 5:33 AM
To: solr-user@lucene.apache.org
Subject: Solr dat
Hi,
I am trying to configure synonym filter.
my requirement is:
when user searching by phrase like "what is solr user?" then it should
be replace with "solr user".
something like : what is solr user? => solr user
My schema for particular field is:
positionIncrementGap="100">
ignoreC
Hi,
I am using the boost factor as below
field1^20.0 field2^5 field3^2.5 field4^.5
Where it searches first in field1 then field1 and so on
Is there a way, where I can make some documents very low priority so
that they come at the end?
Scenario :
aaa
bbb
dd
Hi All,
I am using JTeam's Spatial Plugin RC3 to perform spatial searches on my index
and it works great. However, I can't seem to get it to return the computed
distances.
My query component is run before the geoDistanceComponent and the distanceField
is set to "distance"
Fields for lat/long a
hey..
try the fq parameter !?
...&fq=(title:A country:USA)
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-in-solr-search-tp765028p765171.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi,
It looks like the deltaImportQuery needs to be changed you are using
dataimporter.delta.id which is not correct, you are selecting objected in
the deltaQuery, so the deltaImportQuery should be using
dataimporter.delta.objectid
So try this:
Colin.
> -Original Message-
> From: safl
Hi Andrew,
Today, authentication is handled by the container (e.g. Tomcat, Jetty etc.).
There's a thread I found to be very useful on this topic here:
http://www.lucidimagination.com/search/document/d1e338dc452db2e4/how_can_i_protect_the_solr_cores
This was for Jetty, but the idea is pretty mu
Hi Koji,
thanks for reply.
where should I use the KeywordTokenizerFactory in string or in text
field.
I am wondering that KeywordTokenizerFactory will work or not in
textfield. Actually as I understood about the KeywordTokenizerFactory
that : KeywordTokenizerFactory is tokenize the keyword.
Just wondering if anyone had any further thoughts on how I might do this?
On 26 April 2010 19:18, Oliver Beattie wrote:
> Hi Grant,
>
> Thanks for getting back to me. Yes, indeed, #1 is exactly what I'm looking
> for. Results are already ranked by distance (among other things), but we
> need the
> I am wondering that KeywordTokenizerFactory will work or
> not in textfield. Actually as I understood about the
> KeywordTokenizerFactory that : KeywordTokenizerFactory is
> tokenize the keyword.
> for example : 'solr user' will tokenize to 'solr' and
> 'user' because solr and user are keyword..
> > ((valid_from:[* TO 2010-04-29T10:34:12Z]) AND
> > (valid_till:[2010-04-29T10:34:12Z TO *])) OR ((*:*
> > -valid_from:[* TO *]) AND (*:* -valid_till:[* TO *])))
> >
> > I use the empty checks for datasets which do not have a
> > valid from/till range.
> >
> >
> > Is there any way to get this any
> I am currently having serious performance problems with
> date range queries. What I am doing, is validating a
> datasets published status by a valid_from and a valid_till
> date field.
>
> I did get a performance boost of ~ 100% by switching from a
> normal solr.DateField to a solr.TrieDateFie
On 4/29/10 3:45 PM, Koji Sekiguchi wrote:
Ranveer Kumar wrote:
Hi,
I require to configure synonym to exact match.
The field I need to search is string type. I tried to configure by
the text
but in text, due to whitespace tokenizer exact match not found.
My requirement is :
suppose user search
Ranveer Kumar wrote:
Hi,
I require to configure synonym to exact match.
The field I need to search is string type. I tried to configure by the text
but in text, due to whitespace tokenizer exact match not found.
My requirement is :
suppose user search by "solr user" and exact "solr user" (or equ
khirb7 wrote:
Hello every body,
In our application we are dealing with music. In our index we are storing
music tracks (3 million documents). We have popularity field which inside
the track document, this field contains the number of times the track
have been listened.
The issue is that we
Hello every body,
In our application we are dealing with music. In our index we are storing
music tracks (3 million documents). We have popularity field which inside
the track document, this field contains the number of times the track
have been listened.
The issue is that we are forced to r
we use dismax and highlighting works fine.
the only thing we had to add to the query-url was
&hl.fl=FIELD1,FIELD2
so we had to specify which fields should be used for highlighting.
> -Ursprüngliche Nachricht-
> Von: fabritw [mailto:fabr...@gmail.com]
> Gesendet: Mittwoch, 28. A
yes, that's totally fine.
On Apr 28, 2010, at 7:14 PM, Thomas Nguyen wrote:
Is there anything wrong with wrapping the text content of all fields
with CDATA whether they be analyzed, not analyzed, indexed, not
indexed
and etc.? I have a script that creates update XML documents and it's
just
I index some data include date in solr
but when search for specific date, i get some record (not all record)
include some record in next day for example:
http://localhost:8080/solr/select/?q=pubdate:[2010-03-25T00:00:00Z >TO
>2010-03-25T23:59:59Z]&start=0&rows=10&indent=on&sort=pubdate
> desc
i
Hi,
I require to configure synonym to exact match.
The field I need to search is string type. I tried to configure by the text
but in text, due to whitespace tokenizer exact match not found.
My requirement is :
suppose user search by "solr user" and exact "solr user" (or equivalant
synonym) are av
Hi, thanks for your help, I figued it out myself I guess.
All parts of an fq are always intersected, so it has no effect to put
a boolean operator inside a fq like in
fq=+tags:(Gucci) OR -tags:(watch sunglasses)
(would be a mildly strange query anyway)
The order in which the intersections are m
Hi, thanks for your help, I figued it out myself I guess.
All parts of an fq are always intersected, so it has no effect to put
a boolean operator inside a fq like in
fq=+tags:(Gucci) OR -tags:(watch sunglasses)
(would be a mildly strange query anyway)
The order in which the intersections are m
Hi,
I am currently having serious performance problems with date range queries.
What I am doing, is validating a datasets published status by a valid_from and
a valid_till date field.
I did get a performance boost of ~ 100% by switching from a normal
solr.DateField to a solr.TrieDateField with
Hi
I'm planning on adding some protection to our solr servers and would
like to know what others are doing in this area.
Basically I have a few solr cores running under tomcat6 and all use DH
to populate the solr index. This is all behind a firewall and only
accessible from certain IP addresses.
74 matches
Mail list logo