RE: No search result behavior (a la Amazon)

2008-11-19 Thread Nguyen, Joe
mazon) I understand how to do the "100% mm" part. It's the behavior when there are no matches that I'm asking about :) Nguyen, Joe-2 wrote: > > Have a look at DisMaxRequestHandler and play with mm (minimum terms > should match) > > http://wiki.a

RE: No search result behavior (a la Amazon)

2008-11-19 Thread Nguyen, Joe
Have a look at DisMaxRequestHandler and play with mm (minimum terms should match) http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=%28CategorySolrRequestHandler%29%7C%28%28CategorySolrRequestHandler%29%29#head-6c5fe41d68f3910ed544311435393f5727408e61 -Original Message- From:
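For readers unfamiliar with the mm parameter, here is a minimal sketch of a dismax request that requires every query term to match. The base URL, the sample query text, and the helper name dismax_query_url are placeholders chosen for illustration; only qt=dismax and mm come from the thread.

```python
from urllib.parse import urlencode, parse_qs

def dismax_query_url(base_url, user_query, mm="100%"):
    """Build a dismax select URL with a minimum-should-match setting.

    base_url and user_query are illustrative placeholders.
    """
    params = {
        "qt": "dismax",   # route the request to the dismax handler
        "q": user_query,  # raw user query text
        "mm": mm,         # "100%" means every term must match
    }
    # urlencode percent-escapes the "%" in "100%" so it survives transport.
    return base_url + "?" + urlencode(params)

url = dismax_query_url("http://localhost:8080/solr/mysite/select",
                       "red leather sofa")
print(url)
```

Lowering mm (for example to "50%") relaxes the match requirement, which is one way to experiment with what happens when the strict query returns nothing.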

RE: Question about autocommit

2008-11-19 Thread Nguyen, Joe
so. in a couple of hours the speed of indexing choked down from over 200 to under 100 documents per second - and all the same I had several autocommits a second. so I restarted with a limit at 8000. with the results I mentioned in the previous email. Nguyen, Joe wrote: > As far as I know, com

RE: Question about autocommit

2008-11-19 Thread Nguyen, Joe
closing). The Solr autocommit controls when Solr asks IndexWriter to commit what it's done so far. Nguyen, Joe wrote: > Could trigger the commit in this case? > > -Original Message- > From: Nickolai Toupikov [mailto:[EMAIL PROTECTED] > Sent: Wednesday, November 19, 2008 8:36

RE: Question about autocommit

2008-11-19 Thread Nguyen, Joe
Could trigger the commit in this case? -Original Message- From: Nickolai Toupikov [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 19, 2008 8:36 Joe To: solr-user@lucene.apache.org Subject: Question about autocommit Hello, I would like some details on the autocommit mechanism. I tr

RE: Query Response Doc Score -> Int Value

2008-11-18 Thread Nguyen, Joe
You don't need to hack the code, since you can effectively treat the scores 2.3518934 and 2.2173865 as if they were equal (ignoring the digits after the decimal point). Score = original score (2.3518934) + function(date_created). You can scale the value of function(date_created) so that digits af
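The message is cut off, so the following is only one possible reading of the idea, sketched outside Solr: drop the fractional part of the original score and let a date function scaled into [0, 1) occupy the decimal digits. The document ids, dates, the third score, and the helper name combined_scores are made up for illustration.

```python
import math
from datetime import date

def combined_scores(docs):
    """Return {doc_id: floor(score) + recency}, with recency scaled to [0, 1).

    docs maps doc_id -> (score, date_created). Assumed reading of the thread:
    the integer part of the score ranks first, the date breaks ties.
    """
    oldest = min(created for _, created in docs.values())
    newest = max(created for _, created in docs.values())
    span = (newest - oldest).days + 1  # +1 keeps recency strictly below 1
    result = {}
    for doc_id, (score, created) in docs.items():
        recency = (created - oldest).days / span
        result[doc_id] = math.floor(score) + recency
    return result

docs = {
    "A": (2.3518934, date(2008, 1, 1)),
    "B": (2.2173865, date(2008, 11, 1)),
    "C": (3.01, date(2007, 1, 1)),
}
scores = combined_scores(docs)
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)
```

Here A and B share the integer part 2, so the newer document B ranks above A, while C still wins on its higher integer score.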

RE: Updating schema.xml without deleting index?

2008-11-17 Thread Nguyen, Joe
Don't know whether this would work... Just speculating :-) A. You'll need to create a new schema with the new field, or you could use a dynamic field in your current schema (assuming you have already configured the default value to 0). B. Add a couple of new documents C. Run optimize script. Since optimize wi

RE: abt Multicore

2008-11-17 Thread Nguyen, Joe
Any suggestions? -Original Message- From: Nguyen, Joe Sent: Monday, November 17, 2008 9:40 Joe To: 'solr-user@lucene.apache.org' Subject: RE: abt Multicore "Are all the documents in the same search space? That is, for a given query, could any of the 10MM docs be retu

RE: abt Multicore

2008-11-17 Thread Nguyen, Joe
"Are all the documents in the same search space? That is, for a given query, could any of the 10MM docs be returned? If so, I don't think you need to worry about multicore. You may however need to put part of the index on various machines: http://wiki.apache.org/solr/DistributedSearch " I also

RE: Query Performance while updating the index

2008-11-12 Thread Nguyen, Joe
Another way to handle this is not to run the commit script at peak time (still pull snapshots periodically). Keep track of the number of requests, resource utilization, etc. If the number of requests exceeds the threshold, don't commit. Also, how many segments do you see under the index dir? High numb
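The "don't commit above a threshold" rule can be sketched as a small sliding-window counter. This is an illustration only: the class name CommitGate, the window length, and the threshold are assumptions, and the caller is expected to invoke the real commit script itself when should_commit returns True.

```python
from collections import deque

class CommitGate:
    """Allow a commit only when recent query traffic is at or below a threshold."""

    def __init__(self, max_requests, window_seconds):
        self.max_requests = max_requests      # hypothetical peak-load threshold
        self.window_seconds = window_seconds  # how far back to count requests
        self._stamps = deque()                # timestamps of recent requests

    def record_request(self, now):
        """Call once per incoming search request."""
        self._stamps.append(now)

    def should_commit(self, now):
        """Drop timestamps outside the window, then compare to the threshold."""
        cutoff = now - self.window_seconds
        while self._stamps and self._stamps[0] <= cutoff:
            self._stamps.popleft()
        return len(self._stamps) <= self.max_requests

gate = CommitGate(max_requests=2, window_seconds=10)
for t in (0, 1, 2):
    gate.record_request(t)
busy = gate.should_commit(5)       # three requests in the window
quiet = gate.should_commit(11.5)   # only the request at t=2 remains
print(busy, quiet)
```

Timestamps are passed in explicitly so the logic can be tested without a clock; in practice they would come from something like time.monotonic().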

RE: Query Performance while updating the index

2008-11-12 Thread Nguyen, Joe
How about creating a new core, indexing the data, then swapping the cores? The old core is still available to handle queries until the new core replaces it. -Original Message- From: Lance Norskog [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 12, 2008 11:16 Joe To: solr-user@lucene.apache.org Subject: RE
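A rough sketch of the final step, building the CoreAdmin SWAP request once the new core has been indexed. The host, the admin path, and the core names "live" and "rebuild" are assumptions for illustration; the admin path in particular depends on how the multicore setup is configured.

```python
from urllib.parse import urlencode, parse_qs

def swap_cores_url(admin_url, serving_core, rebuilt_core):
    """Build a CoreAdmin SWAP request that exchanges two cores' names.

    After the swap, queries sent to serving_core's name hit the rebuilt index.
    """
    params = {
        "action": "SWAP",
        "core": serving_core,
        "other": rebuilt_core,
    }
    return admin_url + "?" + urlencode(params)

# Hypothetical admin endpoint and core names.
swap_url = swap_cores_url("http://localhost:8080/solr/admin/cores",
                          "live", "rebuild")
print(swap_url)
```

The URL would be requested only after indexing and committing on the rebuilt core have finished, so searchers never see a half-built index.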

RE: FW: Score customization

2008-11-12 Thread Nguyen, Joe
You could use a function query with standardRequestHandler to influence the final score and sort the results by score. If you want to control how much the function query affects the original score, you could use the linear function. -Original Message- From: lajkonik86 [mailto:[EMAIL PROTECT

RE: Synonyms impacting the performance

2008-11-12 Thread Nguyen, Joe
Could you elaborate further? 20 synonyms would translate to 20 booleanQueries. Are you saying each booleanQuery requires a disk access? -Original Message- From: Walter Underwood [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 12, 2008 7:46 Joe To: solr-user@lucene.apache.org Sub

RE: Handling proper names

2008-11-07 Thread Nguyen, Joe
Use synonyms. Add these lines to your ../conf/synonym.txt: Stephen,Steven,Steve Bobby,Bob,Robert ... -Original Message- From: news [mailto:[EMAIL PROTECTED] On Behalf Of Jon Drukman Sent: Friday, November 07, 2008 3:19 Joe To: solr-user@lucene.apache.org Subject: Handling proper names Is
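To show what such comma-separated lines express, here is a simplified model in which every name in a group is treated as equivalent to the others. It is not Solr's synonym filter, just a sketch of the lookup it enables; the function name and the lowercasing are assumptions.

```python
def load_synonym_groups(lines):
    """Map each lowercased term to the full set of terms in its group."""
    table = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        group = {term.strip().lower() for term in line.split(",") if term.strip()}
        for term in group:
            table[term] = group
    return table

synonyms = load_synonym_groups([
    "Stephen,Steven,Steve",
    "Bobby,Bob,Robert",
])
print(sorted(synonyms["steve"]))
```

With this table, a search for "Bob" can be expanded to also match documents that mention "Bobby" or "Robert".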

Bias score proximity for a given field

2008-11-05 Thread Nguyen, Joe
Hi, is there a way to specify range boosting for a numeric/date field? Suppose I have articles whose published dates are in 2005,...,2008,...,2011. I want to boost the score of a 2008 article by 20%. Articles whose published dates are 3 years away from 2008 would be boosted by 0%, e.g. 2
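The requested behavior can be written down as a small function. The question fixes only two points (20% at 2008, 0% at a distance of 3 years); the linear falloff in between and the names proximity_boost and boosted_score are assumptions made for this sketch, which runs outside Solr.

```python
def proximity_boost(year, center=2008, max_boost=0.20, radius=3):
    """Boost fraction that falls linearly from max_boost at center to 0 at radius."""
    distance = abs(year - center)
    if distance >= radius:
        return 0.0
    return max_boost * (1 - distance / radius)

def boosted_score(score, year):
    """Apply the proximity boost multiplicatively to an original score."""
    return score * (1 + proximity_boost(year))

for y in (2005, 2007, 2008, 2011):
    print(y, round(proximity_boost(y), 4))
```

A 2008 article keeps the full 20% boost, 2007 and 2009 articles get about 13.3%, and anything from 2005 or earlier, or 2011 or later, gets none.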

RE: Query integer type

2008-10-28 Thread Nguyen, Joe
Never mind. I misused the syntax. :-) -Original Message- From: Nguyen, Joe [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 28, 2008 7:00 Joe To: solr-user@lucene.apache.org Subject: Query integer type SITE is defined as integer. I wanted to select all documents whose SITE=3002, but

Query integer type

2008-10-28 Thread Nguyen, Joe
SITE is defined as integer. I wanted to select all documents whose SITE=3002, but the SITE values in the response were different. http://localhost:8080/solr/mysite/select?indent=on&qt=standard&fl=SITE&fq:SITE:3002 http://localhost:8080/solr/mysite/select?indent=on&qt=dismax&fl=SITE&fq:SITE:3002 http:/
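The URLs above pass the filter as fq:SITE:3002, which is not a name=value pair, so the filter is presumably never applied. Assuming that is the syntax slip the follow-up message refers to, a filter query would normally be passed as fq=SITE:3002. The sketch below builds such a URL; the q=*:* main query is an assumption, since the original URLs carry no q parameter.

```python
from urllib.parse import urlencode, parse_qs

def filtered_select_url(base_url, field, value):
    """Build a select URL that restricts results with a filter query."""
    params = {
        "indent": "on",
        "qt": "standard",
        "q": "*:*",                # assumed match-all main query
        "fl": field,               # return only this field
        "fq": f"{field}:{value}",  # key=value form: fq=SITE:3002
    }
    return base_url + "?" + urlencode(params)

site_url = filtered_select_url("http://localhost:8080/solr/mysite/select",
                               "SITE", 3002)
print(site_url)
```

Building the query string with urlencode also takes care of escaping the colon and asterisk, which avoids hand-typed separator mistakes.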

RE: Changing field datatype

2008-10-28 Thread Nguyen, Joe
impact query time? -Original Message- From: Shalin Shekhar Mangar [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 28, 2008 1:33 Joe To: solr-user@lucene.apache.org Subject: Re: Changing field datatype On Wed, Oct 29, 2008 at 1:55 AM, Nguyen, Joe <[EMAIL PROTECTED]> wrote: > &

Changing field datatype

2008-10-28 Thread Nguyen, Joe
I have a Solr core with 2 million lengthy documents. 1. If I change the datatype of a field 'foo' from string to sint and restart the server, what would happen to the existing documents? And to documents added with the new schema? At query time (sort=foo desc), should I expect the documents sorte

multicore admin interface

2008-10-23 Thread Nguyen, Joe
Hi, I have two cores. When each core references the same dataDir, I can access the core admin interface. However, when core1's dataDir references one directory and core2's another, I cannot access the admin interface. Any ideas? //each core references a different dir //both cor