Re: Limiting facets for huge data - setting indexed=false in schema.xml

2009-07-31 Thread Yao Ge
Having a large number of fields is not the same as having a large number of facets. To facets are something you would display to users as aid for query refinement or navigation. There is no way for a user to use 3700 facets at the same time. So it more of question on how to determine what facets t

Re: Item Facet

2009-08-07 Thread Yao Ge
Are your product_name* fields numeric fields (integer or float)? Dals wrote: > > Hi... > > Is there any way to group values like shopping.yahoo.com or > shopper.cnet.com do? > > For instance, I have documents like: > > doc1 - product_name1 - value1 > doc2 - product_name1 - value2 > doc3 - p

Re: Google Side-By-Side UI

2009-10-02 Thread Yao Ge
Yes. I think would be very helpful tool for tunning search relevancy - you can do a controlled experiment with your target audiences to understand their responses to the parameter changes. We plan to use this feature to benchmark Lucene/SOLR against our in-house commercial search engine - it will

DIH - Export to XML

2009-10-30 Thread Yao Ge
For Data Import Handler, there is a way to dump data to a SOLR feed format XML file? -- View this message in context: http://old.nabble.com/DIH---Export-to-XML-tp26138213p26138213.html Sent from the Solr - User mailing list archive at Nabble.com.

Query Boost Functions

2009-05-18 Thread Yao Ge
I have a field named "last-modified" that I like to use in bf (Boot Functions) parameter: recip(rord(last-modified),1,1000,1000) in DisMaxRequestHander. However the Solr query parser complain about the syntax of the formula. I think it is related with hyphen in the field name. I have tried to add

Re: Solr Shard - Strange results

2009-05-18 Thread Yao Ge
Maybe you want to try with docNumber field type as "string" and see it would make a difference. CB-PO wrote: > > I'm not quite sure what logs you are talking about, but in the > tomcat/logs/catalina.out logs, i found the following [note, i can't > copy/paste, so i am typing up a summary]: > >

DataImportHandler Template Transformer

2009-05-18 Thread Yao Ge
It took me a while to understand that to use the Template Transfomer (http://lucene.apache.org/solr/api/org/apache/solr/handler/dataimport/TemplateTransformer.html), all building variable names (e.g. ${e.firstName} ${e.lastName} etc). can not contain null values. I hope the parser can do a better

spell checking

2009-06-02 Thread Yao Ge
Can someone help providing a tutorial like introduction on how to get spell-checking work in Solr. It appears many steps are requires before the spell-checkering functions can be used. It also appears that a dictionary (a list of correctly spelled words) is required to setup the spell checker. Can

Re: spell checking

2009-06-02 Thread Yao Ge
dictionary or word list? Please help. Grant Ingersoll-6 wrote: > > Have you gone through: http://wiki.apache.org/solr/SpellCheckComponent > > > On Jun 2, 2009, at 8:50 AM, Yao Ge wrote: > >> >> Can someone help providing a tutorial like introduction on how to get >

Re: spell checking

2009-06-02 Thread Yao Ge
pos that I need to run spell check on. But how can these original data be used as a base for spell checking? How does Solr know what are correctly spelled words? ... ... Yao Ge wrote: > > Can someone help providing a tutorial like introduction on how to get >

Re: spell checking

2009-06-02 Thread Yao Ge
t; into so called n-grams. You can see that if you open up the SC index with > something like Luke. Please see > http://wiki.apache.org/jakarta-lucene/SpellChecker . > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original

Faceting on text fields

2009-06-04 Thread Yao Ge
I am index a database with over 1 millions rows. Two of fields contain unstructured text but size of each fields is limited (256 characters). I come up with an idea to use visualize the text fields using text cloud by turning the two text fields in facets. The weight of font and size is of each

Re: Faceting on text fields

2009-06-04 Thread Yao Ge
Yes. I am using 1.3. When is 1.4 due for release? Yonik Seeley-2 wrote: > > Are you using Solr 1.3? > You might want to try the latest 1.4 test build - faceting has changed a > lot. > > -Yonik > http://www.lucidimagination.com > > On Thu, Jun 4, 2009 at 12:01 PM

Query Filter fq with OR operator

2009-06-05 Thread Yao Ge
If I want use OR operator with mutile query filters, I can do: fq=popularity:[10 TO *] OR section:0 Is there a more effecient alternative to this? -- View this message in context: http://www.nabble.com/Query-Filter-fq-with-OR-operator-tp23895837p23895837.html Sent from the Solr - User mailing li

Re: Faceting on text fields

2009-06-09 Thread Yao Ge
over top N docs might be more attractive. Another features I can really appreciate is to provide search time n-gram term clustering. Maybe this might be better suited for "spell checker" as it just a different way to display the alternative search terms. -Yao Michael Ludwig-4 wrote: &

Re: Faceting on text fields

2009-06-10 Thread Yao Ge
> Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Yao Ge >> To: solr-user@lucene.apache.org >> Sent: Tuesday, June 9, 2009 3:46:13 PM >> Subject: Re: Faceting on text fields >>

Re: Faceting on text fields

2009-06-11 Thread Yao Ge
FYI. I did a direct integration with Carrot2 with Solrj with a separate Ajax call from UI for top 100 hits to clusters terms in the two text fields. It gots comparable performance to other facets in terms of response time. In terms of algorithms, their listed two "Lingo" and "STC" which I don't

Re: Faceting on text fields

2009-06-11 Thread Yao Ge
/download.html). Yao Ge wrote: > > FYI. I did a direct integration with Carrot2 with Solrj with a separate > Ajax call from UI for top 100 hits to clusters terms in the two text > fields. It gots comparable performance to other facets in terms of > response time. > > In terms of alg

Re: Query Filter fq with OR operator

2009-06-26 Thread Yao Ge
I will like to submit a JIRA issue for this. Can anyone help me on where to go? -Yao Otis Gospodnetic wrote: > > > Brian, > > Opening a JIRA issue if it doesn't already exist is the best way. If you > can provide a patch, even better! > > Otis > -- > Sematext -- http://sematext.com/ -- Luc

Faceting with MoreLikeThis

2009-07-06 Thread Yao Ge
Does Solr support faceting on MoreLikeThis search results? -- View this message in context: http://www.nabble.com/Faceting-with-MoreLikeThis-tp24356166p24356166.html Sent from the Solr - User mailing list archive at Nabble.com.

Filtering MoreLikeThis results

2009-07-06 Thread Yao Ge
I could not find any support from http://wiki.apache.org/solr/MoreLikeThis on how to restrict MLT results to certain subsets. I passed along a fq parameter and it is ignored. Since we can not incorporate the filters in the query itself which is used to retrieve the target for similarity comparison

Re: Filtering MoreLikeThis results

2009-07-07 Thread Yao Ge
; In additional to restricting the results, I am also looking to >> >> influence >> >> > the >> >> > scores similar to the way boost query (bq) works in the >> >> > DisMaxRequestHandler. >> >> > >> >> &

Re: Filtering MoreLikeThis results

2009-07-07 Thread Yao Ge
The answer to my owner question: ... ... would work. -Yao Yao Ge wrote: > > I am not sure about the parameters for MLT the requestHandler plugin. Can > one of you share the solrconfig.xml entry for MLT? Thanks in advance. > -Yao > > > Bill Au wrote: >

Re: Faceting with MoreLikeThis

2009-07-07 Thread Yao Ge
faceting remains the same as standard request handler. -Yao Yao Ge wrote: > > Does Solr support faceting on MoreLikeThis search results? > -- View this message in context: http://www.nabble.com/Faceting-with-MoreLikeThis-tp24356166p24380459.html Sent from the Solr - User mailing list a

Re: A big question about Solr and SolrJ range query ?

2009-07-07 Thread Yao Ge
use Solr's Filter Query parameter "fq": fq=x:[10 TO 100]&fq=y:[20 TO 300]&fl=title -Yao huenzhao wrote: > > Hi all: > > Suppose that my index have 3 fields: title, x and y. > > I know one range(10 < x < 100) can query liks this: > > http://localhost:8983/solr/select?q=x:[10 TO 100]&fl=title

Re: about defaultSearchField

2009-07-07 Thread Yao Ge
Try with fl=* or fl=*,score added to your request string. -Yao Yang Lin-2 wrote: > > Hi, > I have some problems. > For my solr progame, I want to type only the Query String and get all > field > result that includ the Query String. But now I can't get any result > without > specified field. For

Re: Solr's MLT query call doesn't work

2009-07-08 Thread Yao Ge
A couple of things, your mlt.fl value, must be part of fl. In this case, content_mlt is not included in fl. I think the fl parameter value need to be comma separated. try fl=title,author,content_mlt,score -Yao SergeyG wrote: > > Hi, > > Recently, while implementing the MoreLikeThis search, I'v

DIH delta import - last modified date

2010-01-19 Thread Yao Ge
I am struggling with the concept of delta import in DIH. According the to documentation, the delta import will automatically record the last index time stamp and make it available to use for the delta query. However in many case when the last_modified date time stamp in the database lag behind the

hl.maxAlternateFieldLength defaults in solrconfig.xml

2010-02-10 Thread Yao Ge
It appears the hl.maxAlternateFieldLength parameter default setting in solrconfig.xml does not take effect. I can only get it to work by explicitly sending the parameter via the client request. It is not big deal but it appears to be a bug. -- View this message in context: http://old.nabble.com/