SOLR newbie question: How to filter the results based on my Unique Key
Hi List,

Is it possible to filter out duplicate results using a particular field in the document?

e.g. 1 123 a...@b.com

Now if I search for email = a...@b.com I get 2 search results, but I want to return just one record because the cust_id is the same. Is this possible, or do I need to handle it in the calling application?

Thanks
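If the de-duplication does end up in the calling application, it could look roughly like the sketch below. This is only an illustration: it assumes the search results have already been parsed into a list of dictionaries, and the field values and the helper name `dedupe_by_field` are made up for the example.

```python
def dedupe_by_field(docs, field="cust_id"):
    """Keep only the first document seen for each value of `field`.

    Documents that lack the field are always kept, since there is
    nothing to compare them on.
    """
    seen = set()
    unique = []
    for doc in docs:
        key = doc.get(field)
        if key is None:
            unique.append(doc)
        elif key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique


# Hypothetical results: the first two hits share the same cust_id.
results = [
    {"id": "1", "cust_id": "123", "email": "a@example.com"},
    {"id": "2", "cust_id": "123", "email": "a@example.com"},
    {"id": "3", "cust_id": "456", "email": "b@example.com"},
]
unique_results = dedupe_by_field(results)
```

One thing to keep in mind with this approach: the hit count and paging reported by Solr still include the duplicates, so a page of 10 rows may shrink after filtering.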
Re: SOLR newbie question: How to filter the results based on my Unique Key
Hi Stephen,

Thanks for the info. I took the latest patch (collapsing-patch-to-1.3.0-dieter.patch) and applied it to the source code. Then I took the newly created jar and added it to the Solr war. But Solr is still ignoring the new config; I am still getting 2 records in my result set. Is there something that I am missing here?

TIA.

From: Stephen Weiss
To: solr-user@lucene.apache.org
Sent: Saturday, February 28, 2009 10:50:26 PM
Subject: Re: SOLR newbie question: How to filter the results based on my Unique Key

There's an experimental patch for this I've had pretty good success with:

https://issues.apache.org/jira/browse/SOLR-236

If you don't particularly need faceting support to work 100%, it's already pretty perfect. Officially I guess they want it to make it in for version 1.5?? But in the meantime it's pretty easy to implement and stable; just make sure you use the latest patch.

--
Steve

On Feb 28, 2009, at 5:45 PM, Venu Mittal wrote:

> Hi List,
>
> Is it possible to filter out the duplicate results using a particular field
> in the document.
> e.g.
> 1
> 123
> a...@b.com
>
> Now if I search for email = a...@b.com I get 2 search results but I want to
> send just one record cause my cust_id is same. Is it possible or do I need to
> handle it in the calling application.
>
> Thanks
Re: SOLR newbie question: How to filter the results based on my Unique Key
OK, so I tried out an XSLT transformation on the resulting XML, and I must say that I am very impressed with the results. I will do some more load testing tomorrow and finalize this solution.

Thanks everyone.

From: Venu Mittal
To: solr-user@lucene.apache.org
Sent: Sunday, March 1, 2009 2:03:19 AM
Subject: Re: SOLR newbie question: How to filter the results based on my Unique Key

Hi Stephen,

Thanks for the info. I took the latest patch (collapsing-patch-to-1.3.0-dieter.patch) and applied it on the source code. Then I took the newly created jar added it to SOLR war. But SOLR is still ignoring the new config. I am still getting 2 records in my resultset. Is there something that I am missing here ?

TIA.
Re: How to search the database tables using solr.
Does anybody have any stats to share on how long DataImportHandler takes to index a given set of data? I am currently indexing 18 million rows in 1.5 - 2 hours by sending XML to Solr.

From: Shalin Shekhar Mangar
To: solr-user@lucene.apache.org; cra...@ceiindia.com
Sent: Wednesday, March 4, 2009 8:15:07 AM
Subject: Re: How to search the database tables using solr.

On Wed, Mar 4, 2009 at 7:51 PM, Radha C. wrote:

> Thanks Shalin,
>
> We just stepped on solr. This information is very much useful for me. But
> before that I want some clear details about where to start..
> I want to test this in my local environment, so I need some basic
> information about how to start using this ( database and solr ). Do you have
> some information on this?

I think the easiest way to start using Solr is with the embedded Jetty container. Modify the example/conf/schema.xml file and add your own fields etc. Read through the DataImportHandler wiki page and look at the example/example-DIH directory in the Solr zip/tarball. If you have a specific doubt/question, ask on the list.

--
Regards,
Shalin Shekhar Mangar.
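For anyone wanting to compare their own DataImportHandler numbers against the figure quoted above, the 18 million rows in 1.5 - 2 hours can be turned into a rows-per-second baseline. The snippet below is just that arithmetic; the function name is made up for the example.

```python
def rows_per_second(rows, hours):
    """Average indexing throughput for `rows` indexed in `hours`."""
    return rows / (hours * 3600)


ROWS = 18_000_000
slow = rows_per_second(ROWS, 2.0)   # 2 hour run
fast = rows_per_second(ROWS, 1.5)   # 1.5 hour run
print(f"{slow:.0f} to {fast:.0f} rows/s")
```

So the XML-posting setup described here averages somewhere between roughly 2,500 and 3,300 rows per second.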
Highlighting the searched term in resultset
I was wondering if there is any way of highlighting the searched term in the result set directly, instead of having it as a separate "lst" element. Doing it through an XSL transformation would be one way. Has anybody implemented a better solution?

e.g. iPhone iphone sell buy 2007-11-20T05:36:29Z 2007-11-17T06:00:00Z ARTICLE

TIA.
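Besides XSLT, one option is to merge the highlighting section back into the documents on the client side. The sketch below assumes the response was requested as JSON (wt=json) and already parsed, with the usual layout of a "response" section holding the docs and a "highlighting" section keyed by each document's unique key. The sample data, the `id` unique key, and the helper name `merge_highlights` are assumptions for illustration.

```python
def merge_highlights(response, id_field="id"):
    """Return copies of the docs with highlighted snippets swapped in.

    For every field that has highlight fragments, the stored value is
    replaced by the fragments joined with " ... ". Fields without
    fragments are left unchanged.
    """
    highlighting = response.get("highlighting", {})
    merged = []
    for doc in response["response"]["docs"]:
        doc = dict(doc)  # copy so the original response is not modified
        snippets = highlighting.get(str(doc[id_field]), {})
        for field, fragments in snippets.items():
            if fragments:
                doc[field] = " ... ".join(fragments)
        merged.append(doc)
    return merged


# Hypothetical parsed response with one highlighted title.
sample = {
    "response": {"docs": [
        {"id": "42", "title": "iPhone", "type": "ARTICLE"},
        {"id": "43", "title": "Other", "type": "ARTICLE"},
    ]},
    "highlighting": {
        "42": {"title": ["<em>iPhone</em>"]},
        "43": {},
    },
}
merged = merge_highlights(sample)
```

Note that highlight fragments may be shorter than the full stored value, so this trades the complete field text for the snippet wherever a match was highlighted.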
Re: Date Search with q query parameter
Is your final query in this format?

col1:[2009-01-01T00:00:00Z+TO+2009-01-01T23:59:59Z]

From: dabboo
To: solr-user@lucene.apache.org
Sent: Thursday, March 12, 2009 12:27:48 AM
Subject: Date Search with q query parameter

Hi,

I am facing an issue with the date field I have in my records. I am using the q query parameter and passing a string such as "test" as the search criteria. The query that gets built from the q parameter looks like this:

column1:test | column2:test | column3:test . ...

I have one date column, whose name ends with _dt, like column4_dt. So the query that gets created is:

column1:test | column2:test | column3:test | column4_dt:test

Here it throws an exception saying "Invalid date format". Please suggest how I can prevent this.

Thanks,
Amit Garg
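To make the format above concrete, here is a small sketch that builds a one-day range clause for a date field and URL-encodes it. The `+` signs in the query shown above are simply URL-encoded spaces. The helper name `day_range_query` is made up, and `col1` is just the placeholder field name from this message.

```python
from urllib.parse import parse_qs, urlencode


def day_range_query(field, day):
    """Build a range clause covering one UTC day, e.g. day='2009-01-01'."""
    return f"{field}:[{day}T00:00:00Z TO {day}T23:59:59Z]"


q = day_range_query("col1", "2009-01-01")
# urlencode turns the spaces around TO into '+' and escapes ':' and brackets.
params = urlencode({"q": q})
print(q)
print(params)
```

A plain term like "test" cannot be parsed as a date, which is why sending it to a date field raises the "Invalid date format" error; date fields need a full timestamp or a range like the one built here.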
ExtractingRequestHandler Question
Hi,

I am using ExtractingRequestHandler to index rich text documents. The way I am doing it is: I get some data related to the document from the database and post an XML document (containing only this data) to Solr. Then I make another call to Solr, which sends the actual document to be indexed. But in doing so I am losing all the other data that is related to the document.

Is this the right way to handle it, or am I missing something?

TIA
Re: ExtractingRequestHandler Question
Hi Jacob,

Thanks for the reply. I am still trying to nail down this problem with the best possible solution. Yeah, I had thought about these 2 approaches, but both of them are going to make my indexing slower. Plus, the fact that I will have at least 5 rich text files associated with each document is not helping much either.

Anyway, I will explore and see if I can come up with anything better (maybe a separate index for rich text docs).

Thanks,
Venu

From: Jacob Singh
To: solr-user@lucene.apache.org
Sent: Saturday, April 4, 2009 9:59:13 PM
Subject: Re: ExtractingRequestHandler Question

Hi TIA,

I have the same desired requirement. If you look up in the archives, you might find a similar thread between myself and the always super helpful Erik Hatcher. Basically, it can't be done (right now).

You can however use the "ExtractOnly" request handler, and just get the extracted text back from Solr, and then use XPath to get out the attributes and add them to the XML you are sending. Not ideal, because the file has to be transferred twice.

The only other option is to send the file as per the instructions via POST with its attributes as POST fields.

Keep in mind that Solr documents are immutable, which means they cannot change. When you update a document with the same primary key, it will simply delete the existing one and add the new one.

hth,
Jacob

On Sat, Apr 4, 2009 at 5:59 AM, Venu Mittal wrote:

> Hi,
>
> I am using ExtractingRequestHandler to index rich text documents.
> The way I am doing it is I get some data related to the document from
> database and then post an xml (containing only this data ) to solr. Then I
> make another call to solr, which sends the actual document to be indexed.
> But while doing so I am loosing out all the other data that is related to the
> document.
>
> Is this the right way to do handle it or am I missing out on something.
>
> TIA
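The second option Jacob mentions, sending the file once with its attributes as request fields, might be sketched as below. This only builds the request URL; the file itself would go in the POST body. It assumes the handler is mapped at /update/extract and that extra fields are passed with a `literal.` prefix. Parameter names have changed between Solr versions, so check the documentation for the version you run. The base URL, field names, and the helper name `extract_url` are made up for the example.

```python
from urllib.parse import urlencode


def extract_url(base_url, doc_id, metadata, commit=False):
    """Build an extract-handler URL carrying database fields as literals.

    Assumption: fields are passed as literal.<fieldname>=<value>, so the
    database data and the rich document end up in one Solr document.
    """
    params = {"literal.id": doc_id}
    for name, value in metadata.items():
        params[f"literal.{name}"] = value
    if commit:
        params["commit"] = "true"
    return f"{base_url}/update/extract?{urlencode(params)}"


# Hypothetical document id and database field.
url = extract_url("http://localhost:8983/solr", "doc1", {"author": "Venu"})
print(url)
```

Because a later update with the same key replaces the whole document, sending everything in this single request avoids the data loss described in the original question.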
Re: ExtractingRequestHandler Question
Hi,

Wondering if somebody could help me understand the following behavior: if I search on a text field with the query "davi cla", it does not yield any search results; however, if I search for "davi clai", it yields 100+ results.

The field I am searching on is a text field and has the following definition in my Solr config.

Thanks in advance!
Venu