SOLR newbie question: How to filter the results based on my Unique Key

2009-02-28 Thread Venu Mittal
Hi List,

Is it possible to filter out duplicate results using a particular field in 
the document?
e.g.


 1   
123
 a...@b.com


Now if I search for email = a...@b.com I get 2 search results, but I want to 
return just one record because the cust_id is the same. Is this possible, or do I 
need to handle it in the calling application?
 
Thanks


  

Re: SOLR newbie question: How to filter the results based on my Unique Key

2009-02-28 Thread Venu Mittal
Hi Stephen,

Thanks for the info. 

I took the latest patch (collapsing-patch-to-1.3.0-dieter.patch) and applied it 
to the source code. Then I took the newly built jar and added it to the SOLR war. 
But SOLR is still ignoring the new config: I am still getting 2 records in my 
result set. Is there something I am missing here?

TIA.



From: Stephen Weiss 
To: solr-user@lucene.apache.org
Sent: Saturday, February 28, 2009 10:50:26 PM
Subject: Re: SOLR newbie question: How to filter the results based on my Unique 
Key

There's an experimental patch for this I've had pretty good success with:

https://issues.apache.org/jira/browse/SOLR-236

If you don't particularly need faceting support to work 100%, it's already 
in very good shape.  Officially, I think they want it to make it in for version 
1.5.  In the meantime it's pretty easy to apply and stable; just make 
sure you use the latest patch.

--
Steve
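
For reference, a request against a patched build might be assembled roughly as below. This is only a sketch under assumptions: the `collapse.field` parameter name is the one used in the SOLR-236 patch discussion and may differ between patch versions, and the host, port, and the `email` / `cust_id` field names are illustrative, taken from the question. As far as I know the collapsing component also has to be registered in solrconfig.xml before the parameter has any effect.

```python
from urllib.parse import urlencode

# Assumed values: host/port and field names are illustrative only.
# "collapse.field" comes from the SOLR-236 patch, not from stock Solr 1.3.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_collapsed_query(email, collapse_field="cust_id"):
    """Return a select URL asking a patched Solr to collapse on one field."""
    params = [
        ("q", 'email:"{}"'.format(email)),
        ("collapse.field", collapse_field),
    ]
    return SOLR_SELECT_URL + "?" + urlencode(params)


url = build_collapsed_query("user@example.com")
print(url)
```

If the patch is not active, Solr should simply ignore the unknown parameter and return both records, which matches the symptom of the config appearing to be "ignored".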

On Feb 28, 2009, at 5:45 PM, Venu Mittal wrote:

> Hi List,
> 
> Is it possible to filter out the duplicate results using a particular field 
> in the document.
> e.g.
> 
> 
> 1
> 123
> a...@b.com
> 
> 
> Now if I search for email = a...@b.com I get 2 search results but I want to 
> send just one record cause my cust_id is same. Is it possible or do I need to 
> handle it in the calling application.
> 
> Thanks
> 
> 


  

Re: SOLR newbie question: How to filter the results based on my Unique Key

2009-02-28 Thread Venu Mittal
OK, so I tried an XSLT transformation on the resulting XML, and I must say that 
I am very impressed with the results. I will do some more load testing tomorrow 
and finalize this solution.

Thanks everyone.
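
The XSLT itself was not posted, so the following is not that stylesheet. It is a minimal Python sketch of the same idea, dropping duplicate records from the response after the fact. The field names (`id`, `cust_id`, `email`) and the sample response are assumptions based on the original question.

```python
import xml.etree.ElementTree as ET


def dedupe_docs(response_xml, key_field="cust_id"):
    """Return <doc> elements, keeping only the first one per key_field value."""
    root = ET.fromstring(response_xml)
    seen = set()
    unique = []
    for doc in root.iter("doc"):
        # Solr's XML writer emits each field as a typed child element
        # carrying the field name in its "name" attribute.
        key = None
        for field in doc:
            if field.get("name") == key_field:
                key = field.text
                break
        # Docs without the key field are kept; duplicates of a seen key are dropped.
        if key is None or key not in seen:
            unique.append(doc)
        if key is not None:
            seen.add(key)
    return unique


# Made-up sample response with two records sharing one cust_id.
SAMPLE = """<response><result name="response" numFound="2" start="0">
<doc><str name="id">1</str><str name="cust_id">123</str><str name="email">user@example.com</str></doc>
<doc><str name="id">2</str><str name="cust_id">123</str><str name="email">user@example.com</str></doc>
</result></response>"""

docs = dedupe_docs(SAMPLE)
print(len(docs))  # prints 1
```

One caveat with any post-processing approach: `numFound` and paging still reflect the pre-deduplication result set, so a page may come back with fewer rows than requested.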





From: Venu Mittal 
To: solr-user@lucene.apache.org
Sent: Sunday, March 1, 2009 2:03:19 AM
Subject: Re: SOLR newbie question: How to filter the results based on my Unique 
Key

Hi Stephen,

Thanks for the info. 

I took the latest patch (collapsing-patch-to-1.3.0-dieter.patch) and applied it 
to the source code. Then I took the newly built jar and added it to the SOLR war. 
But SOLR is still ignoring the new config: I am still getting 2 records in my 
result set. Is there something I am missing here?

TIA.




  

Re: How to search the database tables using solr.

2009-03-04 Thread Venu Mittal
Does anybody have any stats to share on how long DataImportHandler 
takes to index a given set of data?

I am currently indexing 18 million rows in 1.5 - 2 hours by sending XML to 
Solr. 
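
For comparison with other numbers, the figures above work out to roughly 2,500 to 3,300 rows per second. A quick check of that arithmetic:

```python
ROWS = 18_000_000  # rows indexed, from the message above


def rows_per_second(rows, hours):
    """Average indexing throughput for a run of the given length."""
    return rows / (hours * 3600)


fast = rows_per_second(ROWS, 1.5)
slow = rows_per_second(ROWS, 2.0)
print(round(slow), round(fast))  # prints 2500 3333
```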




From: Shalin Shekhar Mangar 
To: solr-user@lucene.apache.org; cra...@ceiindia.com
Sent: Wednesday, March 4, 2009 8:15:07 AM
Subject: Re: How to search the database tables using solr.

On Wed, Mar 4, 2009 at 7:51 PM, Radha C.  wrote:

> Thanks Shalin,
>
> We just stepped on solr. This information is very much useful for me. But
> before that I want some clear details about where to start..
> I want to test this in my local environment, so I need some basic
> information about how to start using this ( database and solr ). Do you
> have
> some information on this?
>

I think the easiest way to start using Solr is with the embedded Jetty
container. Modify the example/conf/schema.xml file and add your own fields
etc. Read through the DataImportHandler wiki page and look at the
example/example-DIH directory in the Solr zip/tarball.
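
Once a DataImportHandler is configured, imports are driven by plain HTTP requests. The sketch below only builds the request URLs. It assumes Solr is running on the example Jetty port and that the handler is registered at `/dataimport`; the actual path depends on your solrconfig.xml.

```python
from urllib.parse import urlencode

# Assumptions: example Jetty port, handler registered at /dataimport.
DIH_URL = "http://localhost:8983/solr/dataimport"


def dih_command_url(command, **options):
    """Build the URL for a DataImportHandler command such as full-import."""
    params = {"command": command}
    params.update(options)
    return DIH_URL + "?" + urlencode(params)


print(dih_command_url("full-import"))  # start a full import
print(dih_command_url("status"))       # poll progress
```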

If you have a specific doubt/question, ask on the list.

-- 
Regards,
Shalin Shekhar Mangar.



  

Highlighting the searched term in resultset

2009-03-12 Thread Venu Mittal
I was wondering if there is any way of highlighting the searched term in the 
result set directly, instead of getting it as a separate "lst" element. 
Doing it through an XSL transformation would be one way. 
Has anybody implemented a better solution?

e.g

 
  iPhone
  iphone sell buy
  2007-11-20T05:36:29Z
  2007-11-17T06:00:00Z
  ARTICLE
 



TIA.
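
One alternative to XSL is to merge the highlighting section back into the documents on the client. The sketch below assumes the response was requested as JSON and already parsed into a dict, that the unique key field is named `id`, and that the highlighting section is keyed by that unique key. The sample data is made up.

```python
def merge_highlights(solr_response, unique_key="id"):
    """Return copies of the docs with highlighted snippets substituted in."""
    highlighting = solr_response.get("highlighting", {})
    merged = []
    for doc in solr_response["response"]["docs"]:
        doc = dict(doc)  # shallow copy so the original response is untouched
        snippets = highlighting.get(str(doc.get(unique_key)), {})
        for field, fragments in snippets.items():
            if fragments:
                # Join multiple fragments; a single fragment is used as-is.
                doc[field] = " ... ".join(fragments)
        merged.append(doc)
    return merged


# Made-up sample in the shape of a parsed JSON response.
sample = {
    "response": {"docs": [{"id": "1", "title": "iPhone", "type": "ARTICLE"}]},
    "highlighting": {"1": {"title": ["<em>iPhone</em>"]}},
}
print(merge_highlights(sample)[0]["title"])  # prints <em>iPhone</em>
```

Note that highlighter output is normally a fragment rather than the whole field value, so a substitution like this is only a faithful replacement when the fragment covers the full field (for example, with hl.fragsize=0).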



  

Re: Date Search with q query parameter

2009-03-12 Thread Venu Mittal
Is your final query in this format ?

col1:[2009-01-01T00:00:00Z+TO+2009-01-01T23:59:59Z]
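
The `+` signs in the query above are URL-encoded spaces; in the raw query syntax the range reads `[start TO end]`. A small sketch of building and encoding such a query (the helper name is hypothetical; the field name `col1` is taken from the example above):

```python
from urllib.parse import urlencode


def date_range_query(field, start, end):
    """Build a Solr range query; both dates are given in ISO 8601 UTC form."""
    return "{}:[{} TO {}]".format(field, start, end)


q = date_range_query("col1", "2009-01-01T00:00:00Z", "2009-01-01T23:59:59Z")
encoded = urlencode({"q": q})
print(q)        # raw query, with spaces around TO
print(encoded)  # URL-encoded form, spaces become "+"
```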




From: dabboo 
To: solr-user@lucene.apache.org
Sent: Thursday, March 12, 2009 12:27:48 AM
Subject: Date Search with q query parameter


Hi,

I am facing an issue with a date field in my records.

e.g. I am using the q query parameter and passing a string such as "test" as
the search criteria. With the q parameter, the query is formed as:

column1:test | column2:test | column3:test . ...

I have one date column, which is suffixed with _dt, like
column4_dt. Now, when it creates a query like 

column1:test | column2:test | column3:test | column4_dt:test 

it throws an exception saying "Invalid date format".

Please suggest how I can prevent this.

Thanks,
Amit Garg

-- 
View this message in context: 
http://www.nabble.com/Date-Search-with-q-query-parameter-tp22471072p22471072.html
Sent from the Solr - User mailing list archive at Nabble.com.


  

ExtractingRequestHandler Question

2009-04-03 Thread Venu Mittal
Hi,

I am using ExtractingRequestHandler to index rich text documents. 
The way I am doing it is: I get some data related to the document from the 
database and post an XML document (containing only this data) to Solr. Then I 
make another call to Solr, which sends the actual document to be indexed. 
But in doing so I am losing all the other data related to the 
document. 

Is this the right way to handle it, or am I missing something?

TIA



  

Re: ExtractingRequestHandler Question

2009-04-06 Thread Venu Mittal
Hi Jacob,

Thanks for the reply. I am still trying to nail down this problem with the best 
possible solution.
Yeah, I had thought about these 2 approaches, but both of them are going to make 
my indexing slower.  Plus, the fact that I will have at least 5 rich text files 
associated with each document is not helping much either.

Anyway, I will explore and see if I can come up with anything better (maybe a 
separate index for rich text docs).

Thanks,
Venu




From: Jacob Singh 
To: solr-user@lucene.apache.org
Sent: Saturday, April 4, 2009 9:59:13 PM
Subject: Re: ExtractingRequestHandler Question

Hi TIA,

I have the same desired requirement.  If you look up in the archives,
you might find a similar thread between myself and the always super
helpful Erik Hatcher.  Basically, it can't be done (right now).

You can however use the "extractOnly" option of the request handler to get
the extracted text back from Solr, then use XPath to pull out the
attributes and add them to the XML you are sending.

Not ideal because the file has to be transferred twice.

The only other option is to send the file as per the instructions via
POST with its attributes as POST fields.

Keep in mind that Solr documents are immutable, which means they
cannot change.  When you update a document with the same primary key,
it will simply delete the existing one and add the new one.
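
The second option, sending the attributes along with the file in a single request, might look like the sketch below. It only builds the request URL; the file itself would go in the POST body. The `literal.<field>` parameter naming follows the released ExtractingRequestHandler and development snapshots may use different names, and the host, handler path, and field names here are assumptions.

```python
from urllib.parse import urlencode

# Assumptions: example host/port and the handler mapped at /update/extract.
EXTRACT_URL = "http://localhost:8983/solr/update/extract"


def extract_url(doc_id, metadata, commit=False):
    """Build an extract URL that carries database fields as literal.* params."""
    params = [("literal.id", doc_id)]
    for name, value in sorted(metadata.items()):
        params.append(("literal." + name, value))
    if commit:
        params.append(("commit", "true"))
    return EXTRACT_URL + "?" + urlencode(params)


# Hypothetical metadata fetched from the database for one document.
url = extract_url("doc1", {"author": "Jane Doe", "category": "contracts"})
print(url)
```

Because the metadata and the file content arrive in one add, nothing is overwritten by a later request with the same key.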

hth,
Jacob

On Sat, Apr 4, 2009 at 5:59 AM, Venu Mittal  wrote:
> Hi,
>
> I am using ExtractingRequestHandler to index  rich text documents.
> The way I am doing it is I get some data related to the document from 
> database and then post an xml  (containing only this data ) to solr. Then I 
> make another call to solr, which sends the actual document to be indexed.
> But while doing so I am loosing out all the other data that is related to the 
> document.
>
> Is this the right way to do handle it or am I missing out on something.
>
> TIA
>
>
>
>



-- 

+1 510 277-0891 (o)
+91  33 7458 (m)

web: http://pajamadesign.com

Skype: pajamadesign
Yahoo: jacobsingh
AIM: jacobsingh
gTalk: jacobsi...@gmail.com



  

Re: ExtractingRequestHandler Question

2009-05-10 Thread Venu Mittal
Hi,

Wondering if somebody could help me understand the following behavior:

If I search on a text field with the query "davi cla", it does not 
yield any search results; however, if I search for "davi clai", it yields 
100+ results.

The field I am searching on is a text field and has the following definition in 
my Solr config.

   
  



 



  


Thanks in advance !

Venu