Re: How to apply Semantic Search in Solr

2014-03-09 Thread Sohan Kalsariya
Thanks Sujit and all for your views about semantic search in solr. But How do i proceed towards, i mean how do i start off the things to get on track ? On Sat, Mar 8, 2014 at 10:50 PM, Sujit Pal wrote: > Thanks for sharing this link Sohan, its an interesting approach. Since you > have effectiv

Does Solr/Lucene collapse Exception of multiple threads?

2014-03-09 Thread kevinma
I have observed slowness of my Solr instance co-occurred with thread stack overflow error, both happened at about the same time: 2014-03-09 03:14:43,166 ERROR SolrDispatchFilter|null:java.lang.RuntimeException: java.lang.StackOverflowError at org.apache.solr.servlet.SolrDispatchFilter.send

Re: Optimizing RAM

2014-03-09 Thread abhishek . netjain
Hi, If I go with copy field than will it increase I/O load considering I have RAM less than one third of total index size? Thanks  Abhishek   Original Message   From: Erick Erickson Sent: Monday, 10 March 2014 01:37 To: solr-user@lucene.apache.org Reply To: solr-user@lucene.apache.org Subject: R

Re: Which Tokenizer to use at searching

2014-03-09 Thread abhishek . netjain
Hi, I meant that while searching A AND B should return result individually and when together with a AND.  I want "A B" should not give result. Though A,B is indexed with StandardTokenizer.  Thanks  Abhishek   Original Message   From: Furkan KAMACI Sent: Monday, 10 March 2014 06:11 To: solr-use

RE: SolrCloud setup guidance

2014-03-09 Thread Susheel Kumar
Not sure how fast your index will grow but first you may still want to consider upgrading the single machine to 128 GB to see how the performance is coming. Current memory 7 GB is really low. After that you may want to add another node to partition the index into 2 nodes/shards (assuming you ha

Re: Which Tokenizer to use at searching

2014-03-09 Thread Furkan KAMACI
Hi; What do you mean at here: "While indexing A,B A and B should give result " Thanks; Furkan KAMACI 2014-03-09 22:36 GMT+02:00 : > Hi > Oops my bad. I actually meant > While indexing A,B > A and B should give result but > "A B" should not give result. > > Also I will look at analyser. > > Th

Re: Querying XML

2014-03-09 Thread Alexandre Rafalovitch
Are you using or have you looked at http://luxdb.org/ ? Might be relevant. Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't se

Querying XML

2014-03-09 Thread cju
I have some XML stored in Solr that looks like this: What I’d like to do is get a hit for this document if there’s an element in the XML where (@name=”Bob” AND @city=”Cincinnati”). If I write something like this, I’ll get a hit – although I’m not actually matching what I’m loo

Filter query not working for time range

2014-03-09 Thread Darniz
Hello i have a field in solr i need to get all documents whose cdStartDAte is null and also if the cdStartDate is present then it should be from * TO NOW i added the fq condition ((*:* AND -(cdStartDate:[* TO *])) OR (cdStartDate:[* TO NOW])) the issue its working fine except its o

Re: Which Tokenizer to use at searching

2014-03-09 Thread abhishek . netjain
‎Hi Oops my bad. I actually meant While indexing A,B  A and B should ‎give result but  "A B" should not give result. Also I will look at analyser. Thanks  Abhishek   Original Message   From: Erick Erickson Sent: Monday, 10 March 2014 01:38 To: abhishek jain Subject: Re: Which Tokenizer to use at

Re: Optimizing RAM

2014-03-09 Thread Erick Erickson
I'd go for a copyField, keep the stemmed and unstemmed version in the same index. An alternative (and I think there's a JIRA for this if not an outright patch) is implement a "special" filter that, say, puts the original tken in with a special character, say $ at the end, i.e. if indexing "running

Re: Which Tokenizer to use at searching

2014-03-09 Thread abhishek jain
Hi Erick, Thanks for replying, I want to index A,B (with or without space with comma) as separate words and also want to return results when A and B searched individually and also "A,B" . Please let me know your views. Let me know if i still havent explained correctly. I will try again. Thanks a

Optimizing RAM

2014-03-09 Thread abhishek jain
hi friends, I want to index some good amount of data, i want to keep both stemmed and unstemmed versions , I am confused should i keep two separate indexes or keep one index with two versions or column , i mean col1_stemmed and col2_unstemmed. I have multicore with multi shard configuration. My se

Re: Which Tokenizer to use at searching

2014-03-09 Thread Erick Erickson
You've contradicted yourself, so it's hard to say. Or I'm mis-reading your messages. bq: During indexing i want to token on all punctuations, so i can use StandardTokenizer, but at search time i want to consider punctuations as part of text, and in your second message: bq: when i search for "A,

Re: Solr 4.7.0 - cursorMark question

2014-03-09 Thread Greg Pendlebury
That was really clear; I just had another read through of the documentation with that explanation in mind and I can see I went off the rails. Sorry for any confusion on my part, and thanks for the details. Ta, Greg On 8 March 2014 08:36, Chris Hostetter wrote: > > : Thank-you, that all sounds

Re: Which Tokenizer to use at searching

2014-03-09 Thread abhishek jain
hi, Thanks for replying promptly, an example: I want to index for A,B but when i search A AND B, it should return result, when i search for "A,B" it should return result. Also Ideally when i search for "A , B" (with space) it should return result. please advice thanks abhishek On Sun, Ma

Re: SolrCloud setup guidance

2014-03-09 Thread Priti Solanki
Furkan, 100 request second would be ideal in out situation. Regards, Priti On Sat, Mar 8, 2014 at 3:41 AM, Furkan KAMACI wrote: > Hi; > > What's your performance expectation for qps (query per second)? > > Thanks; > Furkan KAMACI > 7 Mar 2014 08:50 tarihinde "Priti Solanki" yazdı: > > > Thanks

Re: Which Tokenizer to use at searching

2014-03-09 Thread Furkan KAMACI
Hi; Firstly you have to keep in mind that if you don't index punctuation they will not be visible for search. On the other hand you can have different analyzer for index and search. You have to give more detail about your situation. What will be your tokenizer at search time, WhiteSpaceTokenizer?

Which Tokenizer to use at searching

2014-03-09 Thread abhishek jain
Hi Friends, I am concerned on Tokenizer, my scenario is: During indexing i want to token on all punctuations, so i can use StandardTokenizer, but at search time i want to consider punctuations as part of text, I dont store contents but only indexes. What should i use. Any advices ? -- Thank