RE: Is Solr right for my business situation ?

2010-09-27 Thread Jonathan Rochkind
"Staging" the data in a non-Solr store sounds like a potentially reasonable idea to me. You might want to consider a NoSQL store of some kind like MongoDB perhaps, instead of an rdbms. The way to think about Solr is not as a store or a database -- it's an index for serving your application. Th

RE: Is Solr right for my business situation ?

2010-09-27 Thread Sharma, Raghvendra
Thanks for the responses people. @Grant 1. can you show me some direction on that.. loading data from an incoming stream.. do I need some third party tools, or need to build something myself... 4. I am basically attempting to build a very fast search interface for the existing data. The volu

Re: Solr UIMA integration

2010-09-27 Thread Tommaso Teofili
Hi Maheshkumar, I attached a patch for inclusion of this project as a Solr contrib module [1] , there you can find the patch to apply to the Solr trunk along with needed jars (attached as a zip archive). I think that your issue could be related to the fact that GC project dependency is from Solr 1.

Search Interface

2010-09-27 Thread Claudio Devecchi
Hi everybody, I`m implementing my first solr engine for conceptual tests, I`m crawling my wiki intranet to make some searches, the engine is working fine already, but I need some interface to make my searchs. Somebody knows where can I find some search interface just for customizations? Tks -- C

Re: FieldType for storing date

2010-09-27 Thread Chris Hostetter
: I was wondering what would be the best FieldType for storing date with a : millisecond precision that would allow me to sort and run range queries : against this field. We would like to achieve the best query performance, : minimal heap - fieldcache - requirements, good indexing throughput an

Re: Grouping in solr ?

2010-09-27 Thread Chris Hostetter
: References: : : In-Reply-To: : : Subject: Grouping in solr ? http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you chang

Re: Renaming Solr mbean

2010-09-27 Thread Chris Hostetter
: In our setup, we run several instances of Solr under one instance of : Tomcat. I simply rename the WAR to soemthing we use internally - : solr-people, solr-connections, solr-companies, etc etc. This part works : fine and lets us have, use, and maintain invidual instances. ... : What

Re: DIH ConcurrentModificationException

2010-09-27 Thread Reuben A Christie
is this fixed in solr-1.4.1 ? I have seen ConcurrentModificationException during search operation using EmbeddedSolrServer, when tested using jmeter with on more than one Concurrent users. best, Reuben On 5/5/2009 2:25 AM, Shalin Shekhar Mangar wr

Re: Is Solr right for our project?

2010-09-27 Thread Jan Høydahl / Cominvent
Solr will match this in version 3.1 which is the next major release. Read this page: http://wiki.apache.org/solr/SolrCloud for feature descriptions Coming to a trunk near you - see https://issues.apache.org/jira/browse/SOLR-1873 -- Jan Høydahl, search solution architect Cominvent AS - www.cominven

Re: Need help with spellcheck city name

2010-09-27 Thread Savannah Beckett
No, I checked, there is a city called Swan in Iowa.  So, it is getting from the city index, so is Clerk.  But why does it favor Swan than San?  Spellcheck get weird after I treat city name as one token.  If I do it in the old way, it let San go, and correct Jos as Ojos instead of Jose because Oj

Re: Need help with spellcheck city name

2010-09-27 Thread Erick Erickson
Hmmm, did you rebuild your spelling index after the config changes? And it really looks like somehow you're getting results from a field other than city. Are you also sure that your cityname field is of type autocomplete1? Shooting in the dark here, but these results are so weird that I suspect i

Re: Need help with spellcheck city name

2010-09-27 Thread Savannah Beckett
No, it doesn't work, I got weird result. I set my city name field to be parsed as a token as following:                                 I got following result for spellcheck: -     -                       1  

Re: Need help with spellcheck city name

2010-09-27 Thread Tom Hill
Maybe process the city name as a single token? On Mon, Sep 27, 2010 at 3:25 PM, Savannah Beckett wrote: > Hi, >   I have city name as a text field, and I want to do spellcheck on it.  I use > setting in http://wiki.apache.org/solr/SpellCheckComponent > > If I setup city name as text field and do

Need help with spellcheck city name

2010-09-27 Thread Savannah Beckett
Hi,   I have city name as a text field, and I want to do spellcheck on it.  I use setting in http://wiki.apache.org/solr/SpellCheckComponent If I setup city name as text field and do spell check on "San Jos" for San Jose, I get suggestion for Jos as "ojos".  I checked the extendedresult and I f

DIH XML Entity Help (Newbie)

2010-09-27 Thread audev
I am trying to configure the data-config.xml using the XPathEntityProcessor to index nested xml entities such as the following: Drug fentanyl sublingual spray Other questionnaire administration The data-config.xml looks like this: but it only indexes the first

Re: Question Related to sorting on Date

2010-09-27 Thread Peter Sturge
Hi Ahson, You'll really want to store an additional date field (make it a TrieDateField type) that has only the date, and in the reverse order from how you've shown it. You can still keep the one you've got, just use it only for 'human viewing' rather than sorting. Something like: 20080205 if you

Re: Is Solr right for my business situation ?

2010-09-27 Thread PeterKerk
Ah, totally looked over that news: spatial search in 3.x! :-D :-D Any idea already when this will be released? Awesome to hear that it has been moved forward! :) -- View this message in context: http://lucene.472066.n3.nabble.com/Is-Solr-right-for-our-project-tp1589927p1592448.html Sent from

Re: Is Solr right for my business situation ?

2010-09-27 Thread Dennis Gearon
Wow, that is a relief! I was going to have to look at ElasticSearch instead. Dennis Gearon Signature Warning EARTH has a Right To Life, otherwise we all die. Read 'Hot, Flat, and Crowded' Laugh at http://www.yert.com/film.php --- On Mon, 9/27/10, Grant Ingersoll wrote: >

resources for relevancy score tuning

2010-09-27 Thread Luke Crouch
Can someone share some good resources (books, articles, links, etc.) for tuning relevancy scores with multiple factors? I'm playing with different fields and boosts in my 'qf', 'pf', and 'bf' defaults but I feel like I'm shooting in the dark. http://wiki.apache.org/solr/SolrRelevancyCookbook has a

Re: Is Solr right for my business situation ?

2010-09-27 Thread PeterKerk
@Walter Underwood: Walter Underwood wrote: > > Right now, I think the only way to get spatial search in Solr is to deploy > a nightly snapshot from the active development on trunk. > Could you give me the link to this trunk, I need it very much! Thanks! -- View this message in context: http

Re: Is Solr right for my business situation ?

2010-09-27 Thread Jonathan Rochkind
Right, I know, I was curious about it's current closeness to being in main distro, not a patch. Among other things, when those who know better decide it goes in core distro, that makes me more comfortable that they've decided it works acceptably, and also makes more more comfortable that it wi

Re: Is Solr right for my business situation ?

2010-09-27 Thread Ravi Julapalli
Hi Jonathan, Field collpasing is available in 1.4 by applying patch https://issues.apache.org/jira/browse/SOLR-236 -Ravi From: Jonathan Rochkind To: "solr-user@lucene.apache.org" Sent: Mon, September 27, 2010 9:18:20 PM Subject: Re: Is Solr right for my bus

Re: Is Solr right for my business situation ?

2010-09-27 Thread Jonathan Rochkind
Grant Ingersoll wrote: There is now group-by capabilities in trunk as well, which may or may not help. Really, the field collapsing stuff has been committed to trunk finally? Or are you talking about something else? If it's the field collapsing stuff, and it's been committed to trunk, doe

Re: Is Solr right for my business situation ?

2010-09-27 Thread Grant Ingersoll
Inline. On Sep 27, 2010, at 1:26 PM, Walter Underwood wrote: > When do you need to deploy? > > As I understand it, the spatial search in Solr is being rewritten and is > slated for Solr 4.0, the release after next. It will be in 3.x, the next release > > The existing spatial search has some

Question Related to sorting on Date

2010-09-27 Thread Ahson Iqbal
hi all I have a question related to sorting of date field i have Date field that is indexed like a string and look like "5/2/2008 4:33:30 PM" i want to do sorting on this field on the basis of date, time does not matters. any suggestion how i could ignore the time part from this field and

Re: The search response time is too loong

2010-09-27 Thread Simon Willnauer
2010/9/27 newsam : > I have setup a SOLR searcher instance with Tomcat 5.5.21. However, the > response time is too long. Here is my scenario: > 1. The index file is 8.2G. The doc num is 6110745. > 2. DELL Server: Intel(R) Xeon(TM) CPU (4 cores) 3.00GHZ, 6G Mem. > > I used "Key:*" to query all reco

RE: bi-grams for common terms - any analyzers do that?

2010-09-27 Thread Burton-West, Tom
Hi Yonik, >>If the new "autoGeneratePhraseQueries" is off, position doesn't matter, and >>the query will >>be treated as "index" OR "reader". Just wanted to make sure, in Solr does autoGeneratePhraseQueries = "off" treat the query with the *default* query operator as set in SolrConfig rather t

RE: bi-grams for common terms - any analyzers do that?

2010-09-27 Thread Burton-West, Tom
Hi Jonathan, >> I'm afraid I'm having trouble understanding "if the analyzer returns more >> than one position back from a "queryparser token" >>I'm not sure if "the queryparser forms a phrase query without explicit phrase >>quotes" is a problem for me, I had no idea it happened until now, ne

Re: Is Solr right for my business situation ?

2010-09-27 Thread Walter Underwood
When do you need to deploy? As I understand it, the spatial search in Solr is being rewritten and is slated for Solr 4.0, the release after next. The existing spatial search has some serious problems and is deprecated. Right now, I think the only way to get spatial search in Solr is to deploy a

Is Solr right for my business situation ?

2010-09-27 Thread Sharma, Raghvendra
I am sure these kind of questions keep coming to you guys, but I want to raise the same question in a different context...my own business situation. I am very very new to solr and though I have tried to read through the documentation, I have nowhere near completing the whole read. The need is li

Is Solr right for our project?

2010-09-27 Thread Mike Thomsen
(I apologize in advance if I missed something in your documentation, but I've read through the Wiki on the subject of distributed searches and didn't find anything conclusive) We are currently evaluating Solr and Autonomy. Solr is attractive due to its open source background, following and price.

Re: urgent SOLR query server request hangs

2010-09-27 Thread Yonik Seeley
On Mon, Sep 27, 2010 at 11:09 AM, Bharat Jain wrote: >   We are running into issues with SOLR queries. Our solr queries just hang. Are you perhaps using distributed search and accidentally set up an infinite loop? Do *not* configure a default "shards" param on your /select handler. Other than th

urgent SOLR query server request hangs

2010-09-27 Thread Bharat Jain
Hi, We are running into issues with SOLR queries. Our solr queries just hang. We are using SOLR 1.3 and below is the stack trace from threaddump. We are clueless about what can be causing this issue. We are in the midst of firefighting with our customer and any help is appreciated. Thanks,Bharat

Re: Re:The search response time is too loong

2010-09-27 Thread Timothy Potter
Also, how many rows are you requesting at one time? I've seen cases where the query time is blazing fast and the response writing is terribly slow because of too many documents being sent in the response. On Mon, Sep 27, 2010 at 6:37 AM, kenf_nc wrote: > > "mem usage is over 400M", do you mean T

Re: Re:The search response time is too loong

2010-09-27 Thread kenf_nc
"mem usage is over 400M", do you mean Tomcat mem size? If you don't give your cache sizes enough room to grow you will choke the performance. You should adjust your Tomcat settings to let the cache grow to at least 1GB or better would be 2GB. You may also want to look into http://wiki.apache.org/

Re: Concurrent DB updates and delta import misses few records

2010-09-27 Thread Shawn Heisey
You could get it from Solr, yes. That didn't even occur to me because when I was designing my scripts, I didn't yet have a fully integrated Solr index. :) With hindsight, I still wouldn't get it from Solr. I would lose some flexibility and ease of administration. It's certainly possible to

Multi-lingual auto-complete?

2010-09-27 Thread Andy
I want to provide auto-complete to users when they're inputting tags. The auto-complete tag suggestions would be based on tags that are already in the system. Multiple tags are separated by commas. A single tag could contain multiple words such as "Apple computer". One issue is that a tag coul

Re: Solr UIMA integration

2010-09-27 Thread maheshkumar
Hi Tommaso, All UIMA dependencies (uima-core,AlchemyAPIAnnotator, OpenCalaisAnnotator, Tagger, WhitespaceTokenizer) are 2.3.1-SNAPSHOT. All are checkout from svn AlchemyAPIAnnotator: http://svn.apache.org/repos/asf/uima/sandbox/trunk/AlchemyAPIAnnotator OpenCalaisAnnotator: http://svn.apache.org

Re: TokenFilter that removes payload ?

2010-09-27 Thread Teruhiko Kurosaka
Robert & Erik, I appreciate your suggestions but we use Type for other purpose. Also, the product is out and we can't change the design so easily. So it seems the conclusion there is no such TokenFilter. I'll write one. Thanks. On Sep 27, 2010, at 1:00 PM, Robert Muir wrote: > On Sun, Sep 26, 2

RE: spellcheck on multiple fields?

2010-09-27 Thread Markus Jelsma
You can use copyField to get multiple fields in the field you use for spell checking, don't forget to set it to multiValued.   -Original message- From: Savannah Beckett Sent: Mon 27-09-2010 10:08 To: solr-user@lucene.apache.org; Subject: spellcheck on multiple fields? Is it possible to

Re:The search response time is too loong

2010-09-27 Thread newsam
We used SOLR 1.4. All queries were excuted in SOLR back-end. I guess that I/O operations consume the time too much. >From: "newsam" >Reply-To: solr-user@lucene.apache.org"newsam" >To: solr-user@lucene.apache.org >Subject: Re:The search response time is too loong >Date: Mon, 27 Sep 2010 16:05:49

The search response time is too loong

2010-09-27 Thread newsam
I have setup a SOLR searcher instance with Tomcat 5.5.21. However, the response time is too long. Here is my scenario: 1. The index file is 8.2G. The doc num is 6110745. 2. DELL Server: Intel(R) Xeon(TM) CPU (4 cores) 3.00GHZ, 6G Mem. I used "Key:*" to query all records by localhost:8080. The res

spellcheck on multiple fields?

2010-09-27 Thread Savannah Beckett
Is it possible to do spellcheck on multiple fields in my solr index?  If so, how?  The following setup works for only one field:       default   solr.IndexBasedSpellChecker   myfield   ./spellchecker1   0.5   true     Thanks.