Re: Hindi language support in solr

2010-01-22 Thread Ranveer kumar
Hi Robert, Thanks for the reply. As you wrote, I used "textgen" but am still not able to search Hindi text. I might be missing some important configuration. Following is my schema.xml configuration

Re: determine which value produced a hit in multivalued field type

2010-01-22 Thread Erik Hatcher
Wait, let's be clear here Lance... multivalued fields are indeed guaranteed to return in the order they were indexed. However, you can't really tell, using this example, which instance of that field a query for "dog" matches. You maybe kinda can using highlighting though? E

RE: SOLR Performance Tuning: Fuzzy Searches, Distance, BK-Tree

2010-01-22 Thread Fuad Efendi
http://issues.apache.org/jira/browse/LUCENE-2230 Enjoy! > -Original Message- > From: Fuad Efendi [mailto:f...@efendi.ca] > Sent: January-19-10 11:32 PM > To: solr-user@lucene.apache.org > Subject: SOLR Performance Tuning: Fuzzy Searches, Distance, BK-Tree > > Hi, > > > I am wondering:

RE: Solr vs. Compass

2010-01-22 Thread Fuad Efendi
Of course, I understand what "transaction" means; have you guys been thinking some about what may happen if we transfer $123.45 from one banking account to another banking account, and MySQL forgets to index "decimal" during transaction, or DBA was weird and forgot to create an index? Absolutely

RE: Solr vs. Compass

2010-01-22 Thread Fuad Efendi
Yes, "transactional", I tried it: do we really need "transactional"? Even if "commit" takes 20 minutes? It's their "selling point" nothing more. HBase is not transactional, and it has specific use case; each tool has specific use case... in some cases Compass is the best! Also, note that Compass

Re: determine which value produced a hit in multivalued field type

2010-01-22 Thread Lance Norskog
No, the return order is not guaranteed. On Fri, Jan 22, 2010 at 2:52 PM, Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS] wrote: > Hi, > If I have a multiValued field type of text, and I put values > [cat,dog,green,blue] in it.  Is there a way to tell when I execute a query > against that field for d

determine which value produced a hit in multivalued field type

2010-01-22 Thread Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS]
Hi, If I have a multiValued field type of text, and I put values [cat,dog,green,blue] in it. Is there a way to tell when I execute a query against that field for dog, that it was in the 1st element position for that multiValued field? Thanks! Tim

Re: Solr under tomcat - UTF-8 issue

2010-01-22 Thread Frank Wesemann
Glock, Thomas wrote: My flex client httpservice by default only sets the content-type request header to "application/x-www-form-urlencoded"; what it needed to do for tomcat is set the content-type request header to content-type = "application/x-www-form-urlencoded; charset=UTF-8"; As s
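The header change described in this message can be sketched from the client side. This is a minimal Python sketch, not the Flex code from the thread; the URL, the function name and the sample query text are assumptions made for illustration only. It builds the request without sending it.

```python
from urllib.parse import urlencode, parse_qs
from urllib.request import Request

# Hypothetical Solr URL; host, port and path are assumptions.
UTF8_DEMO_URL = "http://localhost:8080/solr/select"

def build_utf8_form_request(params):
    """Build (but do not send) a form POST that declares UTF-8 explicitly."""
    # Percent-encode the parameters as UTF-8 bytes; the result is pure ASCII.
    body = urlencode(params, encoding="utf-8").encode("ascii")
    # Declare the charset so the servlet container decodes the body as UTF-8.
    headers = {"Content-Type": "application/x-www-form-urlencoded; charset=UTF-8"}
    return Request(UTF8_DEMO_URL, data=body, headers=headers, method="POST")

utf8_request = build_utf8_form_request({"q": "Jérôme"})
```

Without the charset parameter, the container is free to fall back to its default encoding when decoding the body, which is the failure mode the message describes.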

CoreAdmin CREATE instancedir recommendation needed

2010-01-22 Thread Joe Kessel
I have been using Solr 1.4 on Tomcat and had been making use of the current working directory unknowingly. I programmatically create cores as needed. I create a core via the CoreAdmin by passing in the core name ( CORE_01 ), instancedir ( CORE_01 ), config ( solr/conf/solrconfig.xml ), and sc
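A CoreAdmin CREATE call of the kind described here is an HTTP request with the values passed as parameters. The following Python sketch only assembles such a URL; the base URL and the function name are assumptions, and the schema parameter is left out because its value is cut off in the message above.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical CoreAdmin endpoint; host, port and webapp path are assumptions.
CORE_ADMIN_URL = "http://localhost:8080/solr/admin/cores"

def core_create_url(name, instance_dir, config):
    """Assemble a CoreAdmin CREATE URL from the values named in the message."""
    params = {
        "action": "CREATE",
        "name": name,
        "instanceDir": instance_dir,  # relative paths depend on how Solr resolves them
        "config": config,
    }
    return CORE_ADMIN_URL + "?" + urlencode(params)

create_url = core_create_url("CORE_01", "CORE_01", "solr/conf/solrconfig.xml")
```

Passing an absolute path for the instance directory is one way to avoid depending on the current working directory, though whether that suits this setup is not something the message settles.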

Re: MoreLikeThis - How to pass in external text?

2010-01-22 Thread ldung
David, thank you. This was what I was looking for! David Stuart-6 wrote: > > The MoreLikeThisHandler allows external text to be streamed to it see > http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams. The > url feature is quite good if you have a lot of text and start hitting

Re: solr with tomcat in cluster mode

2010-01-22 Thread Matt Mitchell
Hey Otis, We're indexing on a separate machine because we want to keep our production nodes away from processes like indexing. The indexing server also has a ton of resources available, more so than the production nodes. We set it up as an indexing server at one point and have decided to stick wit

Re: solr with tomcat in cluster mode

2010-01-22 Thread Otis Gospodnetic
This should work fine. But why are you indexing to a separate index/core? Why not index in the very same index you are searching? Slaves won't see changes until their searchers re-open. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Matt

Re: performance issue

2010-01-22 Thread Otis Gospodnetic
Can you run 32-bit Java there? Will use less memory! :) Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Matthieu Labour > To: solr-user@lucene.apache.org > Sent: Fri, January 22, 2010 11:07:45 AM > Subject: Re: performance issue > > Hi >

RE: Is there limit on size of query string?

2010-01-22 Thread Ankit Bhatnagar
Try using POST Ankit -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Friday, January 22, 2010 12:22 PM To: solr-user@lucene.apache.org Subject: Re: Is there limit on size of query string? On Fri, Jan 22, 2010 at 11:48 AM, Fuad Efend

RE: commit fails on weblogic

2010-01-22 Thread Joe Kessel
Within the weblogic console I have unchecked the Enable Keepalives and have been able to get past this error on commit, but it now fails on optimize. Using TCPMon it was noticed that multiple requests were on the same connection, including the commit. As I've read that Solr runs fine on Webl

Re: Is there limit on size of query string?

2010-01-22 Thread Jérôme Charron
Hi, I recently had the same problem. The solution was to increase the maxHttpHeaderSize in the Tomcat server.xml file, and everything works fine now. Best regards Jérôme On Fri, Jan 22, 2010 at 19:33, Jennifer Luo wrote: > From my test, the query doesn't have a limit. The maximum query length I > u

RE: Is there limit on size of query string?

2010-01-22 Thread Jennifer Luo
From my test, the query doesn't have a limit. The maximum query length I used is more than 1M. However, if you use HTTP GET, there is a length limit on the query string. You can try to use HTTP POST, using Content-Type application/x-www-form-urlencoded. Jennifer Luo > -Original Message- > From:
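The POST suggestion can be illustrated with a short Python sketch that moves the query out of the URL and into the request body. The URL, the function name and the generated query are assumptions for illustration; the request is built but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical Solr select URL; host, port and path are assumptions.
POST_DEMO_URL = "http://localhost:8983/solr/select"

def build_post_query(query):
    """Put the query in a form-encoded POST body instead of the URL."""
    body = urlencode({"q": query}).encode("ascii")
    headers = {"Content-Type": "application/x-www-form-urlencoded"}
    return Request(POST_DEMO_URL, data=body, headers=headers, method="POST")

# An artificially long query, well past the ~400 characters mentioned in the thread.
long_query = " OR ".join("id:%d" % i for i in range(1000))
post_request = build_post_query(long_query)
```

The URL itself stays short however long the query grows, so limits on the request line or header size no longer apply to the query text.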

RE: How to store a dense field value efficiently

2010-01-22 Thread Dallan Quass
Sorry - I meant indexed. I don't store the fields. --dallan > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Friday, January 22, 2010 9:30 AM > To: solr-user@lucene.apache.org > Subject: Re: How to store a dense field value efficiently > > Oops, th

Re: solr blocking on commit

2010-01-22 Thread Erick Erickson
Wait, wait, wait. What about near-real-time? See: http://issues.apache.org/jira/browse/SOLR-1606 (Integrate near real time) I don't think this is production-ready yet, or if you're willing to run patched code, but it sure seems like you want to

How to Index multiple tables using SOLR

2010-01-22 Thread JavaGuy84
Hi, Even though I am new to SOLR I was able to successfully index a single table in a very short span of time. Now we have a requirement where the search needs to happen on multiple tables (multiple table indexes) at the same time. I couldn't figure out a way to index more than one table in SOLR an

understanding termVector output

2010-01-22 Thread Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS]
Hi, I'm trying to see if I can use termVectors for a use case I have. Essentially what I want to know is: where in the indexed value does the query hit occur? I think either tv.positions or tv.offsets would provide that info but I don't really grok the result. Below I've pasted the URL and part of

CoreContainer / getCore and create ?

2010-01-22 Thread Matthieu Labour
Hi Would it make sense to modify/ add a method to CoreContainer that creates a core if the core doesn't exist ? something like public SolrCore getCore(String name) { synchronized(cores) { SolrCore core = cores.get(name); if (core != null) core.open(); // increment the re

Re: Is there limit on size of query string?

2010-01-22 Thread Yonik Seeley
On Fri, Jan 22, 2010 at 11:48 AM, Fuad Efendi wrote: > Is there limit on size of query string? > > Looks like I have exceptions when query string is higher than 400 characters > (average) This is most likely going to be a servlet container issue. Are you using the bundled Jetty or something diffe

Is there limit on size of query string?

2010-01-22 Thread Fuad Efendi
Is there a limit on the size of the query string? It looks like I get exceptions when the query string is longer than 400 characters (on average). Thanks!

Re: ArrayIndexOutOfBoundsException when highlighting (Solr 1.4)

2010-01-22 Thread Tim Underwood
Issue created: https://issues.apache.org/jira/browse/SOLR-1731 On Fri, Jan 22, 2010 at 5:42 AM, Koji Sekiguchi wrote: > Tim Underwood wrote: > >> I'm seeing a java.lang.ArrayIndexOutOfBoundsException when trying to >> highlight for certain queries. The error seems to be an issue with the >>

Re: performance issue

2010-01-22 Thread Grant Ingersoll
You likely are running into problems by having 1000 cores and simply don't have enough memory to handle that much. You should look at what you are filtering, sorting, faceting on and see if there are ways you can tune these. Take a look at your cache configurations as well. What are your inde

Re: performance issue

2010-01-22 Thread Matthieu Labour
Hi, Thank you for your response. Which version of solr? I inherited the project so not exactly sure ... in CHANGES.txt it says Apache Solr Version 1.4-dev $Id: CHANGES.txt 793090 2009-07-10 19:40:33Z yonik $ What garbage collection parameters? ulimit -n 10 ; nohup java -server -XX:+UseConcMarkS

Re: solr blocking on commit

2010-01-22 Thread Jerome L Quinn
Otis Gospodnetic wrote on 01/22/2010 12:20:45 AM: > I'm missing the bigger context of this thread here, but from the > snippet below - sure, commits cause in-memory index to get written > to disk, that causes some IO, and that *could* affect search *if* > queries are running on the same box. Wh

Re: Improvising solr queries

2010-01-22 Thread Erick Erickson
Take a look at the Wiki, here's a bit to start... http://lucene.apache.org/solr/features.html The short form is that when an index is first opened, there are various caches that are initialized. The first few queries that run against a new searcher are

Re: How to store a dense field value efficiently

2010-01-22 Thread Erick Erickson
Oops, that's a Lucene bit (got confused which list I was on). You can still control storing the raw text in SOLR, so my question is still relevant, but the solution may be different. Do you store the fields? Erick On Fri, Jan 22, 2010 at 10:27 AM, Erick Erickson wrote: > I'm surprised by a 30%

Re: How to store a dense field value efficiently

2010-01-22 Thread Erick Erickson
I'm surprised by a 30% increase. The approach of adding a special token for "not present" is one of the standard ones. So just to check, when you say "stored", are you really storing the missing value? As in Field.Store.YES? As opposed to Field.Index.###? Because there's no need to Store this va

request handler defaults

2010-01-22 Thread Lukas Kahwe Smith
Hi, I have noticed that at the moment there doesn't seem to be a way to inherit request handler definitions. It would be nice to be able to define some basic request handlers (maybe even with the option of defining them "abstract") which could be extended. For example I have one dismax request handler fo

Re: solr with tomcat in cluster mode

2010-01-22 Thread Matt Mitchell
We have a similar setup and I'd be curious to see how folks are doing this as well. Our setup: A few servers and an F5 load balancer. Each Solr instance points to a shared index. We use a separate server for indexing. When the index is complete, we do some juggling using the Core Admin SWAP functi

solr with tomcat in cluster mode

2010-01-22 Thread ZAROGKIKAS,GIORGOS
Hi, I'm using Solr 1.4 with Tomcat on a single PC and I want to move it to cluster mode with 2 nodes and load balancing, but I can't find information on how to do that. Is there any manual or a recorded procedure on the internet for this? Or is there anyone who can help me?

Re: ArrayIndexOutOfBoundsException when highlighting (Solr 1.4)

2010-01-22 Thread Koji Sekiguchi
Tim Underwood wrote: I'm seeing a java.lang.ArrayIndexOutOfBoundsException when trying to highlight for certain queries. The error seems to be an issue with the combination of the ShingleFilterFactory, PositionFilterFactory and the LengthFilterFactory. Here's my fieldType definition:

Re: What is this error means?

2010-01-22 Thread Oleg Burlaca
Hello, I've just installed Solr1.4 on a CentOS machine and wanted to index 300-500 documents from a PHP script. I run the script from a WindowsXP host. Sporadically I'm getting the same error as Ellery Leung: Communication Error' in C:\httpd_docs\www\lib\Apache\Solr\Service.php:385 Stack trace:

RE: Solr vs. Compass

2010-01-22 Thread Minutello, Nick
I would tend to agree. -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: 22 January 2010 05:18 To: solr-user@lucene.apache.org Subject: Re: Solr vs. Compass Hi Ken, Based on this, Solr sounds like the way to go. Otis -- Sematext -- http://sematext.c

RE: Solr vs. Compass

2010-01-22 Thread Minutello, Nick
Agree with everything you said. -Original Message- From: Uri Boness [mailto:ubon...@gmail.com] Sent: 22 January 2010 01:25 To: solr-user@lucene.apache.org Subject: Re: Solr vs. Compass > > There seems to be an implication that compass won't scale as well as solr - and I'm not sure that

Re: Find newly added documents

2010-01-22 Thread Erik Hatcher
You can do a search, sort by the special _docid_ "field" (underscores mandatory) descending and the top documents listed will be the latest added. Like this, un-url-encoded: q=*:*&sort=_docid_ desc Erik On Jan 22, 2010, at 3:39 AM, Sandeep Tagore wrote: Thanks a lot Erik. Is t
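The query above is shown un-URL-encoded; a client has to encode it before sending. A minimal Python sketch of that encoding step follows. The function name is hypothetical, and the rows parameter is an addition not mentioned in the message, included only to cap how many of the newest documents come back.

```python
from urllib.parse import urlencode, parse_qs

def newest_first_query(rows=10):
    """URL-encode a match-all query sorted by the special _docid_ field, descending."""
    return urlencode({"q": "*:*", "sort": "_docid_ desc", "rows": rows})

query_string = newest_first_query()
# Decoding the string recovers the original, un-encoded parameters.
decoded = parse_qs(query_string)
```

The encoded string is what gets appended after the `?` of the select URL.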

Re: Find newly added documents

2010-01-22 Thread Sandeep Tagore
Thanks a lot Erik. Is there any other alternate way? Thanks a lot for your response. Regards, Sandeep You'll be able to find them only after a commit. One way to do this is index a timestamp with every document, and find the latest ones using that field. There's an example of an automatic

Re: MoreLikeThis - How to pass in external text?

2010-01-22 Thread David Stuart
The MoreLikeThisHandler allows external text to be streamed to it; see http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams. The url feature is quite good if you have a lot of text and start hitting the character limit in the url. Regards, Dave On 22 Jan 2010, at 05:24, Otis Go