Re: Paoding analyzer with solr for chinese

2012-08-08 Thread Uwe Reh
Hi Rajani, I'm not really familiar with this paoding tokenizer, but it seems a bit old. We are using the CJKBigramFilter (like in the example of Solr 4.0 alpha), which should be equivalent or even better and it works. Uwe Am 09.08.2012 06:47, schrieb Rajani Maski: Hi Al

Re: Designing an index with multiple entity types, sharing field names across entity-types.

2012-08-08 Thread santamaria2
*civilized bump* -- View this message in context: http://lucene.472066.n3.nabble.com/Designing-an-index-with-multiple-entity-types-sharing-field-names-across-entity-types-tp3999727p451.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: question(s) re lucene spatial toolkit aka LSP aka spatial4j

2012-08-08 Thread David Smiley (@MITRE.org)
Hi! Sorry for the belated response; my google alerts didn't kick in for some weird reason until you posted to the lucene dev list. solr-user wrote > > hopefully someone is using the lucene spatial toolkit aka LSP aka > spatial4j, and can answer this question > > we are using this spatial tool

Re: Paoding analyzer with solr for chinese

2012-08-08 Thread Rajani Maski
Hi All, Any reply on this? On Wed, Aug 8, 2012 at 3:23 PM, Rajani Maski wrote: > Hi All, > > As said in this blog > site that paoding > analyzer is much better for chinese text, I was trying to implement it to > get accurate results f

Re: error message in solr logs

2012-08-08 Thread Chris Hostetter
: Lately we are noticing below exception in our solr logs. This happens : sometimes once or twice a day on a few cores. the error you are seing here is a really low level HTTP communications error, below hte level of solr... : Caused by: java.io.IOException: Invalid chunk header : at :

Using SolrCloud with non string type id field?

2012-08-08 Thread Mark Miller
Just curious if anyone wants to come forward as someone using SolrCloud with a non string based unique field? That is the default, so if you did not change it, you are using the string type. We are considering a change to how we handle hashing that would be back compat for the string type, but

Re: max connections in CloudSolrServer

2012-08-08 Thread Mark Miller
On Wed, Aug 8, 2012 at 1:55 PM, Jamie Johnson wrote: > I see that in other constructors you can specify an HttpClient to be > used, but I don't see this same option for the CloudSolrServer. You can pass a LBHttpSolrServer, which you can init with an HttpClient. Or you can use getLbServer() and

Limit on SOLR Cores

2012-08-08 Thread Nitin Arora
Hi Guys, I've come across a use case where I've to keep separate indexes for multiple tanents. Data directory of each tenant should be different but SOLR server instance has same schema and configuration for all the tenants. Tenants in our case can be added dynamically. I know that I can handle e

Re: Multiple Embedded Servers Pointing to single solrhome/index

2012-08-08 Thread Lance Norskog
No, you can only have one program controlling an index. This will not work! You should use a primary/failover technique where one program does all of the indexing, and then another program is the fallback for the first indexer. On Tue, Aug 7, 2012 at 7:31 AM, Bing Hua wrote: > Thanks Lance. The u

Re: /solr/admin/stats.jsp null pointer exception

2012-08-08 Thread Chris Hostetter
: New install of Solr 3.6.1, getting a Null Pointer Exception when trying to : access admin/stats.jsp: : org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228) : at : org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) : Caused by: ja

Re: Configuration for distributed search

2012-08-08 Thread Chris Hostetter
: This command to each shard returns one document from each shard. : curl 'http://localhost:8983/solr/select?debugQuery=true&indent=true&q=conway : curl 'http://localhost:7574/solr/select?debugQuery=true&indent=true&q=conway : : This distributed search command returns 0 documents: What do those

Re: Syntax for parameter substitution in function queries?

2012-08-08 Thread Timothy Hill
Thanks very much; that does indeed work as I'd hoped/expected. On 7 August 2012 17:12, Yonik Seeley wrote: > On Tue, Aug 7, 2012 at 3:01 PM, Timothy Hill wrote: >> Hello, all ... >> >> According to >> http://wiki.apache.org/solr/FunctionQuery/#What_is_a_Function.3F, >> it is possible under Solr

Re: search on default field returns less documents

2012-08-08 Thread Jack Krupansky
Default search field handling changed in Solr 3.6. Which release of Solr are you using? In Solr 3.6, the "df" request parameter in your query request handler overrides the deprecated defaultSearchField. The out of the box default for "df" is "text", which should match your schema, but... bette

Re: Does Solr support 'Value Search'?

2012-08-08 Thread Bing Hua
Not quite understand but I'd explain the problem I had. The response would contain only fields and a list of field values that match the query. Essentially it's querying for field values rather than documents. The underlying use case would be, when typing in a quick search box, the drill down menu

Re: Setting "df" (default field) from solrj?

2012-08-08 Thread homernabble
Perfect. Thanks! Jack Krupansky-2 wrote > > You should simply set the default value for the "df" request parameter in > your Solr request handlers in solrconfig.xml. It is set to "text" out of > the > box, but you can set it to your desired field. > > If you still want to set/override "df" f

Re: No search result is returned using Solr 4

2012-08-08 Thread in.abdul
Hi Engy, Can you able to get the result for q=*.* ? And if so can you check where query is parsing properly by add a param debug=true . Syed Abdul kather send from Samsung S3 On Aug 8, 2012 4:30 PM, "engy.morsy [via Lucene]" < ml-node+s472066n3999741...@n3.nabble.com> wrote: > Hi, > > I downloa

Re: Does Solr support 'Value Search'?

2012-08-08 Thread Mikhail Khludnev
Ok. It seems to me you can configure http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactoryfor index-time to produce "admin" term from all your docs above, after that you'll be able to match by simple term query. Is it what are you looking for? On Wed, Aug 8, 2

/solr/admin/stats.jsp null pointer exception

2012-08-08 Thread Jon Drukman
New install of Solr 3.6.1, getting a Null Pointer Exception when trying to access admin/stats.jsp: 2012-08-08T17:55:09 138509624 694 org.apache.solr.servlet.SolrDispatchFilter SEVERE org.apache.solr.common.SolrException log 25 org.apache.jasper.JasperException: java.lang.Nu

max connections in CloudSolrServer

2012-08-08 Thread Jamie Johnson
I see that in other constructors you can specify an HttpClient to be used, but I don't see this same option for the CloudSolrServer. Is there a way to say the maximum number of connections that should be used for CloudSolrServer? What is the current number that is supported?

search on default field returns less documents

2012-08-08 Thread Shalom
Hi All we have two fields: 'text' is our default field: text we copy the doc field to the 'text' field when indexing 10 documents that have a value with same prefix in the doc field, for example: ca067-XXX ,and searching on the default field I get only 5 results, I search for ca067 on the

Re: Recovery problem in solrcloud

2012-08-08 Thread Jam Luo
There are 400 million documents in a shard, a document is less then 1 kb. the data file _**.fdt is 149g. Does the recovering need large memory in downloading or after downloaded? I find some log before OOM as below: Aug 06, 2012 9:43:04 AM org.apache.solr.core.SolrCore execute INFO: [blog] webapp=

Re: Solr makes long requests about once a minute

2012-08-08 Thread Jack Krupansky
Check the Solr log file and see if something is happening at those slow queries. Maybe an auto-commit? -- Jack Krupansky -Original Message- From: Andy Lester Sent: Wednesday, August 08, 2012 11:30 AM To: solr-user@lucene.apache.org Subject: Solr makes long requests about once a minute

Re: Solr makes long requests about once a minute

2012-08-08 Thread Michael Della Bitta
StandardDirectoryFactory gets us partway there, but that's actually a class that chooses an appropriate implementation at runtime based on the parameters of the system it's being run on. If you go to the status page off of the admin page and do a find on "readerDir", I'm guessing you'll see "org.a

Re: Solr makes long requests about once a minute

2012-08-08 Thread Andy Lester
On Aug 8, 2012, at 10:53 AM, Michael Della Bitta wrote: > What version of Solr are you running and what Directory implementation > are you using? How much RAM does your system have, and how much is > available for use by Solr? Solr 3.6.0 I don't know what "directory implementation" means. Are

RE: numFound changes on changing start and rows

2012-08-08 Thread Rohit
I can cross check our shards once again, but I am sure this is not the case. Regards, Rohit Mobile: +91-9901768202 -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: 08 August 2012 21:04 To: solr-user@lucene.apache.org Subject: Re: numFound changes on chan

Re: Solr makes long requests about once a minute

2012-08-08 Thread Michael Della Bitta
Hi, Andy, What version of Solr are you running and what Directory implementation are you using? How much RAM does your system have, and how much is available for use by Solr? Michael Della Bitta Appinions | 18 East 41st St., Suite 1806 | New York,

Re: numFound changes on changing start and rows

2012-08-08 Thread Michael Della Bitta
Our documents are keyed with UUIDs, and we shard chronologically. The write events are issued as part of a SQS queue that only allows one reader to see the message. I think it's pretty unlikely that we have more than one document with the same uniquekey. I can actually prove this if it will help t

Re: HTTP Basic Authentication with HttpSolrServer [solved]

2012-08-08 Thread vilo
You're partly right. The solution in the link was for CommonsHttpSolrServer, it does work for HttpSolrServer, but the principle is the same. Actually, I found solution for the new HttpClient here: http://stackoverflow.com/questions/2014700/preemptive-basic-authentication-with-apache-httpclient-4/1

Re: numFound changes on changing start and rows

2012-08-08 Thread Chris Hostetter
: We are using Solr3.6 and 2 shards, we are noticing that when we fire a query : with start as 0 and rows X the total numFound and the total numFound changes : when we fire the same exact query with start as y and rows X. The only situation where i've ever heard of this happening is when multipl

Re: getting empty result but numFound > 0

2012-08-08 Thread Jack Krupansky
"status":500 means there was probably an exception on the server. Check the Solr log file for details. -- Jack Krupansky -Original Message- From: Rafael Pappert Sent: Wednesday, August 08, 2012 9:20 AM To: solr-user@lucene.apache.org Subject: getting empty result but numFound > 0 Hel

Solr makes long requests about once a minute

2012-08-08 Thread Andy Lester
I'm having a problem with Solr under Tomcat unexpectedly taking a long time to respond to queries. As part of some stress testing, I wrote a bot that just does random word searches on my Solr install, and my responses typically come back in 10-50 ms. The queries are just 1-3 random words from

Re: HTTP Basic Authentication with HttpSolrServer

2012-08-08 Thread Paul Libbrecht
Villam, this is a question for httpclient, I think you want to enable preemptive authentication so as to avoid the need to repeat the query after the "unauthorized" response is sent. http://hc.apache.org/httpclient-3.x/authentication.html#Preemptive_Authentication paul Le 8 août 2012

HTTP Basic Authentication with HttpSolrServer

2012-08-08 Thread vilo
I have protected my solr server with basic authentication. Now I want to connect to it using SOLRJ. CommonsHttpSolrServer is now deprecated, so I try to use HttpSolrServer, but I fail to send credentials. If I put them to the url, I get 401 (http://user:passw...@example.com/solr). I tried this:

Re: Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Jack Krupansky
There is an open Solr issue to allow commas in lists everywhere, but even in 4.0 space is still the delimiter for field name boost lists ("qf" and "f..qf"). I'll update the wiki. -- Jack Krupansky -Original Message- From: Nils Kaiser Sent: Wednesday, August 08, 2012 9:56 AM To: solr

Re: Setting "df" (default field) from solrj?

2012-08-08 Thread Jack Krupansky
You should simply set the default value for the "df" request parameter in your Solr request handlers in solrconfig.xml. It is set to "text" out of the box, but you can set it to your desired field. If you still want to set/override "df" from SolrJ anyway, use the SolrQuery.setParam method: s

Re: Does Solr support 'Value Search'?

2012-08-08 Thread Bing Hua
Thanks for the response but wait... Is it related to my question searching for field values? I was not asking how to use wildcards though. -- View this message in context: http://lucene.472066.n3.nabble.com/Does-Solr-support-Value-Search-tp3999654p3999817.html Sent from the Solr - User mailing

Setting "df" (default field) from solrj?

2012-08-08 Thread homernabble
I see in Solr 4 the defaultSearchField tag in schema.xml has been deprecated. I was looking in the Solrj API and I don't see a method for setting the default field on a SolrQuery object. This is basically what the code looks like now (stripped down): solrQuery = SolrQuery.new() solrQuery.setQue

Re: Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Nils Kaiser
Thanks for the quick replies. Jack was right, I switched to space as separator and it works. 2) &f.name.qf=last_name_text,first_name_text - returns 0 results, debug shows name="f.name.qf">last_name_text,first_name_text +DisjunctionMaxQuery((last_name_text,first_name_text:maier)) 8) &f.name.qf=

RE: getting empty result but numFound > 0

2012-08-08 Thread Markus Jelsma
The status is 500, check your logs for some errors. -Original message- > From:Rafael Pappert > Sent: Wed 08-Aug-2012 15:49 > To: solr-user@lucene.apache.org > Subject: getting empty result but numFound > 0 > > Hello List, > > i'm evaluate solr 4 / solrCloud and got the following pro

getting empty result but numFound > 0

2012-08-08 Thread Rafael Pappert
Hello List, i'm evaluate solr 4 / solrCloud and got the following problem. I've indexed ~1.5M Documents but the "docs" section in the response is always empty. The response for the *:* query looks like this: { "responseHeader":{ "status":500, "QTime":12, "params":{ "fl":"title

Re: Is this too much time for full Data Import?

2012-08-08 Thread Alexey Serba
9m*15 - that's a lot of queries (>400 QPS). I would try reduce the number of queries: 1. Rewrite your main (root) query to select all possible data * use SQL joins instead of DIH nested entities * select data from 1-N related tables (tags, authors, etc) in the main query using GROUP_CONCAT (that'

Re: Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Jack Krupansky
Jan, I did notice that you used a space rather than a comma in the alias field list. The wiki does indicate comma (which is what Nils used), but... who knows. I haven't checked the code yet. -- Jack Krupansky -Original Message- From: Jan Høydahl Sent: Wednesday, August 08, 2012 9:03

Re: Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Jan Høydahl
Hi, It's hard to see what's going on without knowing more about your schema and documents. Also, it would be more helpful if you could paste the "parsedquery" part of the DebugQuery, where you actually see how the query was interpreted. Your query syntax looks correct, and I just verified that

Re: Is this too much time for full Data Import?

2012-08-08 Thread Michael Della Bitta
Pranav, If possible, you may wish to consider moving a job this large outside of DataImportHandler to a standalone program, as the SQL processing is somewhat limited by the N+1 subselects problem. Michael Della Bitta Appinions | 18 East 41st St.,

Re: numFound changes on changing start and rows

2012-08-08 Thread Michael Della Bitta
Sorry, in my time range example, I forgot to mention that you can repeatedly execute the 8 hour query and receive no results, even after the 7 hour query retrieves them. Kind of an important detail to not forget. :) Michael Della Bitta Appinions |

Re: numFound changes on changing start and rows

2012-08-08 Thread Michael Della Bitta
We've noticed some pretty non-deterministic behavior with sharded setups as well. One thing we've noticed is that a query server can hang on to the set of document ids that correspond to a given query even if caching is off, which results in some weird behavior, such as a query like: timestamp:[N

numFound changes on changing start and rows

2012-08-08 Thread Rohit
Hi, We are using Solr3.6 and 2 shards, we are noticing that when we fire a query with start as 0 and rows X the total numFound and the total numFound changes when we fire the same exact query with start as y and rows X. For example. First time query=abc&start=0&rows=4000 numFound- 56

Problems with EDisMax field aliases for multiple fields on Solr 3.6.1

2012-08-08 Thread Nils Kaiser
Hey, I'm trying to use field aliases that reference multiple fields on Solr 3.6.1 (1362471) as stated in the EDisMax documentation (http://wiki.apache.org/solr/ExtendedDisMax#Field_aliasing_.2BAC8_renaming). If I use an alias for a single field, everything is fine, but once I use an alias wi

Re: Recovery problem in solrcloud

2012-08-08 Thread Yonik Seeley
Stack trace looks normal - it's just a multi-term query instantiating a bitset. The memory is being taken up somewhere else. How many documents are in your index? Can you get a heap dump or use some other memory profiler to see what's taking up the space? > if I stop query more then ten minutes,

Paoding analyzer with solr for chinese

2012-08-08 Thread Rajani Maski
Hi All, As said in this blog site that paoding analyzer is much better for chinese text, I was trying to implement it to get accurate results for chinese text. I followed the instruction specified in the below sites Site1

Re: Designing an index with multiple entity types, sharing field names across entity-types.

2012-08-08 Thread santamaria2
To clarify a wee bit more. I'm wondering the performance impact on single-entity queries if I use common field names. eg. 'name' field for all entity types. 'Author' & 'Book' together make up for 200,000+ 'name' values. Will this affect anything if I search over 'Category'? Will using fq=type:categ

Designing an index with multiple entity types, sharing field names across entity-types.

2012-08-08 Thread santamaria2
My question stems from a vague memory of reading somewhere that Solr's search performance depends on how the total number of 'terms' there are in all in a field that is searched upon. I'm setting up an index core for some autocomplete boxes on my site. There is a search box for each facet group in

Re: Connect to SOLR over socket file

2012-08-08 Thread Michael Kuhlmann
On 07.08.2012 21:43, Jason Axelson wrote: Hi, Is it possible to connect to SOLR over a socket file as is possible with mysql? I've looked around and I get the feeling that I may be mi-understanding part of SOLR's architecture. Any pointers are welcome. Thanks, Jason Hi Jason, not that I kno