Re: integrate SOLR with OSM

2016-06-27 Thread William Bell
OSM is literally Postgres data. If you want to move some of this data into SOLR, you can write SQL and import it... On Sun, Jun 26, 2016 at 10:32 AM, Youzha wrote: > does anybody can help me for this case pls? > On Jun 26, 2016 3:38 AM, "tkg_cangkul" wrote: > > > hi william, > > > > thx for you

edismax - Regex query.

2016-06-27 Thread Modassar Ather
Hi, I have a qf defined as follows: *fl1 fl2 fl3 fl4* These are all text fields. When I query *fl:/netw.{0,4}/ *I see the parsed query contains all the possible analyzed tokens of the query where as for query *fl1:/netw.{0,4}/ *it parses the query as expected to *fl1:/netw.{0,4}/*. Please let m

SimplePostTool: FATAL: IOException while posting data: java.io.IOException: too many bytes written

2016-06-27 Thread Rajendran, Prabaharan
Hi, I am trying to index a text file about 4.2 GB in size. This kind of POC to understand Solr capacity on indexing & searching. Here is my Solr configuration -Xms1024m-Xmx1024m-Xss256k java -Dtype=text/csv -Dparams="separator=%09" -Durl=http://localhost:8983/solr/mycollection/

Positions files analysis

2016-06-27 Thread asteiner
Hi I have a very large index and I'd like to see how can I reduce it. Some of the largest files in the index are the .pos files (positions). There are many excel files indexed with formulas, so I suspect that a large part of the index is used by junk terms as very long numbers. Is there a way I ca

No live SolrServers triggered by maxclausecount

2016-06-27 Thread Pablo Anzorena
Hi, I have an ensemble zookeeper consisting of 3 machines and 2 machines with solrcloud. With a high frequency I see in the logging: *No live SolrServers available to handle this request:[http://solr2:8983/solr/usa_bills_imp_2016_2016062300_shard1_replica1

json facet - date range & interval

2016-06-27 Thread Jay Potharaju
Hi, I am trying to use the json range facet with a tdate field. I tried the following but get an error. Any suggestions on how to fix the following error /examples for date range facets. json.facet={daterange : {type : range, field : datefield, start :"NOW-10DAYS", end : "NOW/DAY", gap : "+1DAY" }

Re: Limit Solr to search for only 500 records based on the search criteria

2016-06-27 Thread Thrinadh Kuppili
Thanks Eric for the info. I have figured out a solution to use time allowed parameter and it worked for my need. -- View this message in context: http://lucene.472066.n3.nabble.com/Limit-Solr-to-search-for-only-500-records-based-on-the-search-criteria-tp4282519p4284583.html Sent from the Solr

Solr PhraseQuery With Wildcard

2016-06-27 Thread Felipe Vinturini
Hi *all*! First time posting! I have been struggling with Solr v4.10.2 with a PhraseQuery with wildcard! My field definition is below: Let's suppose I have the following value added to the index of the field above (portuguese): Teste de texto; Será quebrado em espaços em

Re: edismax - Regex query.

2016-06-27 Thread Modassar Ather
Kindly provide your inputs. Thanks, Modassar On Mon, Jun 27, 2016 at 4:11 PM, Modassar Ather wrote: > Hi, > > I have a qf defined as follows: > > *fl1 fl2 fl3 fl4* > These are all text fields. > > When I query *fl:/netw.{0,4}/ *I see the parsed query contains all the > possible analyzed tokens

looking for documentation on solr.JapaneseTokenizerFactory

2016-06-27 Thread Micheal Cooper
I have a vendor-supplied Solr 4.10 set up for multisite search which indexes two large Drupal 7 sites which have content in Japanese, English, and Undefined. The English searches are OK, but the Japanese does not work well at all. The vendors are in the US, so it is understandable that they cann

Re: No live SolrServers triggered by maxclausecount

2016-06-27 Thread Erick Erickson
That error sometimes gets reported inappropriately, as long as the servers are live you can pretty much ignore it. Attachments pretty much all get stripped by the mail server so we can't see your screen shot. Setting your max clause count to over 100K is pretty much an anti-pattern, what in the w

Re: looking for documentation on solr.JapaneseTokenizerFactory

2016-06-27 Thread Erick Erickson
There's some more information in the reference guide, see: https://cwiki.apache.org/confluence/display/solr/Language+Analysis NOTE: I would _strongly_ urge you to go to the upper-left corner and follow the link for downloading older versions and pulling down the 4.10 guide. It's a bold attempt to

Re: Solr PhraseQuery With Wildcard

2016-06-27 Thread Erick Erickson
OK, you really have to get familiar with the admin/analysis page. Whitespace tokenizer is really simple, it breaks up on whitespace. So punctuation is kept in the index. Which is very rarely what you want. Use something like StandardTokenizer or maybe a filter that removes all non-alpha-num charact

Re: json facet - date range & interval

2016-06-27 Thread Erick Erickson
First thing I'd do is escape the plus. It's probably being interpreted as a space. Best, Erick On Mon, Jun 27, 2016 at 9:24 AM, Jay Potharaju wrote: > Hi, > I am trying to use the json range facet with a tdate field. I tried the > following but get an error. Any suggestions on how to fix the fol

Re: Positions files analysis

2016-06-27 Thread Erick Erickson
Positions are necessary if you need to do "phrase searches". If that's not necessary, simply turn that option off in your schema for the fields where it's unnecessary. See the reference guide for termVectors termPositions termOffsets I'm really not sure what you're asking by: "Is there a way I can

Re: SimplePostTool: FATAL: IOException while posting data: java.io.IOException: too many bytes written

2016-06-27 Thread Erick Erickson
You're most likely not getting _near_ 4.2G written to Solr, the transport protocol is probably cutting that off as indicated by the "early EOF" exception. It's really hard to justify trying to index 4.2G as a _single_ file. First of all you won't even be able to receive it in Solr after you've giv

Re: limit stored field size

2016-06-27 Thread Erick Erickson
I'm not getting through. You do _not_ need to have a field indexed to highlight. They can (optionally) just be stored and re-analyzed just before being returned. Especially with 50 character fields to be highlighted, the re-analysis won't be very expensive. So it's perfectly reasonable to index b

Help with recovering shard range after zookeeper disaster

2016-06-27 Thread pramodEbay
We recently experienced a case where zookeeper snapshot became corrupt and would not restart. zkCli.sh (of zookeeper) would fail with an error unable to connect to / We have a solr cloud with two shards (Keys are autosharded) (Solr version 4.10.1) Unfortunately, we did not have a good snapshot t