SolrCloud: AliasAPI-Maximum number of collections

2014-06-05 Thread ku3ia
Hi all! The question is how many collections I can put to one alias, using SolrCloud alias collection API https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api4 to process distributed requests? Is it limited? Thanks. -- View this message in context: http://lucene.

Fwd: Integrate solr with openNLP

2014-06-05 Thread Vivekanand Ittigi
Can anyone pleas reply..? Thanks, Vivek -- Forwarded message -- From: Vivekanand Ittigi Date: Wed, Jun 4, 2014 at 4:38 PM Subject: Re: Integrate solr with openNLP To: Tommaso Teofili Cc: "solr-user@lucene.apache.org" , Ahmet Arslan Hi Tommaso, Yes, you are right. 4.4 version

Re: change in EnumField configuration - what do you think?

2014-06-05 Thread Chris Hostetter
: I have an external system that indexes the data. : One field has a closed set of values, the values are sent as integer but : should be represented as String. EnumField is perfect for it. The : problem is that the values are predefined and not order sequentially : (the values were "spaced"

Re: Does lucene uses tries?

2014-06-05 Thread Aman Tandon
Thanks shawn for the explanation and for the example, that helps to understand it quite easily. With Regards Aman Tandon On Fri, Jun 6, 2014 at 3:48 AM, Shawn Heisey wrote: > On 6/5/2014 3:39 PM, Aman Tandon wrote: > > Thanks to all. > > And as Solr provides the field datatype as int as well a

Re: Regex with local params is not working

2014-06-05 Thread Chris Hostetter
: edismax was developed before the lucene query parser syntax was : changed to include regex, so maybe that's the issue. edismax has never been explicilty updated to support regex queries... https://issues.apache.org/jira/browse/SOLR-6009 -Hoss http://www.lucidworks.com/

Re: Does lucene uses tries?

2014-06-05 Thread Shawn Heisey
On 6/5/2014 3:39 PM, Aman Tandon wrote: > Thanks to all. > And as Solr provides the field datatype as int as well as solr.TrieIntField > so which one is more better? The example has "int" and "tint" as two different types that both use TrieIntField. The difference between the two is the precision

Re: Tika: url issue

2014-06-05 Thread Paul Rogers
Hi Can you not split it using oracle's string functions (as part of your select statement)? Something along the lines of: SELECT . RIGHT(LEFT(d.doc_name, (INSTR(d.doc_name, '#') - 1)), LENGTH(LEFT(d.doc_name, (INSTR(d.doc_name, '#') - 1))) - 1) as Name, ^- (strip asterisk from fro

Re: Does lucene uses tries?

2014-06-05 Thread Aman Tandon
Thanks to all. And as Solr provides the field datatype as int as well as solr.TrieIntField so which one is more better? With Regards Aman Tandon On Thu, Jun 5, 2014 at 11:23 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > The default terms dictionary (BlockTree) also uses a trie in

RE: Tomcat restart removes the Core.

2014-06-05 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Thanks for looking my email. Below is the content in the solr.xml under solr-home\solr directory -Original Message- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] Sent: Thursday, June 05, 2014 4:30 PM To: solr-user@lucene.apache.org Subject: Re:

Re: Tomcat restart removes the Core.

2014-06-05 Thread Erick Erickson
"the new way" of doing things is core discovery, see: http://wiki.apache.org/solr/Core%20Discovery%20(4.4%20and%20beyond). You might just sidestep the problem. What does "disappeared" mean? Not showing up in the admin UI? Files on disk erased? The former may well be the persist bit, the latter wou

Re: Automatic syncing of data on a node that was down for a while:

2014-06-05 Thread Erick Erickson
All of your links give 404 errors... But wait. You say "all three nodes are masters". First a nit: "leader" not "master" unless you're using old-style replication... That said: bq: assuming that instance3 is a replica of instance1 or instance2 if "all three are leaders", then this is not a corre

Re: Tomcat restart removes the Core.

2014-06-05 Thread Michael Della Bitta
Did you put that attribute on the root element, or somewhere else? The beginning of solr.xml should look like this: Michael Della Bitta Applications Developer o: +1 646 532 3062 appinions inc. “The Science of Influence Marketing” 18 East 41st Street New York, NY 10017 t: @appinions

RE: Tomcat restart removes the Core.

2014-06-05 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
I update persistent=true in the solr.xml but still no change , after a restart the Cores are removed.. -Original Message- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] Sent: Wednesday, June 04, 2014 2:54 PM To: solr-user@lucene.apache.org Subject: Re: Tomcat restar

Re: null pointer on FSTCompletionLookup

2014-06-05 Thread Will Milspec
Update: this was a configuration error. In my haste/carelessness, instead of defining separate "spellcheck" and "suggest" components, I defined only "suggest". (More specifically I copied over the ch10 examples from "solr in action", but did not copy the "spellcheck" component." ) When solr com

RE: Auto complete with 50TB of data - Need your inputs?

2014-06-05 Thread Toke Eskildsen
bbi123 [bbar...@gmail.com] wrote: > We have a requirement to for large data set like Billing data for example. > The Business wants to do sorting and type ahead functions for it. For > example, when I start typing “8164…” they want to list ALL the unique number > and the associated attributes disp

Re: Does lucene uses tries?

2014-06-05 Thread Michael McCandless
The default terms dictionary (BlockTree) also uses a trie index structure to locate the block on disk that may contain a target term. Mike McCandless http://blog.mikemccandless.com On Thu, Jun 5, 2014 at 12:11 PM, Shawn Heisey wrote: >> I just have want know that does the lucene used the tries

Re: Auto complete with 50TB of data - Need your inputs?

2014-06-05 Thread Shawn Heisey
On 6/5/2014 10:55 AM, bbi123 wrote: > We have a requirement to for large data set like Billing data for example. > The Business wants to do sorting and type ahead functions for it. For > example, when I start typing “8164…” they want to list ALL the unique number > and the associated attributes d

Auto complete with 50TB of data - Need your inputs?

2014-06-05 Thread bbi123
We have a requirement to for large data set like Billing data for example. The Business wants to do sorting and type ahead functions for it. For example, when I start typing “8164…” they want to list ALL the unique number and the associated attributes displayed (name, description, etc). We hav

Re: Multivalue wild card search

2014-06-05 Thread Ethan
Wildcard search do work on multiValued field. I was able to pull up records for following multiValued field - Code : [ "12344", "4534", "674" ] q=Code:45* fetched the correct document. It doesn't work in quotes(q="Code:45*"), however. Is there a workaround? On Thu, Jun 5, 2014 at 9:34 AM, Et

Re: Multivalue wild card search

2014-06-05 Thread Ethan
Are you implying there is not way to lookup on a multiValued field with a substring? If so, then how is it usually handled? On Wed, Jun 4, 2014 at 4:44 PM, Jack Krupansky wrote: > Wildcard, fuzzy, and regex query operate on a single term of a single > tokenized field value or a single string f

Re: Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Alexandre Rafalovitch
On Thu, Jun 5, 2014 at 11:05 PM, Shawn Heisey wrote: > I do not recommend it, but if you want to run in single core mode, you'll > need to put solrconfig.xml and schema.xml in collection1/conf, not > collection1. The messages that get logged are misleading. Oops. Great catch. And sounds like a JI

Re: Does lucene uses tries?

2014-06-05 Thread Shawn Heisey
> I just have want know that does the lucene used the tries data structure > to > store the > data. Lucene (and Solr) will use whatever you tell it when you create the field. If you indicate in your schema fieldType that you want to use a class of solr.TrieIntField, then the field will use a Luce

Re: Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Shawn Heisey
> I've added a solr.xml file: > > > value="/var/www//search/" override="true" /> > > > The file /var/www//search/collection1/solrconfig.xml does exist, > the owner is tomcat7 and the permissions are 664 and contains a > configuration that is working on my mac with the same version of solr.

Re: "Fake" cached join query much faster than cached fq?

2014-06-05 Thread Brett Hoerner
Thanks Mikhail, I'll try to profile it soon. As for cardinality, on a single core: created_at_tdid:[1392768001 TO 1393954400] = 241657215 text:coffee = 117593 Oddly enough, I just tried the query with &distrib=false and both return in about 50ms... hmm. On Thu, Jun 5, 2014 at 5:09 AM, Mikhai

Re: Does lucene uses tries?

2014-06-05 Thread Jack Krupansky
Lucene uses trie for numeric and date fields: http://lucene.apache.org/core/4_8_0/core/org/apache/lucene/search/NumericRangeQuery.html -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014 5:32 AM To: solr-user@lucene.apache.org Subject: Does lucene uses t

Re: Easises way to insatll solr cloud with tomcat

2014-06-05 Thread Jack Krupansky
As has been suggested on other threads here, the HDS distribution of Solr from Heliosearch comes preconfigured with Tomcat, so it really is the "easiest way to install Solr with Tomcat." -- Jack Krupansky -Original Message- From: Aman Tandon Sent: Thursday, June 5, 2014 8:32 AM To: s

Re: change in EnumField configuration - what do you think?

2014-06-05 Thread Jack Krupansky
Yeah, leaving holes as reservations for future fill-in seems like a reasonable use case. Why not just have explicit="n" as an XML attribute for . If no explicit attribute is given, then simply use the previous value plus one. No need for a separate then. -- Jack Krupansky -Original Mes

Re: Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Alexandre Rafalovitch
The following bit looks weird: Can't find resource 'solrconfig.xml' in classpath or '/var/www//search/ collection1/con*f*' Where does con*f* comes from? In any case, what I would do is use "a hammer". One of the truss/strace/dtrace/opensnoop utilities based on what O/S you are doing this on. And

core recovery stalled indefinitely in solr cloud

2014-06-05 Thread Vivek Pathak
Hi I have a set of solr cloud nodes which were working correctly for a while until one of them had problems (due to incorrect java version - but the cause of how it went bad is not very important) Once the problem was fixed and the node was started, I got the following situation: The node

Re: Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Joel Stobart
Hi, The file system tree is: ├── collection1 │ ├── conf │ │ ├── currency.xml │ │ ├── data-config.xml │ │ ├── dataimport.properties │ │ ├── elevate.xml │ │ ├── lang │ │ │ ├── contractions_ca.txt │ │ │ ├── contractions_fr.txt │ │ │ ├── contractions_

Re: OOV queries

2014-06-05 Thread Dmitry Kan
Mike, I did only measurements by hand, i.e. no scientific figures yet. Basically the same oov query takes about same amount of time when repeated. Which is surprising. Ok, to add to the mix: from a list of fqs that are on that query, one is marked with cache=false and a cost > 100. Dmitry On

Re: OOV queries

2014-06-05 Thread Michael Sokolov
It seems as if 0-hit queries should be pretty fast since they can terminate very early? Are you seeing a big difference between first-time and subsequent (cached) no-match queries? -Mike On 6/5/2014 8:47 AM, Dmitry Kan wrote: Hi, Solr is good at caching: even if first "cold" query takes lo

RE: change in EnumField configuration - what do you think?

2014-06-05 Thread Elran Dvir
Thank you for your quick responses. The numeric values are determined by external system which I don't have any control of. There is a gap between values to reserve holes that may be filled in later in the future. The values are sent as integers. In Solr, there is no need to retrieve the num

Re: Solr High GC issue

2014-06-05 Thread Dmitry Kan
@rulinma: I've analyzed all your messages and noticed you keep sending useless "mark" and "good". Are you trying to spam the mailing list? Could you please stop doing this? E-mail clients these days have bookmarking / staring in case you want to save the "good" post for later reading or "mark" it

OOV queries

2014-06-05 Thread Dmitry Kan
Hi, Solr is good at caching: even if first "cold" query takes longer time, the subsequent one is much quicker, given that it shares the fq's of the first query. This is more like an idea question: what about the out-of-vocabulary (OOV) queries? According to my quick measurements, they take same

Delta Import Functionality

2014-06-05 Thread ajay59
Hi, We are using the SOLR 4.6 version and trying to implement Delta import functionality .On implementing the delta import , the indexing is gettting completed but not fetching the updated or newly inserted rows from the table while executing the query. Please suggest. Below is the data config.x

Re: Easises way to insatll solr cloud with tomcat

2014-06-05 Thread Aman Tandon
Yeah sometime i feels that too, but as per my organization i have to go with the tomcat. So any help with tomcat will be really appreciated, i am still stucked here. With Regards Aman Tandon On Thu, May 15, 2014 at 2:14 AM, Greg Walters wrote: > While solr can run under tomcat, the (strongly)

Re: sort by spatial distance in faceting

2014-06-05 Thread Aman Tandon
No, i will try it after getting the some deep knowledge, because i am just a beginner it will take some time :) Thanks for the help david. With Regards Aman Tandon On Wed, Jun 4, 2014 at 6:49 PM, david.w.smi...@gmail.com < david.w.smi...@gmail.com> wrote: > Did my suggestion work out? > > RE co

Re: Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Dmitry Kan
bq. The file /var/www//search/collection1/solrconfig.xml does exist, the owner is tomcat7 and the permissions are 664 and contains a configuration that is working on my mac with the same version of solr. The exception is self-explanatory: solr is looking for conf directory inside collection1 dire

Re: Display no results

2014-06-05 Thread Erik Hatcher
Philippe - /browse is a Solr request handler defined in solrconfig.xml. It’s configuration is shipped using edismax and q.alt=*:*, meaning that if there is no client provided query string it defaults to all documents. You could either adjust the configuration such that an empty /browse reque

Re: "Fake" cached join query much faster than cached fq?

2014-06-05 Thread Mikhail Khludnev
Brett, It's really interesting observation. I can only speculate. It's worth to check cache hit stats and cache content via http://wiki.apache.org/solr/SolrCaching#showItems (the key question what are cached doc sets classes). Also if you tell the overall number of docs in the index, and cardinal

Does lucene uses tries?

2014-06-05 Thread Aman Tandon
Hi, I just have want know that does the lucene used the tries data structure to store the data. With Regards Aman Tandon

Re: Solr-Distributed search

2014-06-05 Thread Mahmoud Almokadem
Hi, you can search using this sample Url http://localhost:8080/solr/core1/select?q=*:*&shards=localhost:8080/solr/core1,localhost:8080/solr/core2,localhost:8080/solr/core3 Mahmoud Almokadem On Thu, Jun 5, 2014 at 8:13 AM, Anurag Verma wrote: > Hi, > Can you please help me solr distrib

Solr-Distributed search

2014-06-05 Thread Anurag Verma
Hi, Can you please help me solr distribued search in multicore? i would be very happy as i am stuck here. In java code how do i implement distributed search? -- Thanks & Regards Anurag Verma

Solr 4.8.1 on Tomcat 7 cant find solrconfig (Ubuntu)

2014-06-05 Thread Joel Stobart
I can't get Solr to start up inside a tomcat7 server. Other Wars are starting but this one isn't. That's it really. It works on my Mac. The bit is a slight redaction, the work is for a client. I've added a solr.xml file: The file /var/www//search/collection1/solrconfig.xml does exist, the

Display no results

2014-06-05 Thread DE ROCHAMBEAU, Philippe
Hi, the Velocity example that comes with SOLR displays all data contained in the database when you connect to the following page with a browser: http://localhost:8983/solr/collection1/browse . I have looked through collection1's Velocity templates, but unfortunately can't locate the code that cre

Fw: highlighting on hl.alternateField (copyField target) doesnt highlight

2014-06-05 Thread jay list
Anybody knowing this issue? > Gesendet: Dienstag, 03. Juni 2014 um 09:11 Uhr > Von: "jay list" > An: solr-user@lucene.apache.org > Betreff: highlighting on hl.alternateField (copyField target) doesnt highlight > > > Hello, >   > im trying to implement a user friendly search for phone numbers. Th

Re: defaultHighLight of solr3.6 does not work?

2014-06-05 Thread yo
I found someone had customised the solr1.4 default highlight before I joined the team. so It is not a bug. The standard solr1.4 default highlighter works like solr3.6. -- View this message in context: http://lucene.472066.n3.nabble.com/defaultHighLight-of-solr3-6-does-not-work-tp4138817p414002