Re: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-18 Thread Nitin Solanki
Hi James, How to see the suggestions of spellcheck.alternativeTermCount ? On Wed, Feb 18, 2015 at 11:09 AM, Nitin Solanki wrote: > Thanks James, > I tried the same thing > spellcheck.count=10&spellcheck.alternativeTermCount=5. And I got 5 > suggestions of b

Why collations are coming even I set the value of spellcheck.count to zero(0)

2015-02-18 Thread Nitin Solanki
Hi Everyone, I have set spellcheck.count = 0 and spellcheck.alternativeTermCount = 0. Even so, collations are returned whenever I search with a misspelled query. Why? I also set spellcheck.maxCollations = 100 and spellcheck.maxCollationTries = 100

Re: Solrcloud sizing

2015-02-18 Thread Toke Eskildsen
On Wed, 2015-02-18 at 01:40 +0100, Dominique Bejean wrote: (I reordered the requirements) > - Collection size : 15 billions document > - Document size is nearly 300 bytes > - 1 billion documents indexed = 5Gb index size > - Collection update : 8 millions new documents / days + 8 millions >d
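The sizing figures quoted above can be turned into a rough estimate. This is only a back-of-the-envelope sketch: it assumes index size scales linearly with document count, which the thread does not confirm, and it ignores updates and deletes. The function and constant names are illustrative.

```python
# Figures quoted in the thread; linear scaling is an assumption, not a measurement.
GB_PER_BILLION_DOCS = 5.0
TOTAL_DOCS = 15_000_000_000
NEW_DOCS_PER_DAY = 8_000_000


def estimated_index_gb(doc_count, gb_per_billion=GB_PER_BILLION_DOCS):
    """Estimate index size in GB for a given number of documents."""
    return doc_count / 1_000_000_000 * gb_per_billion


total_gb = estimated_index_gb(TOTAL_DOCS)
daily_gb = estimated_index_gb(NEW_DOCS_PER_DAY)
print(f"Estimated full index: {total_gb:.0f} GB")
print(f"Estimated daily growth from new documents: {daily_gb * 1000:.0f} MB")
```

Under these assumptions the whole collection comes to roughly 75 GB of index, with new documents adding on the order of 40 MB per day.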

Internal document format for Solr 4.10.2

2015-02-18 Thread dinesh naik
Hi, Is there a way to read the internal document once solr does the indexing ? Also is there a possibility to store this internal document in xml format ? -- Best Regards, Dinesh Naik

How to place whole indexed data on cache

2015-02-18 Thread Nitin Solanki
Hi, How can I place the whole indexed data in cache, so that any query I run returns its response, suggestions, and collations rapidly? Also, how can I view which documents are in the cache and verify it?

Re: Solrcloud sizing

2015-02-18 Thread Dominique Bejean
Hi Toke, Thank you for your response. Here are some clarifications. > >> - The same terms will occur several times for a given field (from 10 >> to 100.000) > >Do you mean that any term is only present in a limited number (up to >about 100K) of documents or do you mean that some documents has field

Re: Internal document format for Solr 4.10.2

2015-02-18 Thread Dmitry Kan
Hello! You can try luke's export feature: https://github.com/DmitryKey/luke/wiki/Exporting-index-to-xml On Wed, Feb 18, 2015 at 12:57 PM, dinesh naik wrote: > Hi, > Is there a way to read the internal document once solr does the indexing ? > > Also is there a possibility to store this internal

Re: Confirm Solr index corruption

2015-02-18 Thread Otis Gospodnetic
Hi, It sounds like Solr simply could not index some docs. The index is not corrupt, it's just that indexing was failing while disk was full. You'll need to re-send/re-add/re-index the missing docs (or simply all of them if you don't know which ones are missing). Otis -- Monitoring * Alerting *

Re: How to place whole indexed data on cache

2015-02-18 Thread Shawn Heisey
On 2/18/2015 4:20 AM, Nitin Solanki wrote: > How can I place whole indexed data on cache by which if I will > search any query then I will get response, suggestions, collations rapidly. > And also how to view that which documents are on cache and how to verify it? Simply install enough ex

Re: Solr suggest is related to second letter, not to initial letter

2015-02-18 Thread Michael Sokolov
On 02/17/2015 03:46 AM, Volkan Altan wrote: First of all thank you for your answer. You're welcome - thanks for sending a more complete example of your problem and expected behavior. I don’t want to use KeywordTokenizer. Because, as long as the compound words written by the user are availabl

Solrcloud with map-reduce indexing and document routing

2015-02-18 Thread Dominique Bejean
Hi, I never used map-reduce indexing. My understanding is that map-reduce tasks generate one or more Solr indices, then the golive tool is used in order to merge these indices at core level to one or more shards (the shard's leaders) in a Solrcloud collection. After merge occurs in leaders the re

Get nearby suggestions for any phrase searching

2015-02-18 Thread Nitin Solanki
Hello, I want to retrieve only the top five suggestions for any phrase or query search. How can I do that? For example, if I search with ?q="the bark night", I need a suggestion/collation like "the dark knight". How can I get nearby suggestions/terms for the phrase?

Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-18 Thread Nitin Solanki
Hi, I have created 4 nodes having 8 shards. Now, I want to divide those 4 Nodes into 100 Nodes without any failure/ or re-indexing the data. Any help please?

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-18 Thread Yago Riveiro
You can try the SPLIT command — /Yago Riveiro On Wed, Feb 18, 2015 at 3:19 PM, Nitin Solanki wrote: > Hi, > I have created 4 nodes having 8 shards. Now, I want to divide those > 4 Nodes into 100 Nodes without any failure/ or re-indexing the data. Any > help please?

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-18 Thread Nitin Solanki
Okay, It will destroy/harm my indexed data. Right? On Wed, Feb 18, 2015 at 9:01 PM, Yago Riveiro wrote: > You can try the SPLIT command > > > — > /Yago Riveiro > > On Wed, Feb 18, 2015 at 3:19 PM, Nitin Solanki > wrote: > > > Hi, > > I have created 4 nodes having 8 shards. Now, I want t

Best platform for hosting Solr

2015-02-18 Thread Ganesh.Yadav
Guys, 1. Can anyone suggest what would be the best platform to host Solr on any Unix or windows server? 2. All I will be doing is importing lots of PDF documents into Solr. I believe Solr will automatically build the schema for imported documents. 3. Can someone suggest what

How to achieve lemmatization for english words in Solr 4.10.2

2015-02-18 Thread dinesh naik
Hi, Is there a way to achieve lemmatization in Solr? The stemming option does not meet the requirement. -- Best Regards, Dinesh Naik

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-18 Thread Yago Riveiro
No, the SPLIT operation doesn’t destroy the data. When the SPLIT operation is finished, the parent shard is deactivated and you can remove it. More info: https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api3 -- /Yago Riveiro On Wed, Feb 18, 2015 at 3:39 PM, Nitin S
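For reference, the split discussed here corresponds to the SPLITSHARD action of the Collections API. A minimal sketch of building such a request URL follows; the host, collection name, and shard name are hypothetical placeholders, and the helper function is not part of Solr or SolrJ.

```python
from urllib.parse import urlencode


def splitshard_url(base_url, collection, shard):
    """Build a Collections API request URL that splits one shard."""
    query = urlencode(
        {"action": "SPLITSHARD", "collection": collection, "shard": shard}
    )
    return f"{base_url}/admin/collections?{query}"


# Hypothetical host, collection, and shard names, for illustration only.
print(splitshard_url("http://localhost:8983/solr", "mycollection", "shard1"))
```

Each call targets a single shard, so splitting a whole collection would mean issuing one request per shard.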

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-18 Thread Shawn Heisey
On 2/18/2015 8:17 AM, Nitin Solanki wrote: > I have created 4 nodes having 8 shards. Now, I want to divide those > 4 Nodes into 100 Nodes without any failure/ or re-indexing the data. Any > help please? I think your only real option within a strict interpretation of your requirements is sh

Fwd: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Abdelali AHBIB
Hello, We use solrcloud with two shards (no replication for now), zookeeper is in a separate machine and it works well until yesterday when I renamed some cores, cleaned other unused ones that we don't need anymore etc... then I got tons of these errors when I try to put docs into my core "sellers

SOLR, Coveo, and Oracle questions

2015-02-18 Thread Eric E
Hello, I have some basic questions for the group. I would appreciate any advice you can give me. We have an Oracle RAC database that has a number of schemas on it. Various "things" query the structured data stored in these schemas, 10s of thousands of times per day. Two of these schemas in par

Re: How to achieve lemmatization for english words in Solr 4.10.2

2015-02-18 Thread Ahmet Arslan
Hi Dinesh, solr.KStemFilterFactory is dictionary-based, i.e. the outputs it produces are valid English words. That may fit if by "lemmatizer" you mean "finding dictionary entries". Ahmet On Wednesday, February 18, 2015 5:51 PM, dinesh naik wrote: Hi, IS there a way to achieve lemmatization in

Re: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Dominique Bejean
Hi, When you say "I renamed some cores, cleaned other unused ones that we don't need anymore etc", how did you do this ? With Cores or Collections API or by deleting core's directories in Solr Home ? Dominique http://www.eolya.fr 2015-02-18 17:04 GMT+01:00 Abdelali AHBIB : > Hello, > > We use

Re: How to achieve lemmatization for english words in Solr 4.10.2

2015-02-18 Thread Jack Krupansky
Please provide a few examples that illustrate your requirements. Specifically, requirements that are not met by the existing Solr stemming filters. What is your specific goal? -- Jack Krupansky On Wed, Feb 18, 2015 at 10:50 AM, dinesh naik wrote: > Hi, > IS there a way to achieve lemmatization

Re: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Abdelali AHBIB
With the Collections API. There were still some config files in /solr/config/Xunused_collection; I also deleted them manually. 2015-02-18 16:16 GMT+00:00 Dominique Bejean : > Hi, > > When you say "I renamed some cores, cleaned other unused ones that we don't > need anymore etc", how did you do this ? > With

Re: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Abdelali AHBIB
Sorry, no rename operation happened, just a delete (manually, from Solr home and config) and a manual duplication of a core (the duplicated core is the same core that doesn't have a problem); then I ran the zkcli upconfig command. 2015-02-18 16:22 GMT+00:00 Abdelali AHBIB : > with Collections API, they

Re: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Shawn Heisey
On 2/18/2015 9:22 AM, Abdelali AHBIB wrote: > with Collections API, they still some config files in > /solr/config/Xunused_collection, I deleted them also manualy > > 2015-02-18 16:16 GMT+00:00 Dominique Bejean : > >> When you say "I renamed some cores, cleaned other unused ones that we don't >> ne

RE: How to achieve lemmatization for english words in Solr 4.10.2

2015-02-18 Thread Dinesh Naik
Hi Jack, We are looking for something like this: for example, if you search for the text "go", we should also get other forms of it, like going, gone, goes, etc. This is not achieved via stemming. -Original Message- From: "Jack Krupansky" Sent: 18-02-2015 21:50 To: "solr-user@l

Re: Solr suggest is related to second letter, not to initial letter

2015-02-18 Thread Volkan Altan
Yes, I did it, but it doesn’t work. New example: TSTLookup doc 1 : shoe adidas 2 hiking doc 2 : galaxy samsung s5 phone doc 3 : shakeology sample packets http://localhost:8983/solr/solr/suggest?q=samsung+hi 0 1 2 0 7 samsung s5 samsung s5 phone 1 8 10 hiking (samsung s5) hiking 0

Re: Solrcloud, no puts anymore and tons of “update?update.distrib=TOLEADER”

2015-02-18 Thread Abdelali AHBIB
Thank you Dominique and Shawn. Now I see that clusterstate.json does not reflect the current number of cores in shard2; there are duplicated cores in all collections like this. How can I edit clusterstate.json? [image: Embedded images 1] 2015-02-18 16:54 GMT+00:00 Shawn Heisey : > On 2/18/2015

solr output in custom json format

2015-02-18 Thread meena.sri...@mathworks.com
I need to create a custom JSON format of the Solr output for a specific UI. I was wondering if there is a way to embed the "highlighting" portion inside the docs themselves. Thanks Meena -- View this message in context: http://lucene.472066.n3.nabble.com/solr-output-in-custom-json-format-tp4187200.html Sent fro

Re: solr output in custom json format

2015-02-18 Thread meena.sri...@mathworks.com
Sorry, I left out the key part: I want to do this without parsing the JSON output. I was looking into SolrJ QueryResponse.getBeans(Syndrome.class), but how do I embed the highlighting snippet inside each Syndrome object itself? Thanks meena -- View this message in context: http://lucene.47206

Re: solr output in custom json format

2015-02-18 Thread Erik Hatcher
I think what ideally is needed here is an implementation for this open issue: > — Erik Hatcher, Senior Solutions Architect http://www.lucidworks.com > On Feb 18,

Re: How to place whole indexed data on cache

2015-02-18 Thread Dominique Bejean
Hi, As Shawn said, install enough memory so that all free non-heap memory can be used as disk cache. Use at most 40% of the available memory for heap memory (the Xmx JVM parameter), but never more than 32 GB. Also keep your server from swapping. For most Linux systems, this is configured u
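The heap rule of thumb given in this reply can be written as a small calculation. This is a sketch of that rule only (40% of RAM, capped at 32 GB); the function name and defaults are illustrative, and the right values depend on the actual workload.

```python
def suggested_max_heap_gb(total_ram_gb, fraction=0.4, cap_gb=32):
    """Apply the rule of thumb: at most `fraction` of RAM, never above `cap_gb`."""
    return min(total_ram_gb * fraction, cap_gb)


for ram in (16, 64, 128):
    heap = suggested_max_heap_gb(ram)
    # The remainder is what the OS can use as disk cache for the index files.
    print(f"{ram} GB RAM -> heap up to {heap:.1f} GB, {ram - heap:.1f} GB left for the OS")
```

On a 128 GB machine the cap applies, so the heap stays at 32 GB and the rest remains available to the operating system for disk caching.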

Can Solr meet my requirements?

2015-02-18 Thread Cupidvogel
I went through the Solr documentation, and it seemed pretty good. However, I have a different requirement. I have this scenario: I will provide a list of words, each corresponding to a particular position. Say this is an array of tuples, where each tuple consists of the word and its position (the position

RE: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-18 Thread Dyer, James
It will try to give you suggestions up to the number you specify, but if fewer are available it will not give you any more. James Dyer Ingram Content Group -Original Message- From: Nitin Solanki [mailto:nitinml...@gmail.com] Sent: Tuesday, February 17, 2015 11:40 PM To: solr-user@lucene

RE: Why collations are coming even I set the value of spellcheck.count to zero(0)

2015-02-18 Thread Dyer, James
I think when you set "count"/"alternativeTermCount" to zero, the defaults (10?) are used instead. Instead of setting these to zero, just use "spellcheck=false". These 2 parameters control suggestions, not collations. To turn off collations, set "spellcheck.collate=false". Also, I wouldn't set
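The parameters named in this reply can be illustrated with a small helper that builds the query string. This is a sketch, not an official client: the helper function is hypothetical, and only the parameter names spellcheck and spellcheck.collate come from the message above.

```python
from urllib.parse import urlencode


def spellcheck_params(query, suggestions=True, collations=True):
    """Build Solr query parameters that toggle spellcheck features explicitly."""
    params = {"q": query}
    if not suggestions:
        # Turn spellchecking off entirely instead of setting the counts to 0.
        params["spellcheck"] = "false"
        return params
    params["spellcheck"] = "true"
    # Collations are controlled separately from the suggestion counts.
    params["spellcheck.collate"] = "true" if collations else "false"
    return params


# Keep suggestions but switch collations off.
print(urlencode(spellcheck_params("the bark night", collations=False)))
```

The point of the design is that each feature gets its own switch, rather than relying on a zero count that may fall back to a default.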

Re: Simple Sort Is Not Working In Solr 4.7?

2015-02-18 Thread Simon Cheng
Great help and thanks to you, Alex. On Wed, Feb 18, 2015 at 2:48 PM, Alexandre Rafalovitch wrote: > Like I mentioned before. You could use string type if you just want > title it is. Or you can use a custom type to normalize the indexed > value, as long as you end up with a single token. > > So

Re: Boosting by calculated distance buckets

2015-02-18 Thread sraav
David, I just subscribed to the Solr list; let's see if that will allow me to post this. I will write a custom ValueSource. I tried the map function that you suggested; it works, but it is not so great on performance. I will try referring to the function query as a sort instead of bq..may be it wil

Collections API - HTTP verbs

2015-02-18 Thread Hrishikesh Gadre
Hi, Can we please document which HTTP method is supposed to be used with each of these APIs? https://cwiki.apache.org/confluence/display/solr/Collections+API I am trying to invoke following API curl http:// :8983/solr/admin/collections?action=CLUSTERPROP&name=urlScheme&val=https This request i

Re: Collections API - HTTP verbs

2015-02-18 Thread Mark Miller
Perhaps try quotes around the url you are providing to curl. It's not complaining about the http method - Solr has historically always taken simple GET's for http - for good or bad, you pretty much only post documents / updates. It's saying the name param is required and not being found and since
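The quoting advice matters because in most shells an unquoted & ends the command and runs it in the background, so everything after action=CLUSTERPROP never reaches Solr. A minimal sketch of producing a safely quoted curl command follows; the host is a hypothetical placeholder and the helper function is illustrative.

```python
import shlex
from urllib.parse import urlencode


def clusterprop_curl(base_url, name, val):
    """Return a curl command line with the request URL quoted for the shell."""
    query = urlencode({"action": "CLUSTERPROP", "name": name, "val": val})
    url = f"{base_url}/admin/collections?{query}"
    # shlex.quote wraps the URL in single quotes because it contains '&' and '?'.
    return "curl " + shlex.quote(url)


# Hypothetical host, for illustration only.
print(clusterprop_curl("http://localhost:8983/solr", "urlScheme", "https"))
```

With the URL inside single quotes, the shell passes the name and val parameters through to curl unchanged.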

Re: ApacheCon 2015 at Austin, TX

2015-02-18 Thread CP Mishra
Dmitry, that would be great. CP On Thu, Feb 12, 2015 at 5:35 AM, Dmitry Kan wrote: > Hi, > > Looks like I'll be there. So if you want to discuss luke / lucene / solr, > will be happy to de-virtualize. > > Dmitry > > On Mon, Jan 12, 2015 at 6:32 PM, CP Mishra wrote: > > > Hi, > > > > I am plann