Re: Solr fails to start with log file not found error

2015-01-14 Thread Graeme Pietersz
I use the same user every time and the /logs directory and everything in it is owned by that user. I get the same problem occasionally developing on my Ubuntu 14.10 laptop as well, and all the files in the solr directory are owned by me on that machine (and I run Solr as me there as well). Conf

Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
Folks, I have generated multiple (count of 100) sharded Lucene indexes on Hadoop and they are of format. The total indexed data (sum of all the index-*) is of size 500GB and hence the number of shards.drwxr-x--- 2 index-66drwxr-x--- 2 index-68drwxr-x--- 2 index-9 and each index directory

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Mikhail Khludnev
On Wed, Jan 14, 2015 at 11:42 AM, Jaikit Savla < jaikit.sa...@yahoo.com.invalid> wrote: > Now to load this index, I am currently using Lucene IndexMergeTool to > merge all the shards into one giant index. My question is, is there a way > to load shared index without merging into one giant index on

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
This solution will merge the index as well. I want to find out if merge is "required" before loading indexes onto Solr ?  If that is possible than I can just point solrconfig.xml to directory where I have all the shards. Jaikit On Wednesday, January 14, 2015 1:11 AM, Mikhail Khludnev wro

How to do fuzzy search on phrases

2015-01-14 Thread madhav bahuguna
HI Iam using solr 4.7 and solr php client. So heres the issue ,i have data indexed in solr eg mist spa I have applied fuzzy to my search and If i search myst or mysty i get the correct result i get mist spa in result. But if i write mysty spa or must spa i do not get ant results.Iam not able t

Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Jaikit Savla
Folks, I have set up 3 cores in a single collection and they all have same schema but different index. I have set unique Id required field to false. When I run query against single core, it works fine. But when I add the shard param and point to different core than request fails with NPE. I looke

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
Thanks much, now i got better idea on stored & indexed works internally in solr. >From this i tried and modified few things to understand grouping logic. *Schema :* *Code :* SolrQuery q = new SolrQuery().setQuery("type:t1"); q.set(GroupParams.GROUP, true); q.set(GroupParams.GROUP_FIELD, "tenant

Re: Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Mikhail Khludnev
Jaikit, uniq key is mandatory for distributed search. if most of your docs have ids assigned, you can drop remaining ones by adding something like ..&fq=id:[* TO *] On Wed, Jan 14, 2015 at 12:53 PM, Jaikit Savla < jaikit.sa...@yahoo.com.invalid> wrote: > Folks, > I have set up 3 cores in a single

Re: Solr grouping problem - need help

2015-01-14 Thread Norgorn
Can u get raw SOLR response? For me grouping works exactly the way u expect it to work. Try direct query in browser to be sure the problem is not in your code. http://192.168.0.1:8983/solr/collection1/select?q=*:*&group=true&group.field=tenant_pool -- View this message in context: http://luce

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
I tried what you said also appended group.ngroups=true and got same result not expected onengroups coming is 1. i am on solr-4.6.1 single machine default setup. On Wed, Jan 14, 2015 at 4:43 PM, Norgorn wrote: > Can u get raw SOLR response? > > For me grouping works exactly the way u expect i

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
Jack, thanks for help, but if i used PatternReplaceCharFilterFactory for example for this : text d1text d2text d1text 2 ok then at output i only get segment text 2 ok when is text d2 between marks . ...so the filter probably takes only first d1 and last d1 and if is something between it s

Re: Occasionally getting error in solr suggester component.

2015-01-14 Thread Dhanesh Radhakrishnan
Hi, Thanks for the reply. As you mentioned in the previous mail I changed buildOnCommit=false in solrConfig. After that change, suggestions are not working. In Solr 4.7 introduced a new approach based on a dedicated SuggestComponent I'm using that component to build suggestions and lookup implement

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
It should replace all occurrences of the pattern. Post your specific filter XML. Patterns can be very tricky. Use the Solr Admin UI analysis page to see how the filtering is occurring. -- Jack Krupansky On Wed, Jan 14, 2015 at 7:16 AM, tomas.kalas wrote: > Jack, thanks for help, but if i used

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
Hi, I suspect you are likely to misunderstand fuzzy search. You should append "~N" to end of each query term, not whole query string / phrase. (You can debug your query and get useful information by specifying "debugQuery=true" parameter, try it if you have not.) At first glance, I guess Complex

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
I just used Solr UI Analyzer for my test, or must i indexed it firstly? I used this XML code in my schema: This is my result: -- View this message in context: http://lucene.472066.n3.nabble.com/Tokenize

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
I was suspecting it might do that - the pattern is "greedy" and takes the longest matching pattern. Add a question mark after the asterisk to use stingy mode that matches the shortest pattern. -- Jack Krupansky On Wed, Jan 14, 2015 at 8:37 AM, tomas.kalas wrote: > I just used Solr UI Analyzer f

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
just wanted to share schema and results for same : solr version : 4.6.1 Schema : http://www.imagesup.net/?di=10142124357616 Code :http://www.imagesup.net/?di=10142124381116 Response Group : http://www.imagesup.net/?di=1114212438351 Response Terms : http://www.imagesup.net/?di=614212438580 P

Re: Tokenizer or Filter ?

2015-01-14 Thread tomas.kalas
Oh yeah, that is it. Thank you very much for your patience. And a last question at the end what type regEx Solr actually using ? POSIX or PCRE ? Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Tokenizer-or-Filter-tp4178346p4179505.html Sent from the Solr - User maili

RE: How to do fuzzy search on phrases

2015-01-14 Thread Adrien RUFFIE
Tomokoさん、おはようございます。 Can I use ComplexPhraseQueryParser with Core Solr 3.3.0 ? どうもありがとうございます。 お邪魔しました。 宜しくお願いします。 Bien cordialement, ルフフィエ アドリエン Adrien Ruffié LD : +33 1 73 03 26 40 Tél : +33 1 73 03 29 80 E-DEAL Innover la Relation Client -Message d'origine- De : Tomoko Uchida [mailto

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
Hi Adrien, No, you cannot use ComplexPhraseQueryParser in Solr 3.3.0 since this was introduced at Solr 4.8 (it's a pretty new feature...) https://issues.apache.org/jira/browse/SOLR-1604 > お邪魔しました。 You do not need this phrase here, we rarely use this in mails. :) Thanks, Tomoko 2015-01-14 23:19

Re: Tokenizer or Filter ?

2015-01-14 Thread Jack Krupansky
It's what Java has, whatever that is: http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html So, maybe the correct answer is neither, but similar to both. -- Jack Krupansky On Wed, Jan 14, 2015 at 9:06 AM, tomas.kalas wrote: > Oh yeah, that is it. Thank you very much for your pa

Re: How to do fuzzy search on phrases

2015-01-14 Thread Tomoko Uchida
> Iam using solr 4.7 and solr php client. Back to original question, I've missed it. ComplexPhraseQueryParser is not available in Solr 4.7, so sorry for misleading information. Regards, Tomoko 2015-01-14 23:44 GMT+09:00 Tomoko Uchida : > Hi Adrien, > > No, you cannot use ComplexPhraseQueryParse

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Erick Erickson
You certainly can't do this into a single directory, there would be zillions of name conflicts. I believe I saw Uwe make a comment on the Lucene list about using MultiReaders and keeping the sub-indexes in different directories, but that's lower-level than Solr has access to Plus, you'd have to co

Core deletion

2015-01-14 Thread phiroc
Hello, I am running SOLR 4.10.0 on Tomcat 8. The solr.xml file in .../apache-tomcat-8.0.15_solr_8983/conf/Catalina/localhost looks like this: My SOLR instance contains four cores, including one whose instanceDir and dataDir have the following values: instanceDir:/archives/solr/exampl

Re: How to configure Solr PostingsFormat block size

2015-01-14 Thread Michael Sokolov
As a foolish dev (not malicious I hope!), I did mess around with something like this once; I was writing my own Codec. I found I had to create a file called META-INF/services/org.apache.lucene.codecs.Codec in my solr plugin jar that contained the fully-qualified class name of my codec: I guess

Distributed mode for stats component?

2015-01-14 Thread Jack Krupansky
Does anybody know for sure whether the stats component fully supports distributed mode? It is listed in the doc as supporting distributed mode (at least for old, non-SolrCloud distrib mode), but... I don't see any code that actually does that. Nor any tests, unless they are hidden somewhere I didn'

OutOfMemoryError for PDF document upload into Solr

2015-01-14 Thread Ganesh.Yadav
Hello, Can someone pass on the hints to get around following error? Is there any Heap Size parameter I can set in Tomcat or in Solr webApp that gets deployed in Solr? I am running Solr webapp inside Tomcat on my local machine which has RAM of 12 GB. I have PDF document which is 4 GB max in size

Re: OutOfMemoryError for PDF document upload into Solr

2015-01-14 Thread Michael Della Bitta
Yep, you'll have to increase the heap size for your Tomcat container. http://stackoverflow.com/questions/6897476/tomcat-7-how-to-set-initial-heap-size-correctly Michael Della Bitta Senior Software Engineer o: +1 646 532 3062 appinions inc. “The Science of Influence Marketing” 18 East 41st St

Re: Distributed mode for stats component?

2015-01-14 Thread Chris Hostetter
: Does anybody know for sure whether the stats component fully supports : distributed mode? It is listed in the doc as supporting distributed mode it's been supported for as long as i can remember -- since Day 1 of the StatsComponent i believe. : (at least for old, non-SolrCloud distrib mode),

Re: Engage custom hit collector for special search processing

2015-01-14 Thread tedsolr
Thank you so much Alex and Joel for your ideas. I am pouring through the documentation and code now to try an understand it all. A post filter sounds promising. As 99% of my doc fields are character based I should try to compliment the collapsing Q parser with an option that compares string fields

Re: Distributed mode for stats component?

2015-01-14 Thread Jack Krupansky
Thanks, Chris. I just needed to stare at the code I already knew about more intently to see what was really going on. It's super convoluted and super confusing. The keys were the handleResponses method in the main component class and the AbstractStatsValues class that is hidden in the StatsValuesFa

RE: can't make sense of spellchecker results when using techproducts example

2015-01-14 Thread Chris Hostetter
James: everything you said made perfect sense, and in hindsight was actually covered on the page -- it was just hte example that was bogus in light of the current config & defaults I went ahead and fixed it based on your feedback, and beefed up the explanation of spellcheck.collateParam.* (now

WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread gouthsmsimhadri
Problem: While migrating the solr version from 3.X(schema version is 1.4) to cloud 4.10.0 (schema version 1.5), I see a difference in the way the worddelimiterfilter works for the below configuration In the current version, the catentateWords is done on the last postion of the word delimited,

Re: Occasionally getting error in solr suggester component.

2015-01-14 Thread Michael Sokolov
did you build the spellcheck index using spellcheck.build as described here: https://cwiki.apache.org/confluence/display/solr/Spell+Checking ? -Mike On 01/14/2015 07:19 AM, Dhanesh Radhakrishnan wrote: Hi, Thanks for the reply. As you mentioned in the previous mail I changed buildOnCommit=fals

Re: WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread Ahmet Arslan
Hi, You could try passing luceneMatchVersion argument to WordDelimiterFilterFactory and see if it works for you. Factory returns Lucene47WordDelimiterFilter before LUCENE_4_8_0. Ahmet On Wednesday, January 14, 2015 11:10 PM, gouthsmsimhadri wrote: Problem: While migrating the solr version

Re: How to configure Solr PostingsFormat block size

2015-01-14 Thread Chris Hostetter
: As a foolish dev (not malicious I hope!), I did mess around with something : like this once; I was writing my own Codec. I found I had to create a file : called META-INF/services/org.apache.lucene.codecs.Codec in my solr plugin jar : that contained the fully-qualified class name of my codec: I

Re: WordDelimiter Works differently in solr3X vs SolrCloud..?

2015-01-14 Thread gouthsmsimhadri
Thanks Ahmet, that works. - -goutham -- View this message in context: http://lucene.472066.n3.nabble.com/WordDelimiter-Works-differently-in-solr3X-vs-SolrCloud-tp4179647p4179662.html Sent from the Solr - User mailing list archive at Nabble.com.

Conditions in function query

2015-01-14 Thread Shamik Bandopadhyay
Hi, Just wanted to know if it's possible to provide conditions with a function query. Right now,I'm using the following functions to boost on Likes data. bf=recip(ms(NOW/DAY,PublishDate),3.16e-11,1,1)^2.0 sum(Likes,2) What I would like to do is to apply the boost on "Likes" based on source. F

Re: Load existing Lucene sharded indexes onto single Solr collection

2015-01-14 Thread Jaikit Savla
Yes, I wanted to get rid of merge step. But looks like merge is not that cumbersome either. Thanks Mikhail and Erick for pointers, that helped. Jaikit On Wednesday, January 14, 2015 8:24 AM, Erick Erickson wrote: You certainly can't do this into a single directory, there would be zi

Re: Conditions in function query

2015-01-14 Thread Erick Erickson
Why won't the "if" clause work? See: https://cwiki.apache.org/confluence/display/solr/Function+Queries On Wed, Jan 14, 2015 at 5:29 PM, Shamik Bandopadhyay wrote: > Hi, > >Just wanted to know if it's possible to provide conditions with a > function query. Right now,I'm using the following fun

Re: Distributed search across Solr cores in a collection - NPE

2015-01-14 Thread Jaikit Savla
It was because I did not have unique id's in my index. I added that and it worked. Also it is mentioned as one of the requirement for Distributed Search. Thanks,Jaikit On Wednesday, January 14, 2015 1:53 AM, Jaikit Savla wrote: Folks, I have set up 3 cores in a single collection a

Re: Conditions in function query

2015-01-14 Thread shamik
Thanks Eric, I did take a look at the "if" condition earlier, but not sure how that can be used for multiple conditions. It works for a single condition : if(termfreq(Source2,'A'),sum(Likes,3),0) But for multiple, I'm struggling to find the right syntax. I tried using OR in conjunction but hasn'

Distributed Search returns Empty document list

2015-01-14 Thread Jaikit Savla
Hello, I am running Solr (4.10) in cloud mode by configuring multiple collections (1 for each day). Structure is as shown below. I can fetch documents for given query, if I query individual collection. However when I send distributed request to multiple shards, I only see numFound and no documen

Re: Conditions in function query

2015-01-14 Thread Erick Erickson
Nest them perhaps? Best Erick On Wed, Jan 14, 2015 at 7:07 PM, shamik wrote: > Thanks Eric, I did take a look at the "if" condition earlier, but not sure > how that can be used for multiple conditions. It works for a single > condition : > > if(termfreq(Source2,'A'),sum(Likes,3),0) > > But for

Re: Engage custom hit collector for special search processing

2015-01-14 Thread William Bell
We all need example data, and a sample query to help you. You can use "group" to group by a field and remove dupes. If you want to remove dupes you can do something like: q=field1:DOG AND NOT field2:DOG AND NOT field3:DOG That will remove DOG from field2 or field3. If you don't care if it is i

Solr groups not matching with terms in a field

2015-01-14 Thread Naresh Yadav
Hi all, I had done following configuration to test Solr grouping concept. solr version : 4.6.1 (tried in latest version 4.10.3 also) Schema : http://www.imagesup.net/?di=10142124357616 Solrj code to insert docs :http://www.imagesup.net/?di=10142124381116 Response Group's : http://www.images

Re: Solr grouping problem - need help

2015-01-14 Thread Naresh Yadav
as my problem is changed completely from first post so i had created new thread for that. On Wed, Jan 14, 2015 at 7:31 PM, Naresh Yadav wrote: > just wanted to share schema and results for same : > > solr version : 4.6.1 > Schema : http://www.imagesup.net/?di=10142124357616 > Code :http://w