Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread Shawn Heisey
On 10/9/2013 8:39 PM, deniz wrote: > Billnbell wrote >> You have to update the whole record including all fields... > > so what is the point of having atomic updates if i need to update > everything? If you have any regular fields that are not stored, atomic updates will not work -- unstored fie

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread deniz
Billnbell wrote > You have to update the whole record including all fields... so what is the point of having atomic updates if i need to update everything? - Zeki ama calismiyor... Calissa yapar... -- View this message in context: http://lucene.472066.n3.nabble.com/Field-with-default-valu

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread Bill Bell
You have to update the whole record including all fields... Bill Bell Sent from mobile > On Oct 9, 2013, at 7:50 PM, deniz wrote: > > hi all, > > I have encountered some problems and post it on stackoverflow here: > http://stackoverflow.com/questions/19285251/solr-field-with-default-value-res

Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread deniz
hi all, I have encountered some problems and post it on stackoverflow here: http://stackoverflow.com/questions/19285251/solr-field-with-default-value-resets-itself-if-it-is-stored-false as you can see from the response, does it make sense to open a bug ticket for this? because, although i can w

Re: Dynamically loading synonym dictionary for solr SynonymFilter

2013-10-09 Thread Jan Høydahl
Hi, Not as I know of. You'd probably want to subclass SynonymFilter* with your own DB aware implementation, and of course contribute this back :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. okt. 2013 kl. 23:31 skrev ALEX PKB : > Hi, > All of our synonyms are m

Re: Solr 4.4 - Master/Slave configuration - Replication Issue with Commits after deleting documents using Delete by ID

2013-10-09 Thread Otis Gospodnetic
Bharat, Can you look at the logs on the Master when you issue the delete and the subsequent commits and share that? Otis -- Solr & ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Oct 8, 2013 at 3:57 PM, Akkinepalli, Bharat (ELS-CON) wrot

Re: run filter queries after post filter

2013-10-09 Thread jim ferenczi
Hi Rohit, The main problem is that if the query inside the filter does not have a PostFilter implementation then your post filter is silently transformed into a simple filter. The query "field:value" is based on the inverted lists and does not have a postfilter support. If your field is a numeric f

Dynamically loading synonym dictionary for solr SynonymFilter

2013-10-09 Thread ALEX PKB
Hi, All of our synonyms are maintained in DB, we would like to fetch those synonym dynamically for query expansion (Not indexing time). Are there any code contribution? I saw some discussion years ago but without conclusion. Thanks a lot!

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Upayavira
It depends whether multicad is a special case, or whether you want micr to match the term "microsoft". If it is a special case, you can use synonyms, so that multi and multicad are considered the same term. If it isn't a special case, then ngrams could work - your document would be indexed with:

Re: run filter queries after post filter

2013-10-09 Thread Rohit Harchandani
yes i get that. actually i should have explained in more detail. - i have a query which gets certain documents. - the post filter gets these matched documents and does some processing on them and filters the results. - but after this is done i need to apply another filter - which is why i gave a h

Re: matching "starts with" only

2013-10-09 Thread adm1n
search by "starts with" is something new I have to add, as well as the data I have to index for this purpose, so it's ok to create a new field. But once I added the following field type: And: indexing, and afterwards searching

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Erick Erickson
The admin/analysis page is definitely your friend. On the surface, [catenateWords="1"] in WDFF should mash the split up bits of multiCAD into multicad and you should be. I suspect that StandardTokenizerFactory is somehow getting into the mix here. Under any circumstance, the admin/analysis page sh

Re: matching "starts with" only

2013-10-09 Thread Shawn Heisey
On 10/9/2013 2:16 PM, adm1n wrote: Why this field have to be copyField? Couldn't it be a single field, for I always assume that people already are using the existing field and type for other purposes. Offering advice without making that assumption will usually result in people making a chang

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Erick Erickson
bq: ...in the sense that there's only one canonical copy. Agreed, and as you say that copy is kept in ZooKeeper. And I pretty much guarantee that the internal solrconfig object is NOT shared. I doubt the schema object is shared, but it seems like it could be with some work. But the savings p

Re: matching "starts with" only

2013-10-09 Thread adm1n
Shawn Heisey-4: thanks for the quick response. Why this field have to be copyField? Couldn't it be a single field, for example: thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/matching-start

Re: run filter queries after post filter

2013-10-09 Thread Erick Erickson
Ah, I think you're misunderstanding the nature of post-filters. Or I'm confused, which happens a lot! The whole point of post filters is that they're assumed to be expensive (think ACL calculation). So you want them to run on the fewest documents possible. So only docs that make it through the pri

Re: matching "starts with" only

2013-10-09 Thread Shawn Heisey
On 10/9/2013 12:57 PM, adm1n wrote: My index contains documents which could be a single word or a short sentence which contains up to 4-5 words. I need to return documents, which "starts with" only from the searched pattern. in regex it would be [^my_query]. for example, for a docs: black beaut

matching "starts with" only

2013-10-09 Thread adm1n
My index contains documents which could be a single word or a short sentence which contains up to 4-5 words. I need to return documents, which "starts with" only from the searched pattern. in regex it would be [^my_query]. for example, for a docs: black beautiful black cat cat cat is black black

Re: {soft}Commit and cache flusing

2013-10-09 Thread Tim Vaillancourt
Apologies all. I think the suggestion that I was replying "to get noticed" is what erked me, otherwise I would have moved on. I'll follow this advice. Cheers, Tim On 9 October 2013 05:20, Erick Erickson wrote: > Tim: > > I think you're mis-interpreting. By replying to a post with the subject:

Re: SolrCloud High Availability during indexing operation

2013-10-09 Thread Saurabh Saxena
@Furkan Pastebin link is working for me. Can you try again ? On Wed, Oct 9, 2013 at 1:15 AM, Furkan KAMACI wrote: > Hi Saurabh, > Your link does not work (it is broken). > > > 2013/10/9 Saurabh Saxena > > > Pastbin link http://pastebin.com/cnkXhz7A > > > > I am doing a bulk request. I am upload

Re: limiting deep pagination

2013-10-09 Thread Michael Sokolov
On 10/8/13 6:51 PM, Peter Keegan wrote: Is there a way to configure Solr 'defaults/appends/invariants' such that the product of the 'start' and 'rows' parameters doesn't exceed a given value? This would be to prevent deep pagination. Or would this require a custom requestHandler? Peter Just wo

Re: Find documents that are composed of % words

2013-10-09 Thread Furkan KAMACI
Are you asking something like that: http://wiki.apache.org/solr/TextProfileSignature 9 Ekim 2013 Çarşamba tarihinde shahzad73 adlı kullanıcı şöyle yazdı: > Please help me formulate the query that will be easy or do i have to build a > custom filter for this ? > > Shahzad > > > > -- > View this m

Re: SolrJ best pratices

2013-10-09 Thread Mark
Thanks for the clarification. In Solr Cloud just use 1 connection. In non-cloud environments you will need one per core. On Oct 8, 2013, at 5:58 PM, Shawn Heisey wrote: > On 10/7/2013 3:08 PM, Mark wrote: >> Some specific questions: >> - When working with HttpSolrServer should we keep around

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Shawn Heisey
On 10/9/2013 1:17 AM, xinwu wrote: I remember I must put the "-Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig " in the catalina.sh . Is it means that solrCloud must have one ,and only one, schema? Those bootstrap options are intended to be used ONCE, and

Solr's Filtering approaches

2013-10-09 Thread David Philip
Hi All, I have an issue in handling filters for one of our requirements and liked to get suggestion for the best approaches. *Use Case:* 1. We have List of groups and the number of groups can increase upto >1 million. Currently we have almost 90 thousand groups in the solr search system.

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Shawn Heisey
On 10/9/2013 6:24 AM, Erick Erickson wrote: Hmmm, I hadn't thought about that before. The shareSchema stuff is keyed off the absolute directory (and timestamp) of the schema.xml file associated with a core and is about sharing the internal object that holds the parsed schema. Do you know for sur

Re: Find documents that are composed of % words

2013-10-09 Thread shahzad73
my client has a strange requirement, he will give a list of 500 words and then set a percentage like 80% now he want to find those pages or documents which consist of the only those 80% of 500 and only 20% unknown. like we have this document word1 word2

Re: Find documents that are composed of % words

2013-10-09 Thread shahzad73
Please help me formulate the query that will be easy or do i have to build a custom filter for this ? Shahzad -- View this message in context: http://lucene.472066.n3.nabble.com/Find-documents-that-are-composed-of-words-tp4094264p4094372.html Sent from the Solr - User mailing list archive at

Re: run filter queries after post filter

2013-10-09 Thread Rohit Harchandani
Hey, so the post filter logs the number of ids that it receives. With the above filter having cost=200, the post filter should have received the same number of ids as before ( when the filter was not present ). But that does not seem to be the case...with the filter query on the index, the number o

Update existing documents when using ExtractingRequestHandler?

2013-10-09 Thread Jeroen Steggink
Hi, In a content management system I have a document and an attachment. The document contains the meta data and the attachment the actual data. I would like to combine data of both in one Solr document. I have thought of several options: 1. Using ExtractingRequestHandler I would extract the dat

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Furkan KAMACI
If you have that word to index: "multicad" and if you want to get result when you search that: "multi" you can use ngram filter. However you should consider pros and cons of using Ngram Filter. If you use ngrams you may find "multicad" from "multi" but your index size will be much more bigger. I s

Re: Shard split issue

2013-10-09 Thread Shalin Shekhar Mangar
I opened https://issues.apache.org/jira/browse/SOLR-5324 On Mon, Oct 7, 2013 at 2:20 PM, Yago Riveiro wrote: > If the replica has 20G must probably the recovery will take more than 120 > seconds. > > In my case I have ssd's and 120 it's not enough. > > -- > Yago Riveiro > Sent with Sparrow (htt

Re: Permisions didn't check when call discoverUnder

2013-10-09 Thread Said Chavkin
I'm not sure, may be solr should skip inaccessible directory. Because it is standard rule to place service on separate filesystem. On the other hand it is possible to place solr/home not on the top of mounted fs. Anyway it would be better if error message was more clearly. 2013/10/9 Erick Erickso

RE: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Van Tassell, Kristian
Thank you Upayavira. I'm trying to figure out what will make Solr stem on "multi" in the word "multicad" so that any attempt to search on "multicad", "Multi-CAD" or "multiCAD" will return results. The WordDelimiterFilterFactory helps with the case of multi followed by a dash or a capital letter

Re: Permisions didn't check when call discoverUnder

2013-10-09 Thread Erick Erickson
What do you think Solr should do in this case? If the process doesn't have permission to the dir, it can't write to it. You need to set the permissions, or the authority of the process that Solr is running as appropriately. Best, Erick On Wed, Oct 9, 2013 at 8:54 AM, Said Chavkin wrote: > Hello.

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Erick Erickson
If you create a Wiki login, I'll be happy to add you to the contributors list. It's always valuable to have fresh eyes update docs while the ambiguities are still fresh! Erick On Wed, Oct 9, 2013 at 8:37 AM, Jan Van Besien wrote: > On Wed, Oct 9, 2013 at 2:15 PM, Erick Erickson > wrote: >> Thi

Permisions didn't check when call discoverUnder

2013-10-09 Thread Said Chavkin
Hello. When in solr/home directory exists directory to which solr do not have rights, then solr failed to start with exception: 2108 [main] INFO org.apache.solr.core.CoresLocator - Looking for core definitions underneath /var/lib/solr 2109 [main] ERROR org.apache.solr.servlet.SolrDispatchFilter -

Re: Find documents that are composed of % words

2013-10-09 Thread Aloke Ghoshal
Hi Shahzad, Have you tried with the Minimum Should Match feature: http://wiki.apache.org/solr/ExtendedDisMax#mm_.28Minimum_.27Should.27_Match.29 Regards, Aloke On Wed, Oct 9, 2013 at 4:55 PM, Otis Gospodnetic wrote: > Hi, > > You can take your words, combine some % of them with AND. Then take

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Jan Van Besien
On Wed, Oct 9, 2013 at 2:15 PM, Erick Erickson wrote: > This worked for me if I do NOT have a core.properties at all in my new > core. Personally I think the behavior in 4.4 was dangerous, what > happens if you mis-type the command for instance? You could do Bad > Things to the old core you were

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Erick Erickson
Shawn: Hmmm, I hadn't thought about that before. The shareSchema stuff is keyed off the absolute directory (and timestamp) of the schema.xml file associated with a core and is about sharing the internal object that holds the parsed schema. Do you know for sure if the fact that this is coming from

Re: {soft}Commit and cache flusing

2013-10-09 Thread Erick Erickson
Tim: I think you're mis-interpreting. By replying to a post with the subject: {soft}Commit and cache flushing but going in a different direction, it's easy for people to think "I'm not interested in that thread, I'll ignore it", thereby missing the fact that you're asking a somewhat different qu

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Erick Erickson
Jan: This worked for me if I do NOT have a core.properties at all in my new core. Personally I think the behavior in 4.4 was dangerous, what happens if you mis-type the command for instance? You could do Bad Things to the old core you were inadvertently re-creating. The core.properties file gets

Re: Find documents that are composed of % words

2013-10-09 Thread Otis Gospodnetic
Hi, You can take your words, combine some % of them with AND. Then take another set of them OR it with the previous set, and so on. Otis Solr & ElasticSearch Support http://sematext.com/ On Oct 9, 2013 6:54 AM, "shahzad73" wrote: > Is there a way that in Solr Query i find documents that is

Re: dynamic field question

2013-10-09 Thread Otis Gospodnetic
David, Yes. Document grouping (aka field collapsing) will help you here. It should also allow you to create a better search experience on the front end - it's often better to narrow down where in a large document a match is than give users a large doc and say: we know the match is in here somewhe

Re: Collection API wrong configuration

2013-10-09 Thread maephisto
Yes, the problem described in the ticket is what I'm also confronting with. -- View this message in context: http://lucene.472066.n3.nabble.com/Collection-API-wrong-configuration-in-4-5-0-tp4094319p4094335.html Sent from the Solr - User mailing list archive at Nabble.com.

Find documents that are composed of % words

2013-10-09 Thread shahzad73
Is there a way that in Solr Query i find documents that is composed of n number of words. for example here is the list of words - Love - Ice - Cream - Sunny - I - To - A - On - Elephant - Balloon And a percentage such as: 80% Let’s assume you’re analyzing the text of the following sentence.

Re: Collection API wrong configuration

2013-10-09 Thread primoz . skale
Works fine at my end. I use Solr 4.5.0 on Windows 7. I tried: >zkcli.bat -cmd upconfig -zkhost localhost:9000 -d ..\solr\collection2\conf -n my_custom_collection >java -Djetty.port=8001 -DzkHost=localhost:9000 -jar start.jar and finally http://localhost:8001/solr/admin/collections?action=CRE

Re: Collection API wrong configuration

2013-10-09 Thread Shalin Shekhar Mangar
This may be a bug in 4.5 Another user has also reported this bug: https://issues.apache.org/jira/browse/SOLR-5307 On Wed, Oct 9, 2013 at 3:51 PM, maephisto wrote: > Using Solr 4.4.0 the same scenarion behaves as expected. > > Can anyone else try this, to check if it this only happens with 4.5.

Re: Collection API wrong configuration

2013-10-09 Thread maephisto
Using Solr 4.4.0 the same scenarion behaves as expected. Can anyone else try this, to check if it this only happens with 4.5.0 and if so, is this a desired behaviour or a bug? -- View this message in context: http://lucene.472066.n3.nabble.com/Collection-API-wrong-configuration-tp4094319p40943

Re: dynamic field question

2013-10-09 Thread Twomey, David
OK. Then the JSON returned would contain a lot of documents that are really sections. This would work fine for the use-case I mentioned but I also use the index for full-text search of the whole document. Therefore, I would need to parse the result JSON in a way that combines the solr docs return

Collection API wrong configuration

2013-10-09 Thread maephisto
I'm experimenting with SolrCloud using Solr 4.5.0 and the Collection API What i did was: 1. upload configuration to ZK zkcli.sh -cmd upconfig -zkhost 127.0.0.1:8993 -d solr/my_custom_collection/conf/ -n my_custom_collection 2. create a collection using the api: /admin/collections?action=CREATE&n

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
The synonyms.txt has defined the next associations defined. AIO=>All in one aio=>all-in-one Regards. On Wed, Oct 9, 2013 at 11:05 AM, Alvaro Cabrerizo wrote: > No, it has no synonyms. > > > On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI wrote: > >> Does "two" has a synonym of "in" and "one"? >

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI wrote: > Does "two" has a synonym of "in" and "one"? > > > 2013/10/9 Furkan KAMACI > >> Does "two" has a synonym of "in" and "one"? >> >> >> 2013/10/9 Alvaro Cabrerizo >> >>> Sure, >>> >>> Find attached the screenshots with

Re: synonyms and term position

2013-10-09 Thread Furkan KAMACI
Does "two" has a synonym of "in" and "one"? 2013/10/9 Furkan KAMACI > Does "two" has a synonym of "in" and "one"? > > > 2013/10/9 Alvaro Cabrerizo > >> Sure, >> >> Find attached the screenshots with almost all the analysis, (dont worry >> about the lowercase and the porter stemmer) >> >> Regar

Re: no such field error:smaller big block size details while indexing doc files

2013-10-09 Thread sweety
I will try using solrJ. Now I tried indexing .docx files and I get some different error,logs are: SEVERE: null:java.lang.RuntimeException: java.lang.VerifyError: (class: org/apache/poi/extractor/ExtractorFactory, method: createExtractor signature: (Lorg/apache/poi/poifs/filesystem/DirectoryNode;

Re: no such field error:smaller big block size details while indexing doc files

2013-10-09 Thread sweety
I will try using solrj.Thanks. but I tried to index .docx file I am getting  some different error: SEVERE: null:java.lang.RuntimeException: java.lang.VerifyError: (class: org/apache/poi/extractor/ExtractorFactory, method: createExtractor signature: (Lorg/apache/poi/poifs/filesystem/DirectoryNode

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Sure, Find attached the screenshots with almost all the analysis, (dont worry about the lowercase and the porter stemmer) Regards. On Wed, Oct 9, 2013 at 10:17 AM, Furkan KAMACI wrote: > Could you send screenshot of admin Analysis page when trying to analyze > that words? > > > 2013/10/9 Al

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Furkan KAMACI
You can have more information from here: https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files 2013/10/9 xinwu > I remember I must put the > "-Dbootstrap_confdir=/opt/Solr_home/collection1/conf > -Dcollection.configName=solrConfig " in the catalina.sh .

Re: synonyms and term position

2013-10-09 Thread Furkan KAMACI
Could you send screenshot of admin Analysis page when trying to analyze that words? 2013/10/9 Alvaro Cabrerizo > Hi: > > I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a > problem using SynonymFilterFactory within the process chain > SynonymFilterFactory, StopFilterFac

Re: SolrCloud High Availability during indexing operation

2013-10-09 Thread Furkan KAMACI
Hi Saurabh, Your link does not work (it is broken). 2013/10/9 Saurabh Saxena > Pastbin link http://pastebin.com/cnkXhz7A > > I am doing a bulk request. I am uploading 100 files, each file having 100 > docs. > > -Saurabh > > > On Tue, Oct 8, 2013 at 7:39 PM, Mark Miller wrote: > > > The attachm

Re: SolrJ best pratices

2013-10-09 Thread Furkan KAMACI
I suggest you to look at here: http://wiki.apache.org/solr/Solrj?action=fullsearch&context=180&value=cloudsolrserver&titlesearch=Titles#Using_with_SolrCloud 2013/10/9 Shawn Heisey > On 10/7/2013 3:08 PM, Mark wrote: > >> Some specific questions: >> - When working with HttpSolrServer should we k

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread xinwu
I remember I must put the "-Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig " in the catalina.sh . Is it means that solrCloud must have one ,and only one, schema? -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-s

Re: dynamic field question

2013-10-09 Thread Aloke Ghoshal
Hi David, A separate Solr document for each section is a good option if you also need to handle phrases, case, special characters, etc. within the title field. How do you map them to dynamic fields? E.g.: "Appendix for cities", "APPENDIX 1: Cities" Regards, Aloke On Wed, Oct 9, 2013 at 9:45 AM

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Anshum Gupta
You can simply have multiple collections, each independent of each other on the schema but could run on the same instance/jvm if you want. On Wed, Oct 9, 2013 at 12:36 PM, xinwu wrote: > Hi all, > > I want to use the multiple schemas in the same solrCloud, is it allowed? > > If it is allowed,ho

synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory, StopFilterFactory . I have configured synonyms.txt to expand the word AIO as: all-in-one. Well, when using solr 1.4 I get the following r

Multiple schemas in the same SolrCloud ?

2013-10-09 Thread xinwu
Hi all, I want to use the multiple schemas in the same solrCloud, is it allowed? If it is allowed,how? These schemas may have no relation. Thank You. Dai. -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279.html Sent from th