Re: Sorting order of suggested words

2017-01-19 Thread Keiichi MORITA
I found the option `comparatorClass`. https://wiki.apache.org/solr/SpellCheckComponent#Custom_Comparators_and_the_Lucene_Spell_Checkers_.28IndexBasedSpellChecker.2C_FileBasedSpellChecker.2C_DirectSolrSpellChecker.29 If I want to sort suggested words alphabetically, need to set a custom comparator

Information on classifier based key word suggestion

2017-01-19 Thread Shamik Bandopadhyay
Hi, I'm exploring a way to suggest keywords/tags based on a text snippet. I have a fairly small set of the taxonomy of product, release, category, type, etc. stored in an in-memory database. What I'm looking at is a tool which will analyze a given text, suggest not only the fields associated wit

Re: CloudSolrStream can't set the setZkClientTimeout and setZkConnectTimeout properties

2017-01-19 Thread Will Martin
Default behavior. Client - Server negotiate 2/3 of min (max server, max client). This allows a client time to search for a new leader before all of its time consumed. zookeeper user @ apache org -will On 1/19/2017 12:59 PM, Yago Riveiro wrote: I can see some reconnects in my logs, the proc

Re: Will Solr flush docs to disk when ram buffer is full (time of auto commit is not reached yet)?

2017-01-19 Thread Jan Høydahl
It will flush buffer to disk as a new segment without opening a new searcher. I guess trans-log will be rotated too, but not sure. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 19. jan. 2017 kl. 21.16 skrev Ziyuan Qin : > > Hi All, > > I'm trying to understand ho

Will Solr flush docs to disk when ram buffer is full (time of auto commit is not reached yet)?

2017-01-19 Thread Ziyuan Qin
Hi All, I'm trying to understand how Solr works with disk IO during and between hard commits. Wish you can help me. Let's assume Softcommit is turned off. Autocommit is turned on. Then during a hard commit: 1. The tlog is truncated: A new tlog is started. (Disk IO involved) 2. The current in

Removing duplicate values from fields filled with copyField...

2017-01-19 Thread Georgios Petasis
Hi all, It seems that this is a popular request (remove duplicates generated from copyField), but I am not sure that I have understood the answer. Can somebody point to a correct answer for this issue? I have understand that this involves "update request processors", but I am not sure I unde

Removing duplicate values from fields filled with copyField...

2017-01-19 Thread Georgios Petasis
Hi all, It seems that this is a popular request (remove duplicates generated from copyField), but I am not sure that I have understood the answer. Can somebody point to a correct answer for this issue? I have understand that this involves "update request processors", but I am not sure I unde

Re: indexing error - 6.3.0

2017-01-19 Thread Joe Obernberger
Another data point - the 5 node cluster does have another collection on it that is large (maybe 500G in HDFS) that did have field guessing enabled on it, but it is a static collection (I'm not adding data to it). I've just removed that collection and am running the test again - it's gotten a l

Re: CloudSolrStream can't set the setZkClientTimeout and setZkConnectTimeout properties

2017-01-19 Thread Yago Riveiro
I can see some reconnects in my logs, the process of consuming the stream doesn't broke and continue as normal. The timeout is 10s but I can see in logs that after 6s the reconnect is triggered, I don't know if it's the default behaviour or the zk timeout it's not honoured. - Best regards

Re: indexing error - 6.3.0

2017-01-19 Thread Joe Obernberger
Thank you Erick! For this scenario, I was defining the schema manually (editing managed_schema and pushing to zookeeper), but didn't realize that I had left the field guessing block in the solrconfig.xml file enabled. I've now disabled the field guessing, but still getting errors when indexin

Re: Boolean disjunction with nested documents

2017-01-19 Thread Mikhail Khludnev
It's my pet peeve. Try ?q={!parent which=content_type:activity}(schedule.weekday:1) OR has_schedules:false&debugQuery=true vs ?q= {!parent which=content_type:activity}(schedule.weekday:1) OR has_schedules:false&debugQuery=true and you'll see how space matters. The pro's way is to ?q={!parent which

Re: Phonetic Search

2017-01-19 Thread Walter Underwood
Phonetic search will not match “satpuda” and “satpura” because they sound different. You want fuzzy search. To get fuzzy search that is easy to use in edismax, apply the patch in SOLR-629. https://issues.apache.org/jira/browse/SOLR-629 wunder Wa

Boolean disjunction with nested documents

2017-01-19 Thread Ivan Bianchi
I hope someone can help me because I have spent too many time looking for this issue :( I have 2 kind of documents related with an 1-n relation, in my example this is 1 activity has many schedules. To achieve this I have some inner child document with schedule fields inside the activity document.

Re: Solr on HDFS: AutoAddReplica does not add a replica

2017-01-19 Thread Hendrik Haddorp
HDFS is like a shared filesystem so every Solr Cloud instance can access the data using the same path or URL. The clusterstate.json looks like this: "shards":{"shard1":{ "range":"8000-7fff", "state":"active", "replicas":{ "core_node1":{ "core

Re: Solr on HDFS: AutoAddReplica does not add a replica

2017-01-19 Thread Shawn Heisey
On 1/19/2017 4:09 AM, Hendrik Haddorp wrote: > Given that the data is on HDFS it shouldn't matter if any active > replica is left as the data does not need to get transferred from > another instance but the new core will just take over the existing > data. Thus a replication factor of 1 should also

Re: indexing error - 6.3.0

2017-01-19 Thread Erick Erickson
It looks to me like you're using "field guessing". For production systems I generally don't recommend this. The problem is that it makes the best estimate that it can based on the first document for any given field. So it sees a field with the value 1 and tries to make the field an int. Then 100 do

Re: Solr Shard Splitting Issue

2017-01-19 Thread ekta
Hi Anshum, Thanks for the reply. I had the copy of data that i was experimenting on, and anyways i was doing it later too, after i posted the mail. Some points i want to let you know:- 1. This time i did not change the state of state.json. 2. Rest,I did the same steps as above and still the dat

Re: Phonetic Search

2017-01-19 Thread Vivek Pathak
https://cwiki.apache.org/confluence/display/solr/Phonetic+Matching didnt work for you? On 01/19/2017 05:58 AM, PAVAN wrote: Hi, I am trying to implement phonetic search in my application. Below are indexed terms in solr. "satpura private limited" when user search with "satpuda" it has to d

RE: Joining Across Collections

2017-01-19 Thread Moenieb Davids
Hi Guys Just a quick question on search and join: I have a few cores which is based on a mainframe extract, 1 core per extracted file which resembles a "DB Table" The cores are all somehow linked via 1 to many fields, with a structure similar to a normal ERD Is it possible to return the result

Re: Solr on HDFS: AutoAddReplica does not add a replica

2017-01-19 Thread Hendrik Haddorp
Hi, I'm seeing the same issue on Solr 6.3 using HDFS and a replication factor of 3, even though I believe a replication factor of 1 should work the same. When I stop a Solr instance this is detected and Solr actually wants to create a replica on a different instance. The command for that does

Phonetic Search

2017-01-19 Thread PAVAN
Hi, I am trying to implement phonetic search in my application. Below are indexed terms in solr. "satpura private limited" when user search with "satpuda" it has to display the above result. Below is the configuration

Re: How does using cacheKey and lookup behave?

2017-01-19 Thread Mikhail Khludnev
You can have left join in nested entity it gives empty child entities rows in cache. On Wed, Jan 18, 2017 at 9:59 PM, Kaushik wrote: > I use the cacheKey, cacheLookup, SortedMapBackedCache in the Data Import > Handler of Solr 5.x to join two or more entities. Does this give me an > equivalent of

Re: Joining Across Collections

2017-01-19 Thread Mikhail Khludnev
It seems like it can be done by just negating join query or I'm missing something. On Wed, Jan 18, 2017 at 11:32 AM, nabil Kouici wrote: > Hi All, > I'm using join across collection feature to do an inner join between 2 > collections. It works fine. > Is it possible to use this feature to compa