Re: SolrCloud 4.8.0 upgrade

2015-04-20 Thread Vincenzo D'Amore
Hi, I'm seriously thinking to upgrade, but how? Could I upgrade one instance at time or should I stop all the instances and upgrade and restart everything? Ciao, Vincenzo -- Vincenzo D'Amore skype: free.dev mobile: +39 349 8513251 > On Apr 18, 2015, at 2:13 AM, Vincenzo D'Amore wrote: > > G

SolrCloud 4.8.0 - load average high during indexing

2015-04-20 Thread Vincenzo D'Amore
Hi all, I'm experiencing a very high load average (6/7) during documents indexing. Some time SolrCloud response can be returned in more than 5 seconds. My SolrCloud cluster have 3 nodes, and my collection have 3 shards and 6 replicas. I suppose that all this load is due to the replica syncing, s

Re: solr 4.8.0 update synonyms in zookeeper splitted files

2015-04-20 Thread Vincenzo D'Amore
Hi Shawn, Thanks again for the answer. I'm not using implicit document routing. I have restarted all the nodes (tomcat stop/start), but after a couple of days or even less, we have again random results (sometimes). If I have different replicas of my index with different settings, how can I res

Re: Differentiating user search term in Solr

2015-04-20 Thread Walter Underwood
I’ve been wanting a “free text” query parser for a while. We could build some cool stuff on that: auto-phrasing, entity extraction and weighting, CJK tokenization, … For reference, here are some real-world user queries I have needed to deal with. These have exactly matched content. * +/- * .ha

Re: Differentiating user search term in Solr

2015-04-20 Thread Steven White
Hi Erick, I think you missed my point. My request is, Solr support a new URL parameter. If this parameter is set, than EVERYTHING in q is treated as raw text (i.e.: Solr will do the escaping vs. the client). Thanks Steve On Mon, Apr 20, 2015 at 1:08 PM, Erick Erickson wrote: > How does that

Re: Solr Index data lost

2015-04-20 Thread Erick Erickson
Did you commit before you unplugged the drive? Were you able to see data in the admin UI _before_ you unplugged the drive? Best, Erick On Mon, Apr 20, 2015 at 3:58 PM, Vijay Bhoomireddy wrote: > Shawn, > > I haven’t changed any DirectoryFactory setting in the solrconfig.xml as I am > using in

Re: Solr Index data lost

2015-04-20 Thread Shawn Heisey
On 4/20/2015 4:58 PM, Vijay Bhoomireddy wrote: > I haven’t changed any DirectoryFactory setting in the solrconfig.xml as I am > using in a local setup and using the default configurations. > > Device has been unmounted successfully (confirmed through windows message in > the lower right corner).

RE: Solr Index data lost

2015-04-20 Thread Vijay Bhoomireddy
Shawn, I haven’t changed any DirectoryFactory setting in the solrconfig.xml as I am using in a local setup and using the default configurations. Device has been unmounted successfully (confirmed through windows message in the lower right corner). I am using Solr-4.10.2. I simply run a Ctrl-C c

Multiple index.timestamp directories using up disk space

2015-04-20 Thread Rishi Easwaran
Hi All, We are seeing this problem with solr 4.6 and solr 4.10.3. For some reason, solr cloud tries to recover and creates a new index directory - (ex:index.20150420181214550), while keeping the older index as is. This creates an issues where the disk space fills up and the shard never ends up

Re: Solr Index data lost

2015-04-20 Thread Shawn Heisey
On 4/20/2015 2:55 PM, Vijay Bhoomireddy wrote: > I have configured Solr example server on a pen drive. I have indexed some > content. The data directory was under example/solr/collection1/data which is > the default one. After indexing, I stopped the Solr server and unplugged the > pen drive and re

Re: generate uuid/ id for table which do not have any primary key

2015-04-20 Thread Vishal Swaroop
Thanks... Yes that is option we will go forward with. On Apr 20, 2015 10:52 AM, "Kaushik" wrote: > Have you tried select as id, name, age ? > > On Thu, Apr 16, 2015 at 3:34 PM, Vishal Swaroop > wrote: > > > Just wondering if there is a way to generate uuid/ id in data-config > > without using c

Solr Index data lost

2015-04-20 Thread Vijay Bhoomireddy
Hi, I have configured Solr example server on a pen drive. I have indexed some content. The data directory was under example/solr/collection1/data which is the default one. After indexing, I stopped the Solr server and unplugged the pen drive and reconnected the same. Now, when I navigate to the

Re: Has anyone seen this error?

2015-04-20 Thread vsilgalis
I fixed this issue by reloading the core on the leader for the shard. Still curious how this happened, any help would be greatly appreciated. -- View this message in context: http://lucene.472066.n3.nabble.com/Has-anyone-seen-this-error-tp4200975p4201067.html Sent from the Solr - User mailing

Re: Solr Cloud reclaiming disk space from deleted documents

2015-04-20 Thread Rishi Easwaran
So is there anything that can be done from a tuning perspective, to recover a shard that is 75%-90% full, other that get rid of the index and rebuild the data? Also to prevent this issue from re-occurring, looks like we need make our system aggressive with segment merges using lower merge facto

Re: Is it possible to facet on the results of a custom solr function?

2015-04-20 Thread Motulewicz, Michael
Solved my own problem. Using multiple function range query parsers works fine against my custom function &facet.query={!frange l=1 u=1} MyCustomSolrQuery(param1,param2, param3) &facet.query={!frange l=2 u=2} MyCustomSolrQuery(param1,param2, param3) Etc… Gives me the counts for 1 then 2 etc Not

RE: search by person name

2015-04-20 Thread Pedro Figueiredo
Hi Steve, Thanks, it works! I will analyse in detail your solution because I never used the syntax using (). Best regards, Pedro Figueiredo Senior Engineer pjlfigueir...@criticalsoftware.com M. 934058150 Rua Engº Frederico Ulrich, nº 2650 4470-605 Moreira da Maia, Portugal T. +351 229 446 9

Re: Differentiating user search term in Solr

2015-04-20 Thread Erick Erickson
How does that address the example query I gave? q=field1:whatever AND (a AND field:b) OR (field2:c AND "d: is a letter followed by a colon (:)"). bq: "Solr will treat everything in the search string by first passing it to ClientUtils.escapeQueryChars()." would incorrectly escape the colons after

RE: search by person name

2015-04-20 Thread Pedro Figueiredo
Hi Erick, I apologize if I made the wrong impression it was not my intention. I've tried a few extra filters and tokens and with a few extra searchs in google I found the proximity parameter that solved my issue. Anyway, many thanks for your feedback and again, I apologize for any misunders

Re: Has anyone seen this error?

2015-04-20 Thread vsilgalis
The leader in the cluster is what is throwing the error. One of the stack traces: However I didn't notice this one before which has a bit more info: org.apache.solr.common.SolrException: Conflict request: http://:8080/sol

Re: Differentiating user search term in Solr

2015-04-20 Thread Steven White
Hi Erick, I didn't know about ClientUtils.escapeQueryChars(), this is good to know. Unfortunately I cannot use it because it means I have to import Solr classes with my client application. I want to avoid that and create a lose coupling between my application and Solr (just rely on REST). My sug

Re: search by person name

2015-04-20 Thread Yavar Husain
In this case q=name:(ana jose) will work, but suppose if it is to be searched in full text field It might have poor recall, It will also produce document like "San Jose is better than Santa Ana" which was not the user intent. Erick's solution "ana jose"~2 is capturing the intent too. On Mon, Apr

Re: Search in Solr Index

2015-04-20 Thread Vijaya Narayana Reddy Bhoomi Reddy
Thanks Matt and Yavar for the suggestion. Now I have fixed the issue. For others benefit, the issue was with defining the fields as String. Now I changed them to text_general. Also, instead of indexing these individual fields, I created corresponding copyFields for each of them, where dest field

Re: Differentiating user search term in Solr

2015-04-20 Thread Erick Erickson
Steve: In short, no. There's no good way for Solr to solve this problem in the _general_ case. Well, actually we could create parsers with rules like "if the colon is inside a paren, escape it). Which would completely break someone who wants to form queries like q=field1:whatever AND (a AND field

Re: search by person name

2015-04-20 Thread Steven White
Why not just use q=name:(ana jose) ? Than missing words or words order won't matter. No? Steve On Mon, Apr 20, 2015 at 12:26 PM, Erick Erickson wrote: > First, a little patience on your part please, we're all volunteers here. > > Second, what have you done to try to analyze the problem? Have

Re: Has anyone seen this error?

2015-04-20 Thread Erick Erickson
You have to provide a lot more context here, please review: http://wiki.apache.org/solr/UsingMailingLists. The root of the problem is often much farther down the exception trace. Best, Erick On Mon, Apr 20, 2015 at 8:16 AM, vsilgalis wrote: > We are getting this on a couple of nodes wondering if

Re: search by person name

2015-04-20 Thread Erick Erickson
First, a little patience on your part please, we're all volunteers here. Second, what have you done to try to analyze the problem? Have you tried adding &debgu=query to to your URL? Looked at the analysis page? Anything else? You might review: http://wiki.apache.org/solr/UsingMailingLists My gue

Re: Search in Solr Index

2015-04-20 Thread Yavar Husain
There might be issues with your default search field. Suppose if you are searching field named "MyTestField" then give your query as MyTestField:Birmingham and see if you get any results. As Matt suggested there might be some issues with the way you have done tokenization/analysis etc. On Mon, A

RE: Search in Solr Index

2015-04-20 Thread Matt Kuiper
What type of field are you using? String? If so try another type, like text_general. I believe with type String the contents are stored in the index exactly as they are inputted into the index. So a search hit will have to match exactly the full value of the field, I assume in your case "Birm

Re: Multilevel nested level support using Solr

2015-04-20 Thread Steven White
Thanks Andy. I have been thinking along the same line as your solution, and your solution is what looks like I will have to do. In summary, there is no Solr built-in way to achieve my need, I have to construct my document and build a query to get this working. Steve On Mon, Apr 20, 2015 at 10:5

Re: Search in Solr Index

2015-04-20 Thread Vijaya Narayana Reddy Bhoomi Reddy
To add further, initially when I give *.*, numfound returns 14170. After giving a search string, numFound returns 0. Now if I change the search string again back to *.*, numFound still returns to 0. I have to refresh the page completely to see 14170 again when *.* is given as the search string. T

Re: Differentiating user search term in Solr

2015-04-20 Thread Steven White
Hi Shawn, If the user types "title:(Apache: Solr Notes)" (without quotes) than I want Solr to treat the whole string as raw text string as if I escaped ":", "(" and ")" and any other reserved Solr keywords / tokens. Using dismax it worked for the ":" case, but I still get SyntaxError if I pass it

Search in Solr Index

2015-04-20 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, I am indexing some data from a Database. Data is getting indexed properly and when I query in the Solr stock UI with query parameters as *.*, I could see the documents with all the fields listed and as well the numFound reflecting properly. However, if I perform a query with a simple string

Re: Solr Cloud reclaiming disk space from deleted documents

2015-04-20 Thread Shawn Heisey
On 4/20/2015 8:44 AM, Rishi Easwaran wrote: > Yeah I noticed that. Looks like optimize won't work since on some disks we > are already pretty full. > Any thoughts on increasing/decreasing 10 or > ConcurrentMergeScheduler to make solr do merges faster. You don't have to do an optimize to need 2x

Has anyone seen this error?

2015-04-20 Thread vsilgalis
We are getting this on a couple of nodes wondering if there is a way to recover the node: Setting up to try to start recovery on replica http:///solr/classic_bt/ after: org.apache.solr.common.SolrException: Conflict Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/H

RE: Multilevel nested level support using Solr

2015-04-20 Thread Andrew Chillrud
Don't know if this is what you are looking for, but we had a similar requirement. In our case each folder had a unique identifier associated with it. When generating the Solr input document our code populated 2 fields, parent_folder, and folder_hierarchy (multi-valued), and for a document in the

RE: Mutli term synonyms

2015-04-20 Thread Davis, Daniel (NIH/NLM) [C]
Handling MESH descriptor preferred terms and such is similar. I encountered this during evaluation of Solr for a project here at NLM. We decided to use Solr for different projects instead. I considered the following approaches: - use a custom tokenizer at index time that indexed all of t

Re: generate uuid/ id for table which do not have any primary key

2015-04-20 Thread Kaushik
Have you tried select as id, name, age ? On Thu, Apr 16, 2015 at 3:34 PM, Vishal Swaroop wrote: > Just wondering if there is a way to generate uuid/ id in data-config > without using combination of fields in query... > > data-config.xml > > >batchSize="2000" > name="test" > type="Jdbc

Mutli term synonyms

2015-04-20 Thread Kaushik
Hello, Reading up on synonyms it looks like there is no real solution for multi term synonyms. Is that right? I have a use case where I need to map one multi term phrase to another. i.e. Tween 20 needs to be translated to Polysorbate 40. Any thoughts as to how this can be achieved? Thanks, Kaush

Re: Solr Cloud reclaiming disk space from deleted documents

2015-04-20 Thread Rishi Easwaran
Yeah I noticed that. Looks like optimize won't work since on some disks we are already pretty full. Any thoughts on increasing/decreasing 10 or ConcurrentMergeScheduler to make solr do merges faster. -Original Message- From: Gili Nachum To: solr-user Sent: Sun, Apr 19, 20

RE: search by person name

2015-04-20 Thread Pedro Figueiredo
Any help please? PF -Original Message- From: Pedro Figueiredo [mailto:pjlfigueir...@criticalsoftware.com] Sent: 20 de abril de 2015 14:19 To: solr-user@lucene.apache.org Subject: RE: search by person name yes Pedro Figueiredo Senior Engineer pjlfigueir...@criticalsoftware.com M. 93405

RE: JSON Facet & Analytics API in Solr 5.1

2015-04-20 Thread Davis, Daniel (NIH/NLM) [C]
Indeed - XML is not "human readable" if it contains colons, JSON is not "human readable" if it is too deep, and the objects/keys are not semantic. I also vote for flatter. -Original Message- From: Otis Gospodnetic [mailto:otis.gospodne...@gmail.com] Sent: Friday, April 17, 2015 11:16 PM

Re: Multilevel nested level support using Solr

2015-04-20 Thread Doug Turnbull
You might want to look at SirenDB from Sindice. It's a lucene codec that allows native modeling of arbitrary hierarchies. http://siren.sindice.com On Friday, April 17, 2015, Steven White wrote: > Hi folks, > > In my DB, my records are nested in a folder base hierarchy: > > > > rec

Deploying SolrCloud 5 on Windows

2015-04-20 Thread Rahiem Burgess
Hello all, I am new to Solr and I looking for advice and tips on deploying SolrCloud in a Windows production environment. Rahiem S. Burgess Sr. Software Engineer Enterprise Integration Services Johns Hopkins University 5801 Smith Avenue Davis Building Baltimore, MD 21209 Email: rah...@jhu.edu<

Re: Differentiating user search term in Solr

2015-04-20 Thread Shawn Heisey
On 4/20/2015 7:41 AM, Steven White wrote: > In my application, a user types "Apache Solr Notes". I take that text and > send it over to Solr like so: > > > http://localhost:8983/solr/db/select?q=title:(Apache%20Solr%20Notes)&fl=id%2Cscore%2Ctitle&wt=xml&indent=true&q.op=AND > > And I get a hit on

Re: Multilevel nested level support using Solr

2015-04-20 Thread Steven White
Re sending to see if anyone can help. Thanks Steve On Fri, Apr 17, 2015 at 12:14 PM, Steven White wrote: > Hi folks, > > In my DB, my records are nested in a folder base hierarchy: > > > > record_1 > record_2 > > record_3 > record_4 >

Re: Differentiating user search term in Solr

2015-04-20 Thread Steven White
Hi Hoss, Thanks for that lengthy feedback, it is much appreciated. Let me reset and bear in mind that I'm new to Solr. I'm using Solr 5.0 (will switch over to 5.1 later this week) and my need is as follows. In my application, a user types "Apache Solr Notes". I take that text and send it over

RE: search by person name

2015-04-20 Thread Pedro Figueiredo
yes Pedro Figueiredo Senior Engineer pjlfigueir...@criticalsoftware.com M. 934058150 Rua Engº Frederico Ulrich, nº 2650 4470-605 Moreira da Maia, Portugal T. +351 229 446 927 | F. +351 229 446 929 www.criticalsoftware.com PORTUGAL | UK | GERMANY | USA | BRAZIL | MOZAMBIQUE | ANGOLA A CMMI® LE

Re: search by person name

2015-04-20 Thread Rafal Kuc
Hello, How does you query look like? Do you use phrase query, like q=name:"ana jose" ? --- Regards, Rafał Kuć > Wiadomość napisana przez Pedro Figueiredo > w dniu 20 kwi 2015, o godz. 15:06: > > Hi all, > > Can anyone advise the tokens and filters to use, for the most common way to > s

search by person name

2015-04-20 Thread Pedro Figueiredo
Hi all, Can anyone advise the tokens and filters to use, for the most common way to search by people’s names. The basics requirements are: For field name – “Ana Maria José” The following search’s should return the example: 1. “Ana” 2. “Maria” 3. “Jose” 4. “ana

Re: Correspondance table ?

2015-04-20 Thread Bruno Mannina
Hi Jack, ok, it's not for many millions of users, just max 100 by day. it will be used on traditional "PC" and also on mobile clients. Then, I need to do test to verify the possibility. Thx Le 20/04/2015 14:20, Jack Krupansky a écrit : It depends on the specific nature of your clients. Is the

Can't find result of autophrase filter

2015-04-20 Thread Mike Thomsen
This is the content of my autophrases.txt file: al qaeda in the arabian peninsula seat belt I've attached a screenshot showing the analysis view of the index. When I query for al_qaeda_in_the_arabian_peninsula or alqaedainthearabianpeninsula, nothing comes back even though at least the latter app

Re: Correspondance table ?

2015-04-20 Thread Jack Krupansky
It depends on the specific nature of your clients. Is they in-house users, like only dozens or hundreds, or is this a large web app with many millions of users and with mobile clients as well as traditional "PC" clients? If it feels too much to do in the client, then a middleware API service layer

Re: variable length ngramfilter highlights

2015-04-20 Thread Bjørn Hjelle
Dan, you could try do add luceneMatchVersion= "4.3" to your fieldType, like so: That worked for me with Solr versions prior to Solr 5. Bjørn On Thu, Apr 9, 2015 at 2:19 PM, Dan Sullivan wrote: > Hi, > > > I apologize if this question is redundant. I've spent a few days on it and

Solr 5: hit highlight with NGram/EdgeNgram-fields

2015-04-20 Thread Bjørn Hjelle
with Solr 4.10.3 I was advised to set luceneMatchVersion to "4.3" to make hit highlight work with NGram/EdgeNgram- fields, like this: In Solr 5 and 5.1 this seems to not work any more. The complete word is highlighted, not just the part that matches the search term. In Solr admin analysis pag

Re: Correspondance table ?

2015-04-20 Thread Bruno Mannina
Hi Alex, well ok but if I have a big table ? more than 10 000 entries ? is it safe to do that client side ? note: I have one little table but I have also 2 big tables for 2 other fields Le 20/04/2015 10:57, Alexandre Rafalovitch a écrit : The best place to do so is in the client software, sinc

Re: Unsubscribe from Mailing list

2015-04-20 Thread Ere Maijala
There's a wiki page about possible issues and solutions for unsubscribing, see https://wiki.apache.org/solr/Unsubscribing%20from%20mailing%20lists. Regards, Ere 20.4.2015, 12.23, Isha Garg kirjoitti: Hi , Can anyone tell me how to unsubscribe from Solr mailing lists. I tried sending email

Unsubscribe from Mailing list

2015-04-20 Thread Isha Garg
Hi , Can anyone tell me how to unsubscribe from Solr mailing lists. I tried sending email on 'solr-user-unsubscr...@lucene.apache.org', 'general-unsubscr...@lucene.apache.org'. But it is not working for me. Thanks & Regards, Isha Garg RAGE Frameworks/CreditPointe Services Pvt. LTD India Off: +

Re: Correspondance table ?

2015-04-20 Thread Alexandre Rafalovitch
The best place to do so is in the client software, since you are not using it for search in any way. So, wherever you get your Solr's response JSON/XML/etc, map it there. Regards, Alex. Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 20 Apri

Correspondance table ?

2015-04-20 Thread Bruno Mannina
Dear Solr Users, Solr 5.0.0 I have actually around 90 000 000 docs in my solr, and I have a field with one char which represents a category. i.e: value = a, definition : nature and health etc... I have fews categories, around 15. These definition categories can changed during years. Can I use

Find out which MultiValued field got a hit (and a custom highlighter)

2015-04-20 Thread Rodolfo Zitellini
Dear List, I have been studying Solr to build up an index of musical incipit encoded as strings into bibliographical record to retrofit this kind of search into an existing database. Basically we store the incipit data (filtered through a custom TokenFilter) as a multi valued field (one for each di