Re: Solr PDF parsing failing with java error

2017-06-26 Thread Erick Erickson
Sure, someone changed the system variable "solr.install.dir" (i.e. -Dsolr.install.dir=some other place). Or removed it. Or changed the startup script. Or I've gotten very skeptical of "we didn't change anything but suddenly it stopped working". Usually it's something someone's changed unbeknow

Re: Boosting Documents using the field Value

2017-06-26 Thread govind nitk
Hi Erick, I accept, I should have mentioned the what I was doing first. field types: one_query is "string", one_score is float. So No explicit analyzers. mentioned sow=false. and escaping as you mentioned. But still the error persist. - undefined field "cloud" Will get back. Regards, Givind O

Re: Solr PDF parsing failing with java error

2017-06-26 Thread MatthewMeredith
Thanks so much for the reply, Erick! I haven't touched anything in several months; I got a message from the client I built the website for saying the PDF files they're putting into the folder weren't being indexed so I went in to investigate and discovered the error. Here's the applicable part of

Re: Solr PDF parsing failing with java error

2017-06-26 Thread Erick Erickson
Well, assuming you didn't, say, install a new Solr or some such it looks like somebody removed some of the jar files that Tika depends on, they're in the contrib area. Or changed the solrconfig.xml file to not contain the something like: BTW, for various reasons I prefer to do the heavy Ti

Re: Boosting Documents using the field Value

2017-06-26 Thread Erick Erickson
bq: So, ultimate goal is when the exact query matches in field one_query, apply boost of one_score It would have been helpful to have made that statement in the first place, would have saved some false paths. What is your analysis chain here? If it's anything like "text_general" or the like then

Re: admin/metrics API or read JMX by jolokia?

2017-06-26 Thread Tech Id
Yes, this is really good to know. With Jolokia, it is difficult to parse the output a bit because of the presence of special characters in the mbean name like ' http://localhost:17330/jolokia/read/solr!/my!-collection_shard1_replica2:*' The presence of slashes etc. makes parsing and querying a bi

Re: Boosting Documents using the field Value

2017-06-26 Thread govind nitk
Hi Developers, Erick I am able to add boost through function as below: bf=if(termfreq(one_query,"google"),one_score,0) Problem is when I say "google cloud" as query, it gives error: undefined field: \"cloud\"" I tried encoding the query(%20, + for space), but not able to get it working. So, ult

Re: Problem in documentation -- authentication JSON fails validation

2017-06-26 Thread Cassandra Targett
I have a commit locally that I will push shortly that fixes the JSON on that page for 7.0 (and 6.7 if/when it happens). I ran all the JSON examples through a linter and found a few additional problems that should be fixed now. On Sat, Jun 24, 2017 at 1:13 PM, Chris Ulicny wrote: > I haven't actua

Solr PDF parsing failing with java error

2017-06-26 Thread MatthewMeredith
I have a shell script set up to clear a solr core and re-index a folder of PDF files nightly like so: cd /opt/solr/ && bin/post -c comox_core -host 67.231.17.10 -d "attr_is_pdf:true" && bin/post -c comox_core -host 67.231.17.10 -filetypes pdf /home/townofco/public_html/modx/assets/pdfs -params "

Re: async backup

2017-06-26 Thread Damien Kamerman
A regular backup creates the files in this order: drwxr-xr-x 2 root root 63 Jun 27 09:46 snapshot.shard7 drwxr-xr-x 2 root root 159 Jun 27 09:46 snapshot.shard8 drwxr-xr-x 2 root root 135 Jun 27 09:46 snapshot.shard1 drwxr-xr-x 2 root root 17

Re: async backup

2017-06-26 Thread Damien Kamerman
Yes, the async command returns, and then I poll with REQUESTSTATUS. On 27 June 2017 at 01:24, Varun Thacker wrote: > Hi Damien, > > A backup command with async is supposed to return early. It is start the > backup process and return. > > Are you using the REQUESTSTATUS ( > http://lucene.apache.o

Re: SOLR Suggester returns either the full field value or single terms only

2017-06-26 Thread Angel Todorov
hi Alessandro, Thanks very much. I've read your article - which is really great by the way! , and configured my suggester in the following way: mySuggester FreeTextLookupFactory DocumentDictionaryFactory content_suggest true 3 text_

Re: Dynamic fields vs parent child

2017-06-26 Thread Saurabh Sethi
Number of dynamic fields will be in thousands (millions of users + thousands of events shared between subsets of users). We also thought about indexing in one field with value being fieldname_fieldvalue. Since we support range queries for dates and numbers, it won't work out of box. On Mon, Jun 2

Re: Dynamic fields vs parent child

2017-06-26 Thread Erick Erickson
How many distinct fields do you expect across _all_ documents? That is, if doc1 has 10 dynamic fields and doc2 has 10 dynamic fields, will there be exactly 10 fields total or more than 10 when you consider both documents? 100s of fields total across all documents is a tractable problem. thousands

Dynamic fields vs parent child

2017-06-26 Thread Saurabh Sethi
We have two requirements: 1. Indexing and storing event id and its timestamp. 2. Indexing and storing custom field name and value. The fields can be of any type, but for now lets say they are of types string, date and number. The events and custom fields for any solr document can easily be in hun

Re: async backup

2017-06-26 Thread Varun Thacker
Hi Damien, A backup command with async is supposed to return early. It is start the backup process and return. Are you using the REQUESTSTATUS ( http://lucene.apache.org/solr/guide/6_6/collections-api.html#collections-api ) API to validate if the backup is complete? On Sun, Jun 25, 2017 at 10:28

Re: SOLR Suggester returns either the full field value or single terms only

2017-06-26 Thread govind nitk
Hi Alessandro, Thanks for clarification. On Mon, Jun 26, 2017 at 4:53 PM, alessandro.benedetti wrote: > " Don't use an heavy Analyzers, the suggested terms will come from the > index, > so be sure they are meaningful tokens. A really basic analyser is > suggested, > stop words and stemming ar

Re: Using of Streaming to join between shards

2017-06-26 Thread Susheel Kumar
You may want to start with innerJoin which is the simple typical join in database world. On Mon, Jun 26, 2017 at 1:46 AM, mganeshs wrote: > Hi Erick, > > My scenario goes with two kind of SOLR documents > > Document #1 - Real document > #D_uniqueId #D_documentId(unique), #D_documentname, #D_docu

Re: SOLR Suggester returns either the full field value or single terms only

2017-06-26 Thread alessandro.benedetti
" Don't use an heavy Analyzers, the suggested terms will come from the index, so be sure they are meaningful tokens. A really basic analyser is suggested, stop words and stemming are not " This means that your suggestions will come from the index, so if you use heavy analysers you can get terms su

Re: SOLR Suggester returns either the full field value or single terms only

2017-06-26 Thread govind nitk
Hi alessandro, Really nice article. Can you brief us on "*Don't use an heavy Analyzers*" ? Regards, Govind On Mon, Jun 26, 2017 at 2:19 PM, alessandro.benedetti wrote: > Hi Angel, > your are looking for the Free Text lookup approach. > You find more info in [1] and [2] > > [1] > https://lucen

Re: SOLR Suggester returns either the full field value or single terms only

2017-06-26 Thread alessandro.benedetti
Hi Angel, your are looking for the Free Text lookup approach. You find more info in [1] and [2] [1] https://lucene.apache.org/solr/guide/6_6/suggester.html#Suggester-FreeTextLookupFactory [2] http://alexbenedetti.blogspot.co.uk/2015/07/solr-you-complete-me.html - --- Alessandro

strdist function gives error

2017-06-26 Thread govind nitk
Hi Team, solr 6.5.1 on ubuntu 14.04: strdist function gives error for comparison as below: strdist(myfield,"google cloud","jw") I am getting below error: "error": {"metadata": ["error-class","org.apache.solr.common.SolrException", "root-error-class","org.apache.solr.search.SyntaxError"],"msg