update document stuck on: java.net.SocketInputStream.socketRead0

2017-10-26 Thread Nawab Zada Asad Iqbal
Hi, After Solr 7 upgrade, I am realizing that my '/update' request is sometimes getting stuck on this:- - java.net.SocketInputStream.socketRead0(java.io.FileDescriptor, byte[], int, int, int) @bci=0 (Compiled frame; information may be imprecise) - java.net.SocketInputStream.read(byte[], int, in

Re: Need help detecting Relatedness in documents

2017-10-26 Thread Atita Arora
Thanks for the suggestion Anshum , appreciate your response..! I tried using MLT with the field that stores the similarity index of topics this could be related to. But this wasn't really accepted as the solution, as this could not resolve my next stage of the problem where I need to get the effec

Re: Edismax - bq taking precedence over pf

2017-10-26 Thread Chris Hostetter
: ok. Shouldn't pf be applied on top of bq=? that way among the object_types : boosted, if one has "Manufacturing" then it should be listed first? No. bq is an *additive* boost ... documents must match your "main query" to be included, but if document X scores very high against the bq query, an

Re: Need help detecting Relatedness in documents

2017-10-26 Thread Anshum Gupta
I would suggest you look at the mlt query parser. That allows you to find documents similar to a particular documents, and also allows for specifying the field to use for similarity purposes. https://lucene.apache.org/solr/guide/7_0/other-parsers.html#more-like-this-query-parser

Re: Edismax - bq taking precedence over pf

2017-10-26 Thread Josh Lincoln
I was asking about the field definitions from the schema. It would also be helpful to see the debug info from the query. Just add debug=true to see how the query and params were executed by solr and how the calculation was done for each result. On Thu, Oct 26, 2017 at 1:33 PM ruby wrote: > ok.

Re: Edismax - bq taking precedence over pf

2017-10-26 Thread ruby
ok. Shouldn't pf be applied on top of bq=? that way among the object_types boosted, if one has "Manufacturing" then it should be listed first? following are my objects: 1 Configuration typeA Manufacturing <--catch all field where contents of all fields get copied to 2 Manufacturing typeA xy

Failed to create collection SOLR 6.3 HDP 2.6.2

2017-10-26 Thread Dan Caulfield
I'm creating a collection on a new cluster. There are six new Solr nodes using a HDP 2.6.2 cluster for storage. Has anyone seen similar errors? /usr/iopsolr/current/iop-solr/server/scripts/cloud-scripts/zkcli.sh -cmd upconfig -zkhost d2mitphmn1001.edc.nam.gm.com:2181,d2mitphmn1003.edc.nam.gm

Re: TimeoutException, IOException, Read timed out

2017-10-26 Thread Fengtan
Thanks Erick and Emir -- we are going to start with <1> and possibly <2>. On Thu, Oct 26, 2017 at 7:06 AM, Emir Arnautović < emir.arnauto...@sematext.com> wrote: > Hi Fengtan, > I would just add that when merging collections, you might want to use > document routing (https://lucene.apache.org/sol

Re: Edismax - bq taking precedence over pf

2017-10-26 Thread Josh Lincoln
What's the analysis configuration for the object_name field and fieldType? Perhaps the query is matching your catch-all field, but not the object_name field, and therefore the pf boost never happens. On Thu, Oct 26, 2017 at 8:55 AM ruby wrote: > I'm noticing in my following query bq= is takin

Edismax - bq taking precedence over pf

2017-10-26 Thread ruby
I'm noticing in my following query bq= is taking precedence over pf. &q=Manufacturing &qf=Catch_all_Copy_field &pf=object_id^40+object_name^700 &bq=object_rating:(best)^10 &bq=object_rating:(candidate)^8 &bq=object_rating:(placeholder)^5 &bq=object_type_:(typeA)^10 &bq=object_type_:(typeB)^10 &bq=

CVE-2016-6809: Java code execution for serialized objects embedded in MATLAB files parsed by Apache Solr using Apache Tika

2017-10-26 Thread Shalin Shekhar Mangar
CVE-2016-6809: Java code execution for serialized objects embedded in MATLAB files parsed by Apache Solr using Tika Severity: Important Vendor: The Apache Software Foundation Versions Affected: Solr 5.0.0 to 5.5.4 Solr 6.0.0 to 6.6.1 Solr 7.0.0 to 7.0.1 Description: Apache Solr uses Apache Tik

Re: TimeoutException, IOException, Read timed out

2017-10-26 Thread Emir Arnautović
Hi Fengtan, I would just add that when merging collections, you might want to use document routing (https://lucene.apache.org/solr/guide/6_6/shards-and-indexing-data-in-solrcloud.html#ShardsandIndexingDatainSolrCloud-DocumentRouting

Solr require both hl.fl and df same for correct highlighting.

2017-10-26 Thread Amrit Sarkar
Solr version: 6.5.x Why do we need to pass hl.fl and df to be same for correct highlighting? Let us suppose I am highlighting on field: fieldA which has stemming filter on its analysis. Sample doc: {"id":"1", "fieldA":"Vacation"} If I then highlighting request: > "params":{ > "q":"Vacatio

Need help detecting Relatedness in documents

2017-10-26 Thread Atita Arora
Hi , We're working with a productr where the idea is to present the users the related documents in particular timeseries. For an overview think about this as an application which picks up top trending blogposts "topics" which are picked and ingested from various social sites. Further , when you l