Indexing a Multivalued field using ContentStreamUpdateRequest in Solr

2015-02-19 Thread Ashish Vishwas Kaduskar
Hello, I use below code snippet to index data from a text file into solr. My text data is a tsv file with 3 fields - id,title and types. The field "types" is a multivalued field and these values are available as comma separated in the text file itself. Here is an example: 123 building house,s

Re: Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread Chris Hostetter
: 1. name:DocumentOne^7 => doc1(score=7) : 2. name:DocumentOne^7 AND place:notExist^3 => doc1(score=7) : 3. place:(34\ High\ Street)^3 => doc1(score=3), doc2(score=3) : 4. name:DocumentOne^7 OR place:(34\ High\ Street)^3 => doc1(score=10), : doc2(score=3) ... : > it's not clear why you nee

Re: [ANNOUNCE] Apache Gora 0.6 Released

2015-02-19 Thread Talat Uyarer
Congras! On Feb 20, 2015 2:59 AM, "Lewis John Mcgibbney" wrote: > Hi Folks, > > The Apache Gora team are pleased to announce the immediate availability of > Apache Gora 0.6. > > This release addresses a modest 47 issues > with some being major improvements, new func

[ANNOUNCE] Apache Gora 0.6 Released

2015-02-19 Thread Lewis John Mcgibbney
Hi Folks, The Apache Gora team are pleased to announce the immediate availability of Apache Gora 0.6. This release addresses a modest 47 issues with some being major improvements, new functionality and dependency upgrades. Most notably the release involves key upgra

Re: Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread J-Pro
how are you defining/specifying these field weights? I define weights inside of a query (name:SomeName^7). it would help if you could give a concrete example of some sample docs, a sample query, and what results you would expect ... the sample input and sample output of the system you are int

Re: Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread Chris Hostetter
: Sure, sorry I did not do it before, I just wanted to take minimum of your : valuable time. So in my custom Similarity class I am trying to implement such : a logic, where score calculation is only based on field weight and a field : match - that's it. In other words, if a field matches the query

Re: Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread J-Pro
Thank you for your answer, Chris. I will reply with inline comments as well. Please see below. : I need to uniquely identify a document inside of a Similarity class during : scoring. Is it possible to get value of unique key of a document at this : point? Can you tell us a bit more about your

Re: Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread Chris Hostetter
: I need to uniquely identify a document inside of a Similarity class during : scoring. Is it possible to get value of unique key of a document at this : point? Can you tell us a bit more about your usecase ... your problem description is a bit vague, and sounds like it may be an "XY Problem"...

Re: Solr date retrieve back UTC

2015-02-19 Thread vsriram30
Thanks Chris for additional info. Thanks, Sriram -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-date-retrieve-back-UTC-tp4187449p4187503.html Sent from the Solr - User mailing list archive at Nabble.com.

Getting unique key of a document inside of a Similarity class.

2015-02-19 Thread J-Pro
Good afternoon. I need to uniquely identify a document inside of a Similarity class during scoring. Is it possible to get value of unique key of a document at this point? For some time I though I can use internal docID for achieving that. Method score(int doc, float freq) is called after eve

Re: Solr date retrieve back UTC

2015-02-19 Thread Chris Hostetter
: to get the UTC back which I thought might not be required as already the : cDate field in that Date class is having the UTC date. general suggestion: your life will be a lot easier if you stop looking at the implementation details of JVM classes -- just because your current JVM implements the

Re: Solr date retrieve back UTC

2015-02-19 Thread vsriram30
Thanks Chris for your quick reply. As you said, I need to do some conversion to get the UTC back which I thought might not be required as already the cDate field in that Date class is having the UTC date. The toString() doesn't actually give me timestamp in UTC format. It gives, Mon Sep 15 12:52:

Re: Solr date retrieve back UTC

2015-02-19 Thread Chris Hostetter
: But when I use SolrJ and get it as object, I am seeing that the UTC date is : of type Date and I am not able to retrieve back the UTC date from it and I : get only long timestamp from that object. : : I also see a private variable in that Date class called as cDate which has : what I want (Date

Solr date retrieve back UTC

2015-02-19 Thread vsriram30
Hi, I am having a date field in my solr schema and I am indexing a proper UTC date to that field. If I am directly querying Solr, I am able to see the field with UTC time in that in the JSON response. But when I use SolrJ and get it as object, I am seeing that the UTC date is of type Date and I a

Re: Solr Lazy startup - load-on-startup missing from web.xml?

2015-02-19 Thread Chris Hostetter
: Hi! Solr is starting up "dormant" for me, until a client wake it up with a : REST request, or I open admin UI, only then the remaining initializing : happens. : Is it something known? based on my recollection of the servlet spec, that sounds like a bug/glitch/config option in your Servlet conta

Re: Collections API - HTTP verbs

2015-02-19 Thread Hrishikesh Gadre
Thanks Mark and Scott. Adding quotes around the URL fixed the problem. Regards Hrishikesh On Thu, Feb 19, 2015 at 7:30 AM, Scott Dawson wrote: > Hrishikesh, > If you're running on Linux or Unix, the first ampersand in the URL is > interpreted as the shell's "run this in the background" operator

Re: what order does solr return the results in if the search is *:*

2015-02-19 Thread Erik Hatcher
It’ll return them in order of them being indexed, generally. If documents are being updated (delete/re-add, effectively) then order would change but still by default ordered as they are in the underlying Lucene index. — Erik Hatcher, Senior Solutions Architect http://www.lucidworks.com

Re: what order does solr return the results in if the search is *:*

2015-02-19 Thread Erick Erickson
I'm pretty sure by internal document id, which changes upon segment merge. If you depend on this, you need to include a field at index time that'll be unchanging and then sort on that. bq: Documents seem to be consistently returned in this order This won't be invariant as the index changes and se

Re: is there a constant for _vesion_-fieldname?

2015-02-19 Thread Erick Erickson
Grepping shows VersionInfo.VERSION_FIELD Best, Erick On Thu, Feb 19, 2015 at 1:45 AM, Clemens Wyss DEV wrote: > Does Solr provider a (Java)constant for "the name of the version field" (ie > _version_)?

what order does solr return the results in if the search is *:*

2015-02-19 Thread Tang, Rebecca
If user searches for *:*, what order does solr return the results in? I expected the results to be returned in index order. (I indexed the documents in the order of the numeric document id from 0 -> ~15,000,000). So when I searched with *:*, I expected the first 10 documents returned to have

Re: Collections API - HTTP verbs

2015-02-19 Thread Scott Dawson
Hrishikesh, If you're running on Linux or Unix, the first ampersand in the URL is interpreted as the shell's "run this in the background" operator and anything beyond the ampersand will not be passed to curl. So Mark is right -- put single quotes around the URL so that it's not interpreted by the s

Re: Committed before 500

2015-02-19 Thread Shawn Heisey
On 2/19/2015 6:30 AM, NareshJakher wrote: > I am using Solr cloud with 3 nodes, at times following error is observed in > logs during delete operation. Is it a performance issue ? What can be done > to resolve this issue > > "Committed before 500 {msg=Software caused connection abort: socket write

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-19 Thread Nitin Solanki
Okay, thanks Shawn.. On Thu, Feb 19, 2015 at 7:59 PM, Shawn Heisey wrote: > On 2/19/2015 4:18 AM, Nitin Solanki wrote: > >Sorry, I think, you both are taking about > > shard splitting but I want node splitting. I have 4 nodes. Each node has > 2 > > shards, So, Now

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-19 Thread Shawn Heisey
On 2/19/2015 4:18 AM, Nitin Solanki wrote: >Sorry, I think, you both are taking about > shard splitting but I want node splitting. I have 4 nodes. Each node has 2 > shards, So, Now, I want 100 Nodes from that 4 nodes and each having 2 > shards. Any Ideas? Node split

Committed before 500

2015-02-19 Thread NareshJakher
I am using Solr cloud with 3 nodes, at times following error is observed in logs during delete operation. Is it a performance issue ? What can be done to resolve this issue "Committed before 500 {msg=Software caused connection abort: socket write error,trace=org.eclipse.jetty.io.EofException" I d

Re: Question on CloudSolrServer API

2015-02-19 Thread Shalin Shekhar Mangar
No, you should reuse the same CloudSolrServer instance for all requests. It is a thread safe object. You could also create a static/common HttpClient instance and pass it to the constructor of CloudSolrServer but even if you don't, it will create one internally and use it for all requests so that c

Auto-correct the phrase/query

2015-02-19 Thread Nitin Solanki
Hello, I want to do same like google phrase/spell correction. If anyone type a query "the dark night" then I need a suggestion like "the dark knight" in Solr. Is there anyway to do this?

Re: Divide 4 Nodes into 100 nodes in Solr Cloud

2015-02-19 Thread Nitin Solanki
Hi Yago & Shawn, Sorry, I think, you both are taking about shard splitting but I want node splitting. I have 4 nodes. Each node has 2 shards, So, Now, I want 100 Nodes from that 4 nodes and each having 2 shards. Any Ideas? On Wed, Feb 18, 2015 at 9:25 PM, Shawn Heis

Re: How to place whole indexed data on cache

2015-02-19 Thread Nitin Solanki
Thanks Dominique. Got your view.. On Wed, Feb 18, 2015 at 11:55 PM, Dominique Bejean < dominique.bej...@eolya.fr> wrote: > Hi, > > As Shawn said, install enough memory in order that all free direct memory > (non heap memory) be used as disk cache. > Use 40% maximum of the available memory for hea

Re: spellcheck.count v/s spellcheck.alternativeTermCount

2015-02-19 Thread Nitin Solanki
I have 48GB of indexed data. I have set spellcheck.count=1 & spellcheck.alternativeTermCount=10 but I am getting only 1 suggestions in suggestion block but Suggestions for collations are coming. *PFA*. for details On Thu, Feb 19, 2015 at 1:50 AM, Dyer, James wrote: > It will try to give you sug

is there a constant for _vesion_-fieldname?

2015-02-19 Thread Clemens Wyss DEV
Does Solr provider a (Java)constant for "the name of the version field" (ie _version_)?

Re: Discrepancy between Full import and Delta import query

2015-02-19 Thread Aniket Bhoi
On Tue, Feb 17, 2015 at 8:21 PM, Aniket Bhoi wrote: > Hi Folks, > > I am running Solr 3.4 and using DIH for importing data from a SQL server > backend. > > The query for Full import and Delta import is the same ie both pull the > same data. > > Full and Delta import query: > > SELECT KB_ENTRY.ADD

Solr Lazy startup - load-on-startup missing from web.xml?

2015-02-19 Thread Gili Nachum
Hi! Solr is starting up "dormant" for me, until a client wake it up with a REST request, or I open admin UI, only then the remaining initializing happens. Is it something known? I can't see any load-on-startup in the web.xml, in Solr.war. Running Solr 4.7.2 over WebSphere 8.5 App loading message

Question on CloudSolrServer API

2015-02-19 Thread Manohar Sripada
Hi All, I am using CloudSolrServer API of SolrJ library from my application to query Solr. Here, I am creating a new connection to Solr for every search that I am doing. Once I got the results I am closing the connection. Is this the correct way? How does Solr create connections internally? Does