Re: OutOfMemoryError for PDF document upload into Solr

2015-01-15 Thread Siegfried Goeschl
Hi Ganesh, you can increase the heap size but parsing a 4 GB PDF document will very likely consume A LOT OF memory - I think you need to check if that large PDF can be parsed at all :-) Cheers, Siegfried Goeschl On 14.01.15 18:04, Michael Della Bitta wrote: Yep, you'll have to increase the

dynamic field value in ValueSource

2015-01-15 Thread Mathijs Corten
Hello, At this moment I'm writing my own Solr plugin with a custom ValueSourceParser and ValueSource; its goal is to read a few fields from the document (for now). I'm currently testing with the following fields: multiValued="false" /> stored="true" multiValued="false" /> The problem is,

Re: Core deletion

2015-01-15 Thread Dominique Bejean
Hi, Is there something in the Solr logs at startup that can explain the deletion? How were the cores created? Using the cores API? Dominique http://www.eolya.fr 2015-01-14 17:43 GMT+01:00 : > > > Hello, > > I am running SOLR 4.10.0 on Tomcat 8. > > The solr.xml file in > .../apache-tomcat-8.0.15_s

groups inside groups

2015-01-15 Thread Dmitry Kan
Hi solr users & developers, Is it possible to group inside the group? First level is a group query, the second level is a single value field of each document inside the first level group with counts. I.e. the trick is, that the second level should contain facet counts on values of that single val

Easiest way to embed solr in a desktop application

2015-01-15 Thread Robert Krüger
Hi, I have been using an embedded instance of solr in my desktop application for a long time and it works fine. At the time when I made that decision (vs. firing up a solr web application within my swing application) I got the impression embedded use is somewhat unsupported and I should expect pro

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Andrea Gazzarini
Hi Robert, I've used the EmbeddedSolrServer in a scenario like that and I never had problems. I assume you're talking about a standalone application, where the whole index resides locally and you don't need any cluster / cloud / distributed feature. I think the usage of EmbeddedSolrServer is

Re: Core deletion

2015-01-15 Thread phiroc
I duplicated an existing core, deleted the data directory and core.properties, updated solrconfig.xml and schema.xml and loaded the new core in SOLR's Admin Panel. The logs contain a few 'index locked' errors: solr.log:INFO - 2015-01-15 14:43:09.492; org.apache.solr.core.CorePropertiesLocator;

Re: Solr groups not matching with terms in a field

2015-01-15 Thread Ahmet Arslan
Hi Naresh, Everything looks correct, what is the problem here? If you want to see more than one document per group, there is a parameter for that which defaults to 1. Ahmet On Thursday, January 15, 2015 9:02 AM, Naresh Yadav wrote: Hi all, I had done following configuration to test Solr

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Robert Krüger
Hi Andrea, you are assuming correctly. It is a local, non-distributed index that is only accessed by the containing desktop application. Do you know if there is a possibility to run the Solr admin UI on top of an embedded instance somehow? Thanks a lot, Robert On Thu, Jan 15, 2015 at 3:17 PM, A

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Ahmet Arslan
Hi Robert, I have never used it myself, but is solr-packager useful in your case? http://sourcesense.github.io/solr-packager/ Ahmet On Thursday, January 15, 2015 4:45 PM, Robert Krüger wrote: Hi Andrea, you are assuming correctly. It is a local, non-distributed index that is only accessed by the c

Replicating external field files under Windows

2015-01-15 Thread Rafał Kuć
Hello! I have a slight problem with replication, and maybe someone has had the same experience and knows if there is a solution. I have a Windows-based Solr installation where I use the external field type: two fields, two external files containing data. The deployment is a standard master - slave. Th

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Erik Hatcher
It’d certainly be easiest to just embed Jetty into your application. You don’t need to have Jetty as a separate process; you could launch it through its friendly Java API, configured to use solr.war. If all you needed was to make HTTP(-like) queries to Solr instead of the full admin UI, your

solr autosuggest to stop/filter suggesting the phrases that ends with stopwords

2015-01-15 Thread Rajesh Hazari
Hi Folks, Solr Version 4.7+ Do we have any analyzer or filter or any plugin in Solr to stop suggesting phrases that end with stopwords? For example, if the suggestions are as below for the query http://localhost.com/solr/suggest?q=jazz+a "suggestion": [ "jazz and", "jazz at", "jazz at lincoln", "jazz
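One possible client-side workaround for the question above, sketched in Python: post-filter the returned suggestions and drop any phrase whose last word is a stopword. This is only an illustration under stated assumptions. The stopword set and the `drop_stopword_endings` helper are hypothetical and are not part of Solr; the sample list reuses the suggestions quoted in the message.

```python
# Hypothetical stopword set (an assumption); in practice it would mirror
# the stopwords file used by the index.
STOPWORDS = {"a", "an", "and", "at", "in", "of", "the"}


def drop_stopword_endings(suggestions, stopwords=STOPWORDS):
    """Return only the suggestions whose last word is not a stopword."""
    kept = []
    for phrase in suggestions:
        words = phrase.lower().split()
        # Keep the phrase only if it is non-empty and does not end in a stopword.
        if words and words[-1] not in stopwords:
            kept.append(phrase)
    return kept


# Suggestions quoted in the message (the original list is truncated).
suggestions = ["jazz and", "jazz at", "jazz at lincoln"]
filtered = drop_stopword_endings(suggestions)
print(filtered)
```

Filtering after the fact can leave fewer suggestions than were requested, so a caller would probably ask Solr for more suggestions than it intends to display.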

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Robert Krüger
Hi Ahmet, at first glance, I'm not sure. Need to look at it more carefully. Thanks, Robert On Thu, Jan 15, 2015 at 3:53 PM, Ahmet Arslan wrote: > Hi Robert, > > Never used by myself but is solr-packager useful in your case? > > http://sourcesense.github.io/solr-packager/ > > Ahmet > > > On Th

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Shawn Heisey
On 1/15/2015 7:44 AM, Robert Krüger wrote: > you are assuming correctly. It is a local, non-distributed index that > is only accessed by the containing desktop application. Do you know > if there is a possibility to run the Solr admin UI on top of an > embedded instance somehow? To have the admin

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Robert Krüger
I was considering the programmatic Jetty option but then I read that Solr 5 no longer supports being run with an external servlet container but maybe they still support programmatic jetty use in some way. atm I am using solr 4.x, so this would work. No idea if this gets messy classloader-wise in an

DisMax search on field only if it exists otherwise fall-back to another

2015-01-15 Thread Neil Prosser
Hopefully this question makes sense. At the moment I'm using a DisMax query which looks something like the following (massively cut-down): ?defType=dismax &q=some query &qf=field_one^0.5 field_two^1.0 I've got some localisation work coming up where I'd like to use the value of one, sparsely popu

Re: Occasionally getting error in solr suggester component.

2015-01-15 Thread Dhanesh Radhakrishnan
Hi, From Solr 4.7 onwards, the implementation of this Suggester has changed. The old SpellChecker-based search component is replaced with a new suggester that utilizes the Lucene suggester module. The latest Solr download is preconfigured with this new suggester. I'm using Solr 4.10 and suggestion are b

Collection shard name

2015-01-15 Thread kuttan palliyalil
When the shard names in a collection are the same across the nodes, data posted to the collection goes to all the shards instead of being distributed among them, i.e. all the shards have the same data, similar to replicas. Is this expected? Regards, Rajesh

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Alexandre Rafalovitch
On 15 January 2015 at 09:53, Ahmet Arslan wrote: > http://sourcesense.github.io/solr-packager/ Does this work with modern Solr? Seems to be a 4-year-old project with no recent updates. Even the parent company seems 'quiet'. But it looks interesting in general. Regards, Alex.

Re: Easiest way to embed solr in a desktop application

2015-01-15 Thread Shawn Heisey
On 1/15/2015 8:06 AM, Robert Krüger wrote: > I was considering the programmatic Jetty option but then I read that Solr 5 > no longer supports being run with an external servlet container but maybe > they still support programmatic jetty use in some way. atm I am using solr > 4.x, so this would work

Re: Occasionally getting error in solr suggester component.

2015-01-15 Thread Michael Sokolov
That sounds like a good approach to me. Of course it depends how often you commit, and what your tolerance is for delay in having suggestions appear, but it sounds as if you have a good understanding of the tradeoffs there. -Mike On 1/15/15 10:31 AM, Dhanesh Radhakrishnan wrote: Hi, From S

RE: OutOfMemoryError for PDF document upload into Solr

2015-01-15 Thread Ganesh.Yadav
Siegfried and Michael Thank you for your replies and help. -Original Message- From: Siegfried Goeschl [mailto:sgoes...@gmx.at] Sent: Thursday, January 15, 2015 3:45 AM To: solr-user@lucene.apache.org Subject: Re: OutOfMemoryError for PDF document upload into Solr Hi Ganesh, you can incr

Re: Collection shard name

2015-01-15 Thread Erick Erickson
How do you get all the shards named identically in the first place? Best Erick On Thu, Jan 15, 2015 at 8:49 AM, kuttan palliyalil wrote: > When the shard names in a collection is same across the nodes, then posting > to the collection the data gets posted to all the shards instead of > distri

Re: Collection shard name

2015-01-15 Thread Shawn Heisey
On 1/15/2015 12:24 PM, Erick Erickson wrote: > How do you get all the shards named identically in the first place? What I have always heard is that this is what Solr 4.0.0 did when creating a collection -- all the cores ended up with the same name as the collection. I have never used 4.0.0, so I

Re: Collection shard name

2015-01-15 Thread kuttan palliyalil
Erick & Shawn, I am using Solr 4.10.2. Here is the create command, keeping the shard= same and changing the name=: http://''/solr/admin/cores?action=CREATE&name=shard1_1&collection=collectionName&shard=shard1&collection.configName=collconf http://''/solr/admin/cores?action=CREATE&name=shard1_2&

Re: Conditions in function query

2015-01-15 Thread shamik
This one worked. if(termfreq(Source,'A'),sum(Likes,3),if(termfreq(Source,'B'),sum(Likes,3),0))
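To make the nested condition easier to read, here is a plain-Python model of what the expression computes for one document. It is a sketch, not Solr code: `boost` and its arguments are hypothetical names, `source_terms` stands in for the indexed terms of the `Source` field, and `likes` for the value of `Likes`. It assumes, as in Solr function queries, that `if()` treats a non-zero `termfreq` as true.

```python
def boost(source_terms, likes):
    """Model of if(termfreq(Source,'A'),sum(Likes,3),
                   if(termfreq(Source,'B'),sum(Likes,3),0))."""

    def termfreq(term):
        # Number of times the term occurs in the document's Source field.
        return source_terms.count(term)

    if termfreq("A"):          # non-zero frequency counts as true
        return likes + 3       # sum(Likes,3)
    if termfreq("B"):
        return likes + 3       # sum(Likes,3)
    return 0                   # neither term present


print(boost(["A"], 5))
print(boost(["C"], 5))
```

Both branches return the same value, so in this model the expression is equivalent to "Likes + 3 if Source contains A or B, otherwise 0".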

Re: Collection shard name

2015-01-15 Thread Shawn Heisey
On 1/15/2015 12:59 PM, Shawn Heisey wrote: > On 1/15/2015 12:24 PM, Erick Erickson wrote: >> How do you get all the shards named identically in the first place? > What I have always heard is that this is what Solr 4.0.0 did when > creating a collection -- all the cores ended up with the same name a

Re: Collection shard name

2015-01-15 Thread Erick Erickson
By definition, all replicas in a shard should be identical. So Solr is doing exactly what I'd expect since you've created two nodes each belonging to shard1 because of this parameter: "shard=shard1" so updates will go to both exactly as they should. The name parameter will allow you to distinguish
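The behaviour described above can be illustrated with a small Python model. This is a simplified sketch, not Solr's implementation: the `build_shard_map` and `route` helpers are hypothetical, and the CRC32 hash is only a stand-in for Solr's real document routing. The core names and the shard name come from the CREATE calls quoted earlier in the thread.

```python
import zlib
from collections import defaultdict

# (core name, shard) pairs as in the CREATE calls: same shard=, different name=.
cores = [("shard1_1", "shard1"), ("shard1_2", "shard1")]


def build_shard_map(cores):
    """Group core names by the shard they were registered under."""
    shard_map = defaultdict(list)
    for name, shard in cores:
        shard_map[shard].append(name)
    return dict(shard_map)


def route(doc_id, shard_map):
    """Pick one shard by hashing the id; every core in it gets the document."""
    shards = sorted(shard_map)
    # CRC32 is an illustrative stand-in, not the hash Solr actually uses.
    shard = shards[zlib.crc32(doc_id.encode("utf-8")) % len(shards)]
    return shard_map[shard]


shard_map = build_shard_map(cores)
print(route("doc-42", shard_map))
```

Because both cores were registered under `shard1`, every document lands on both of them; registering the second core under a different shard name would make each document go to only one of the two.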

Does DocValues improve Grouping performance ?

2015-01-15 Thread Shamik Bandopadhyay
Hi, Does the use of DocValues provide any performance improvement for Grouping? I've looked into the blog which mentions improving Grouping performance through DocValues. https://lucidworks.com/blog/fun-with-docvalues-in-solr-4-2/ Right now, Group by queries (which I sadly can't avoid) have become

SolrCloud - Enable SSL

2015-01-15 Thread Hrishikesh Gadre
Hi, If we need to enable SSL configuration for an existing Solr cluster (hosting one or more collections), do we need to manually update the clusterstate.json file? Or is there any API available which would serve the purpose? As per the Solr wiki, we need to set the urlScheme property to https h

Re: Solr groups not matching with terms in a field

2015-01-15 Thread Naresh Yadav
Hi Ahmet, If you look at the output, ngroups is 1 and only one group, P1, is returned. But my expectation is that it should return three groups, P1, L1 and L2, as my field is tokenized on spaces. Please correct me if I'm wrong. On 1/15/15, Ahmet Arslan wrote: > > > Hi Naresh, > > Everything looks correct, what is th

Re: OutOfMemoryError for PDF document upload into Solr

2015-01-15 Thread Dan Davis
Why re-write all the document conversion in Java ;) Tika is very slow. A 5 GB PDF is very big. If you have a lot of PDFs like that, try pdftotext in HTML and UTF-8 output mode. The HTML mode captures some meta-data that would otherwise be lost. If you need to go faster still, you can also writ

How to select the correct number of Shards in SolrCloud

2015-01-15 Thread Manohar Sripada
Hi All, My setup is as follows. There are 16 nodes in my SolrCloud and 4 CPU cores on each Solr node VM. Each has 64 GB of RAM, of which I have allocated 32 GB to Solr. I have a collection which contains around 100 million docs, which I created with 64 shards, replication factor 2, and 8 sh

Snippets sorting in SOLR is not working correctly

2015-01-15 Thread Behzad Qureshi
Hi, I have posted a question on stack exchange related to highlighted snippets sorting. Multiple snippets returned against single document are not in sorted order. Thanks in advance. -- Regards, Be

Re: SolrCloud - Enable SSL

2015-01-15 Thread Hrishikesh Gadre
OK. I think I have figured this out. https://issues.apache.org/jira/browse/SOLR-5610 On Thu, Jan 15, 2015 at 6:00 PM, Hrishikesh Gadre wrote: > Hi, > > If we need to enable SSL configuration for an existing Solr cluster > (hosting one or more collections), do we need to manually update the > cl

Re: Solr groups not matching with terms in a field

2015-01-15 Thread Ahmet Arslan
Hi Naresh, I have never grouped on a tokenised field and I am not sure it makes sense to do so. Looking back at the ref guide, it says this about the group.field parameter: "The name of the field by which to group results. The field must be single-valued, and either be indexed or a field type that has a va