Re: SOLRJ and SOLR compatibility

2014-03-03 Thread Thomas Scheffler
On 04.03.2014 07:21, Thomas Scheffler wrote: On 27.02.2014 09:15, Shawn Heisey wrote: On 2/27/2014 12:49 AM, Thomas Scheffler wrote: What problems have you seen with mixing 4.6.0 and 4.6.1? It's possible that I'm completely ignorant here, but I have not heard of any. Actually bug reports

Re: SOLRJ and SOLR compatibility

2014-03-03 Thread Thomas Scheffler
On 27.02.2014 09:15, Shawn Heisey wrote: On 2/27/2014 12:49 AM, Thomas Scheffler wrote: What problems have you seen with mixing 4.6.0 and 4.6.1? It's possible that I'm completely ignorant here, but I have not heard of any. Actually, I am receiving bug reports that sound like "Unknown type 19"

Re: range types in SOLR

2014-03-03 Thread Thomas Scheffler
On 03.03.2014 19:12, Smiley, David W. wrote: The main reference for this approach is here: http://wiki.apache.org/solr/SpatialForTimeDurations Hoss’s illustrations he developed for the meetup presentation are great. However, there are bugs in the instruction -- specifically it’s important to s

RE: Please add me to wiki contributors

2014-03-03 Thread Susheel Kumar
My user name is SusheelKumar for solr wiki. -Original Message- From: Susheel Kumar [mailto:susheel.ku...@thedigitalgroup.net] Sent: Monday, March 03, 2014 9:36 PM To: solr-user@lucene.apache.org Subject: Please add me to wiki contributors Hi, Can you please add me to wiki contributors.

Re: java.lang.Exception: Conflict with StreamingUpdateSolrServer

2014-03-03 Thread Gopal Patwa
Thanks Chris, I found in our application code it was related to optimistic concurrency failure. On Mon, Mar 3, 2014 at 6:13 PM, Chris Hostetter wrote: > > : Subject: java.lang.Exception: Conflict with StreamingUpdateSolrServer > > the fact that you are using StreamingUpdateSolrServer isn't real

Automate search results filtering based on scoring

2014-03-03 Thread Susheel Kumar
Hi, We are looking to automate searches (name searches) & filter out the results based on some scoring confidence. Any suggestions on what different approaches we can use to pick only top closer matches and filter out rest of the results. Thanks, Susheel

Please add me to wiki contributors

2014-03-03 Thread Susheel Kumar
Hi, Can you please add me to wiki contributors. I wanted to add some stats on Linux vs Windows we came across recently, CSV update handler examples, and also wanted to add company name to public server page. Thanks, Susheel

Re: Searching with special chars

2014-03-03 Thread deniz
So as there was no quick workaround for this issue, we simply changed the HTTP method from GET to POST, to avoid further problems which could be triggered by user input too. Though this violates the RESTful standards, at least we have something running properly. - Zeki ama calismiyor... Cali

Re: java.lang.Exception: Conflict with StreamingUpdateSolrServer

2014-03-03 Thread Chris Hostetter
: Subject: java.lang.Exception: Conflict with StreamingUpdateSolrServer the fact that you are using StreamingUpdateSolrServer isn't really a factor here -- what matters is the data you are sending to solr in the updates... : location=StreamingUpdateSolrServer line=162 Status for: null is 409

Re: solrconfig.xml

2014-03-03 Thread Chris Hostetter
: I have a core "geo" (with an empty file core.properties inside) and solrconfig.xml at : /srv/solr/solr4.6.1/cores/geo/conf/solrconfig.xml ... : But I get the error message: : "Could not load config file /srv/solr/solr4.6.1/cores/geo/solrconfig.xml" 1) what does your solr.xml file look

Re: is it possible to consolidate filterquery cache strings

2014-03-03 Thread solr-user
would not breaking the FQs out by state be faster for warming up the fq caches? -- View this message in context: http://lucene.472066.n3.nabble.com/is-it-possible-to-consolidate-filterquery-cache-strings-tp4121005p4121030.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solrconfig.xml

2014-03-03 Thread Alexandre Rafalovitch
File permissions? Malformed XML? Are there any other exceptions earlier in the log? If you substitute that file with one from example distribution, does it work? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is th

Re: Boost query syntax error

2014-03-03 Thread Arun Rangarajan
All of them work like a charm! Thanks, Chris. On Mon, Mar 3, 2014 at 1:28 PM, Chris Hostetter wrote: > > : But this query does not work: > : > : q={!boost > : b=if(exists(query({!v='user_type:ADMIN'})),10,1)}id:1&rows=1&fl=*,score > : It gives an error like this: > > The problem is the way you a

Re: is it possible to consolidate filterquery cache strings

2014-03-03 Thread Chris Hostetter
: Would that work, and if so how would I know that the cache is being hit? It should work -- filters are evaluated independently, so the fact that you are using all of them in query query (vs all of them in individual queries) won't change anything as far as the filterCache goes. You can prov

solrconfig.xml

2014-03-03 Thread Thomas Fischer
Hello, I'm sorry to repeat myself but I didn't manage to get out of the thread I inadvertently slipped into. My problem now is this: I have a core "geo" (with an empty file core.properties inside) and solrconfig.xml at /srv/solr/solr4.6.1/cores/geo/conf/solrconfig.xml following the hint from th

Re: SolrCloud Startup

2014-03-03 Thread KNitin
Thanks, Shawn. Right now my solr.solr.home is not being passed from the java runtime. Let's say /mnt/solr/ is my solr root. I can add all jars to /mnt/solr/lib/ and use -Dsolr.solr.home=/mnt/solr/ , that should do it, right? Thanks Nitin On Mon, Mar 3, 2014 at 2:44 PM, Shawn Heisey wrote: > On

Re: is it possible to consolidate filterquery cache strings

2014-03-03 Thread solr-user
note: by partitioning I mean that I have sharded the 120M docs into 9 Solr partitions (each on a separate server) -- View this message in context: http://lucene.472066.n3.nabble.com/is-it-possible-to-consolidate-filterquery-cache-strings-tp4121005p4121012.html Sent from the Solr - User mailing

Re: Configuration problem

2014-03-03 Thread Thomas Fischer
On 03.03.2014 at 22:43, Shawn Heisey wrote: > On 3/3/2014 9:02 AM, Thomas Fischer wrote: >> The setting is >> solr directories (I use different solr versions at the same time): >> /srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the >> new "discovery type" (no cores), and

Re: SolrCloud Startup

2014-03-03 Thread Shawn Heisey
On 3/3/2014 3:30 PM, KNitin wrote: A quick ping on this. To give more stats, I have 100's of collections on every node. The time it takes for one collection to boot up /loadonStartup is around 10-20 seconds (and sometimes even 1 minute). I do not have any query auto warming etc. On a per collect

Re: SolrCloud Startup

2014-03-03 Thread KNitin
A quick ping on this. To give more stats, I have 100's of collections on every node. The time it takes for one collection to boot up /loadonStartup is around 10-20 seconds (and sometimes even 1 minute). I do not have any query auto warming etc. On a per collection basis I load a bunch of libraries

is it possible to consolidate filterquery cache strings

2014-03-03 Thread solr-user
lets say I have a largish set of data (120M docs) and that I am partitioning my data by groups of states (using the state codes) Someone suggested that I could use the following format in my solrconfig.xml when defining the filterqueries work: *:* State:AL

Re: Configuration problem

2014-03-03 Thread Shawn Heisey
On 3/3/2014 9:02 AM, Thomas Fischer wrote: The setting is solr directories (I use different solr versions at the same time): /srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the new "discovery type" (no cores), and inside the core directories are empty files core.propert

Re[2]: query parameters

2014-03-03 Thread Andreas Owen
ok i like the logic, you can do much more. i think this should do it for me:          (-organisations:["" TO *] -roles:["" TO *]) (+organisations:(150 42) +roles:(174 72)) i want to use this in fq and i need to set the operator to OR. My q.op is AND but I need OR in fq. I have read about ofq b

Re: Boost query syntax error

2014-03-03 Thread Chris Hostetter
: But this query does not work: : : q={!boost : b=if(exists(query({!v='user_type:ADMIN'})),10,1)}id:1&rows=1&fl=*,score : It gives an error like this: The problem is the way you are trying to nest queries inside of each other w/o any sort of quoting -- the parser has no indication that the "b"

Re: network slows when solr is running - help

2014-03-03 Thread Lan
How frequently are you committing? Frequent commits can slow everything down. -- View this message in context: http://lucene.472066.n3.nabble.com/network-slows-when-solr-is-running-help-tp4120523p4120992.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solution for reverse order of year facets?

2014-03-03 Thread Ahmet Arslan
Hi Michael, I forgot to include what I did for one customer: 1) Using StatsComponent I get min and max values of the field (year) 2) Calculate "smart gap/range values" according to minimum and maximum. 3) Re-issue the same query (for the second time) that includes a set of facet.query. Ahmet
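
The three steps above can be sketched roughly as follows. This is only an illustration of step 2, not Ahmet's actual code: the function name, the fixed `gap` parameter and the `year` field name are assumptions, and the ranges are built as non-overlapping inclusive intervals with the most recent range first.

```python
def year_facet_queries(min_year, max_year, gap, field="year"):
    """Build facet.query range strings from the min/max reported by
    StatsComponent, newest range first (hypothetical helper)."""
    if gap < 1:
        raise ValueError("gap must be at least 1")
    queries = []
    upper = max_year
    while upper >= min_year:
        # Each range covers at most `gap` years and never goes below min_year.
        lower = max(min_year, upper - gap + 1)
        queries.append(f"{field}:[{lower} TO {upper}]")
        upper = lower - 1
    return queries


# Hypothetical min/max values; each string would be sent as one facet.query
for q in year_facet_queries(2005, 2014, 4):
    print("facet.query=" + q)
```

Because the ranges are listed newest first, the facet.query counts come back in the order the client wants to display them.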

Re: Solr Heap, MMaps and Garbage Collection

2014-03-03 Thread Tri Cao
If you just want to see which classes are occupying the most memory in a live JVM, you can do: jmap -permstat I don't think you can dump the contents of PERM space. Hope this helps, Tri On Mar 03, 2014, at 11:41 AM, KNitin wrote: Is there a way to dump the contents of permgen and look at which classes a

Re: Solution for reverse order of year facets?

2014-03-03 Thread Ahmet Arslan
Hi, Regarding "just a simple subtraction": you can do it in indexer code or in an update processor too. You can either modify the original field or you can create an additional one. JavaScript could be used: http://wiki.apache.org/solr/ScriptUpdateProcessor Ahmet On Monday, March 3, 2014 9:11 PM, Mic

What types is supported by Solrj addBean() in the fields of POJO objects?

2014-03-03 Thread T. Kuro Kurosaka
What are the supported types of the POJO objects that are sent to SolrServer.addBean(obj)? A quick glance at DocumentObjectBinder seems to suggest that an arbitrary combination of a Collection, List, ArrayList, array ([]), Map, HashMap, of primitive types, String and Date is supported, but I'm not

Wildcard searches and tokenization

2014-03-03 Thread Hayden Muhl
I'm working on a user name autocomplete feature, and am having some issues with the way we are tokenizing user names. We're using the StandardTokenizerFactory to tokenize user names, so "foo-bar" gets split into two tokens. We take input from the user and use it as a prefix to search on the user n

SOLR and Kerberos enabled HDFS

2014-03-03 Thread Jimmy
Hello, I am trying to connect SOLR (tried 4.4 and 4.7) to Kerberos-enabled HDFS - I am using Cloudera CDH 4.2.1 http://maven-repository.com/artifact/com.cloudera.cdh/cdh-root/4.2.1/pom_effective The keytab and principal are valid (I tested them with Flume as well as the simple hdfs CLI). Did anybody su

Re: Solr Heap, MMaps and Garbage Collection

2014-03-03 Thread KNitin
Is there a way to dump the contents of permgen and look at which classes are occupying the most memory in that? - Nitin On Mon, Mar 3, 2014 at 11:19 AM, KNitin wrote: > Regarding PermGen: Yes we have a bunch of custom jars loaded in solrcloud > (containing custom parsing, analyzers). But I hav

Re: Solution for reverse order of year facets?

2014-03-03 Thread Michael Lackhoff
On 03.03.2014 19:58 Shawn Heisey wrote: > There's already an issue in Jira. > > https://issues.apache.org/jira/browse/SOLR-1672 Thanks, this is of course the best solution. Only problem is that I use a custom version from a vendor (based on version 4.3) I want to enhance. But perhaps they apply t

Re: Solr Heap, MMaps and Garbage Collection

2014-03-03 Thread KNitin
Regarding PermGen: Yes we have a bunch of custom jars loaded in solrcloud (containing custom parsing, analyzers). But I haven't specifically enabled any string interning. Does solr intern all strings in a collection by default? I agree with doc and Filter Query Cache. Query Result cache hits are p

Re: Solution for reverse order of year facets?

2014-03-03 Thread Michael Lackhoff
Hi Ahmet, > There is no built in solution for this. Yes, I know, that's why I would like the TokenFilterFactory > Two workaround : > > 1) use facet.limit=-1 and invert the list (faceting response) at client side > > 2) use multiples facet.query >a)facet.query=year:[2012 TO 2014]&facet.quer

Re: Solution for reverse order of year facets?

2014-03-03 Thread Shawn Heisey
On 3/3/2014 7:35 AM, Michael Lackhoff wrote: If I understand the docs right, it is only possible to sort facets by count or value in ascending order. Both variants are not very helpful for year facets if I want the most recent years at the top (or appear at all if I restrict the number of facet e

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Josh
Thanks Tri, I really appreciate the response. When I get some free time shortly I'll start giving some of these a try and report back. On Mon, Mar 3, 2014 at 12:42 PM, Tri Cao wrote: > If it's really the interned strings, you could try upgrade JDK, as the > newer HotSpot > JVM puts interned st

Re: Solution for reverse order of year facets?

2014-03-03 Thread Ahmet Arslan
Hi Michael, Yes you are correct, oldest comes first. There is no built-in solution for this. Two workarounds: 1) use facet.limit=-1 and invert the list (faceting response) at client side 2) use multiple facet.query a)facet.query=year:[2012 TO 2014]&facet.query=year:[2010 TO 2012] b)fac
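
Workaround 1 could look something like the sketch below on the client side. It assumes the facet values arrive in Solr's flat JSON layout (value, count, value, count, ...) and that every value parses as an integer year; the helper name and the sample payload are made up for illustration.

```python
def newest_first(flat_facets):
    """Pair up a flat [value, count, value, count, ...] facet list and
    return (value, count) tuples with the largest year first."""
    pairs = list(zip(flat_facets[0::2], flat_facets[1::2]))
    # Sort explicitly instead of just reversing, so the result does not
    # depend on the order in which the constraints were returned.
    return sorted(pairs, key=lambda pair: int(pair[0]), reverse=True)


# Hypothetical facet payload for a "year" field fetched with facet.limit=-1
sample = ["2010", 7, "2011", 3, "2012", 12, "2014", 5]
for year, count in newest_first(sample):
    print(year, count)
```

With facet.limit=-1 the whole list is transferred, so the client can also cut it down to the N most recent years after sorting.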

Re: range types in SOLR

2014-03-03 Thread Smiley, David W.
The main reference for this approach is here: http://wiki.apache.org/solr/SpatialForTimeDurations Hoss’s illustrations he developed for the meetup presentation are great. However, there are bugs in the instruction — specifically it’s important to slightly buffer the query and choose an appropriat

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-03 Thread epnRui
Hi guys, I'm on my way to solve it properly. This is how my field looks like now: I still have one case where I'm facing issues because in fact I want to pres

Re: Elevation and core create

2014-03-03 Thread David Stuart
Hi Erick, Thanks for the response. On the wiki it states config-file Path to the file that defines query elevation. This file must exist in $/conf/ or$/. If the file exists in the /conf/ directory it will be loaded once at startup. If it exists in the data directory, it will be reloaded for

RegexTransformer and xpath in DataImportHandler

2014-03-03 Thread eShard
Good afternoon, I have this DIH: https://redacted/"; processor="XPathEntityProcessor" forEach="/rss/channel/item" transformer="DateFormatTransformer,TemplateTransformer,RegexTransformer">

RE: How to best handle search like Dave & David

2014-03-03 Thread Susheel Kumar
Thanks, Arun for sharing the idea on EdgeNGramFilter. In our case we are doing search using an automated process so EdgeNGramFilter may not work. We have used NGramFilterFactory in the past but will look into it again. For cases like Dave & David and other English names does anyone have an idea

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Tri Cao
If it's really the interned strings, you could try upgrading the JDK, as the newer HotSpot JVM puts interned strings in regular heap: http://www.oracle.com/technetwork/java/javase/jdk7-relnotes-418459.html (search for String.intern() in that release) I haven't got a chance to look into the new core auto disc

RE: Solr 4.5.0 replication numDocs larger in slave

2014-03-03 Thread Geary, Frank
Thanks Greg. We optimize the master once a week (early in the day Sunday) and we do not do a commit Sunday evening (the only evening of the week when we do not commit). So now after optimization/replication the master/slave pair that were out of sync on Friday now have the same numDocs (and ev

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Josh
In the user core there are two fields, the database core in question was 40, but in production environments the database core is dynamic. My time has been pretty crazy trying to get this out the door and we haven't tried a standard solr install yet but it's on my plate for the test app and I don't

Re: Multiple partial match

2014-03-03 Thread Zwer
AFAICS tf(name, 'co') returns 0 on the {id:1, name:'Coca-Cola Company'} because it does not support partial match. tf(name, 'company') will return 1 -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-partial-match-tp4120886p4120919.html Sent from the Solr - User maili

Re: Fetching uniqueKey and other int quickly from documentCache?

2014-03-03 Thread Yonik Seeley
On Mon, Mar 3, 2014 at 11:14 AM, Gregg Donovan wrote: > Yonik, > > That's a very clever idea. Unfortunately, I think that will skip the > distributed query optimization we were hoping to take advantage of in > SOLR-1880 [1], but it should work with the proposed distrib.singlePass > optimization in

Re: Multiple partial match

2014-03-03 Thread Jack Krupansky
Add a function query boost that uses the term frequency, "tf": bf=tf(name,'co') -- additive boost boost=tf(name,'co') -- multiplicative boost That does of course require that term frequency is not disabled for that field in the schema. You can multiply the term frequency as well in the fun
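
As a rough sketch of how the two variants above might be sent from a client, the snippet below builds the request parameters. The helper name is hypothetical, and `defType=edismax` is an assumption on my part (the `boost` parameter needs the edismax parser, while `bf` also works with dismax); it is not taken from Jack's message.

```python
from urllib.parse import urlencode


def boosted_params(user_query, field, term, multiplicative=True):
    """Return Solr request parameters that boost by term frequency.

    multiplicative=True uses `boost`, otherwise the additive `bf`.
    """
    boost_fn = f"tf({field},'{term}')"
    # Assumption: the edismax parser is selected for this request.
    params = {"defType": "edismax", "q": user_query}
    params["boost" if multiplicative else "bf"] = boost_fn
    return params


# Query string for the multiplicative variant (URL-encoded)
print(urlencode(boosted_params("name:co*", "name", "co")))
```

As the message notes, tf() only returns useful values if term frequencies are not disabled for the field in the schema.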

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Tri Cao
Hey Josh, I am not an expert in Java performance, but I would start with dumping the heap and investigate with visualvm (the free tool that comes with JDK). In my experience, the most common cause for PermGen exception is the app creates too many interned strings. Solr (actually Lucene) interns the

Solr Filter Cache Size

2014-03-03 Thread Benjamin Wiens
How can we calculate how much heap memory the filter cache will consume? We understand that in order to determine a good size we also need to evaluate how many filterqueries would be used over a certain time period. Here's our setting: According to the post below, 53 GB of RAM would b
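
A back-of-the-envelope estimate can be sketched as below. It rests on a common rule of thumb rather than anything stated in this thread: each cached filter is assumed to be stored as a bitset with one bit per document in the index (maxDoc), so the result is an upper bound (sparse filters may be stored more compactly). The function name and the example numbers are hypothetical and are not the poster's settings.

```python
def filter_cache_upper_bound_bytes(max_doc, cache_size):
    """Worst-case heap used by a full filter cache, assuming every entry
    is a bitset holding one bit per document in the index."""
    bytes_per_entry = (max_doc + 7) // 8  # one bit per doc, rounded up to bytes
    return bytes_per_entry * cache_size


# Hypothetical index: 1,000,000 documents, cache size of 512 entries
estimate = filter_cache_upper_bound_bytes(1_000_000, 512)
print(f"{estimate / 2**20:.1f} MiB upper bound")
```

The estimate grows linearly with both the document count and the configured cache size, which is why the number of distinct filter queries seen over time matters when choosing the size.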

Re: Fetching uniqueKey and other int quickly from documentCache?

2014-03-03 Thread Gregg Donovan
Yonik, That's a very clever idea. Unfortunately, I think that will skip the distributed query optimization we were hoping to take advantage of in SOLR-1880 [1], but it should work with the proposed distrib.singlePass optimization in SOLR-5768 [2]. Does that sound right? --Gregg [1] https://issue

Re: Solr is NoSQL database or not?

2014-03-03 Thread Jack Krupansky
For the record, I am +1 for somebody to add Solr to the NoSQL wikipedia page, in much the same way that Elasticsearch is already there. From a LucidWorks webinar blurb: "The long awaited Solr 4 release brings a large amount of new functionality that blurs the line between search engines and No

Configuration problem

2014-03-03 Thread Thomas Fischer
Hello, for some reason I have problems getting my local Solr system to run (MacBook, tomcat 6.0.35). The setting is solr directories (I use different solr versions at the same time): /srv/solr/solr4.6.1 is the solr home, in solr home is a file solr.xml of the new "discovery type" (no cores), and

Re: SolrCloud: heartbeat succeeding while node has failing SSD?

2014-03-03 Thread Gregg Donovan
Thanks, Mark! The supervised process sounds very promising but complicated to get right. E.g. where does the supervisor run, where do nodes report their status to, are the checks active or passive, etc. Having each node perform a regular background self-check and remove itself from the cluster if

RE: Solr 4.5.0 replication numDocs larger in slave

2014-03-03 Thread Geary, Frank
Thanks Erick. Indexing is not happening to the slave since it has never been set up there - there aren't even any commits happening on the slave (which we normally do via cron job). But Indexing is definitely happening to the master at the time replication happens. " Sounds like the master

Re: Solr is NoSQL database or not?

2014-03-03 Thread Furkan KAMACI
Hi; I said that: "What are the main differences between ElasticSearch and Solr that make ElasticSearch a NoSQL store but not Solr." because it is just a marketing term as Jack indicated after me. Also I said: "The first link you provided includes ElasticSearch: http://en.wikipedia.org/wiki/NoSQ

Re: Solution for reverse order of year facets?

2014-03-03 Thread Michael Lackhoff
On 03.03.2014 16:33 Ahmet Arslan wrote: > Currently there are two sorting criteria available. However sort by index - > to return the constraints sorted in their index order (lexicographic by > indexed term) - should return most recent year at top, no? No, it returns them -- as you say -- in le

Multiple partial match

2014-03-03 Thread Zwer
Hi Guys, I am faced with a problem: I make the query *name:co*^5* to SOLR. It returns two docs with equal score: {id: 1, name: 'Coca-Cola Company'}, {id: 2, name: 'Microsoft Corporation'}. How can I boost Coca-Cola Company because it contains more partial matches? P.S. All normalization used by TF-IDF

Re: Solution for reverse order of year facets?

2014-03-03 Thread Ahmet Arslan
Hi, Currently there are two sorting criteria available. However sort by index - to return the constraints sorted in their index order (lexicographic by indexed term) - should return most recent year at top, no? Ahmet On Monday, March 3, 2014 4:36 PM, Michael Lackhoff wrote: If I understand

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Josh
It's a windows installation using a bitnami solr installer. I incorrectly put 64M into the configuration for this, as I had copied the test configuration I was using to recreate the permgen issue we were seeing on our production system (that is configured to 512M) as it takes awhile with to recreat

Re: Solr 4.5.0 replication numDocs larger in slave

2014-03-03 Thread Greg Walters
I just ran into an issue similar to this that affected document scores on distributed searches. You might try doing an optimize and purging your deleted documents while no indexing is being done then checking your counts. Once I optimized all my indexes the document counts on all of my cores mat

Re: Solr Permgen Exceptions when creating/removing cores

2014-03-03 Thread Greg Walters
Josh, You've mentioned a couple of times that you've got PermGen set to 512M but then you say you're running with -XX:MaxPermSize=64M. These two statements are contradictory so are you *sure* that you're running with 512M of PermGen? Assuming you're on a *nix box can you provide `ps` output provi

Solution for reverse order of year facets?

2014-03-03 Thread Michael Lackhoff
If I understand the docs right, it is only possible to sort facets by count or value in ascending order. Both variants are not very helpful for year facets if I want the most recent years at the top (or appear at all if I restrict the number of facet entries). It looks like a requirement that was

Re: Solr Heap, MMaps and Garbage Collection

2014-03-03 Thread Michael Sokolov
On 3/3/2014 1:54 AM, KNitin wrote: 3. 2.8 Gb - Perm Gen (I am guessing this is because of interned strings) As others have pointed out, this is really unusual for Solr. We often see high permgen in our app servers due to dynamic class loading that the framework performs; maybe you are somehow

Re: Solr Shard Query From Inside Search Component Sometimes Gives Wrong Results

2014-03-03 Thread Shalin Shekhar Mangar
What was the query you are making? What is the sort order for the query? Are you sure you are not indexing data in between making these requests? Are you able to reproduce this outside of your search component? It is hard to answer questions about custom code without actually looking at the code.

Re: Slow query time on stemmed fields

2014-03-03 Thread Jens Meiners
Sorry for the delay, I did not have access to the server and could not query anything. This is my Query: http://server:port/solr/core/select?q=keyword1+keyword2&wt=xml&indent=true&hl.fragsize=120&f.file_URI_tokenized.hl.fragsize=1000&spellcheck=true&f.file_content.hl.alternateField=spell&hl.simp

Re: SOLR cloud disaster recovery

2014-03-03 Thread Jan Van Besien
On Fri, Feb 28, 2014 at 7:50 PM, Per Steffensen wrote: > I might be able to find something for you. Which version are you using - I > have some scripts that work on 4.0 and some other scripts that work for 4.4 > (and maybe later). This sounds useful. I am using 4.6.1. Kind regards Jan

Solr Shard Query From Inside Search Component Sometimes Gives Wrong Results

2014-03-03 Thread Vishnu Mishra
Hi, I am using Solr 4.6 and running a Solr query on a shard from inside a Solr search component, and I try to use the obtained results for my custom logic. I have used SolrJ for the distributed search, but the results coming from this distributed search vary sometimes. So my questions a

Re: Solr is NoSQL database or not?

2014-03-03 Thread Charlie Hull
On 01/03/2014 23:53, Jack Krupansky wrote: NoSQL? To me it's just a marketing term, like Big Data. +1 Depends very much who you talk to. Marketing folks like to ride the current wave, so if NoSQL is current, they'll jump on that one, likewise Big Data. Technical types like to be correct in t