Re: Does sorting skip everything having to do with relevancy?

2014-04-03 Thread Alvaro Cabrerizo
Hi, By default solr is using the sort parameter over the "score field". So if you overwrite it using other sort field, yes solr will use the parameter you've provided. Remember, you can use multiple fields for sorting so you can make somethin

Does sorting skip everything having to do with relevancy?

2014-04-03 Thread Shawn Heisey
If I provide a sort parameter, will Solr (4.6.1) skip score/boost processing? In particular I would like to know what happens if I have a boost parameter (with a complex function) for edismax search, but I include a sort parameter on one of my fields. I am using distributed search. I do know tha

Re: Solr DataImport Hander

2014-04-03 Thread Candygram For Mongo
The ramBufferSizeMB was set to 6MB only on the test system to make the system crash sooner. In production that tag is commented out which I believe forces the default value to be used. On Thu, Apr 3, 2014 at 6:36 PM, Susheel Kumar < susheel.ku...@thedigitalgroup.net> wrote: > Hi Sanjay, > > Thi

RE: Solr DataImport Hander

2014-04-03 Thread Susheel Kumar
Hi Sanjay, This is how output will come since solr documents are flat. In your sub entity you queried emp name for a dept and for e.g. in case of deptno=10 you had 3 employees so all came in ename field. You are getting the data, now it will be your UI function to present in whatever format

Re: eDismax parser and the mm parameter

2014-04-03 Thread S.L
Ahmet, SpellChecker seems to be the the exact thing that I need for fuzzy type search , how can I combine SpellChecker with something like edismax parser to make use of paramerters like pf,pf2 and pf3 . Is there any resource that you can point me to do that ? Thanks. On Wed, Apr 2, 2014 at 9:1

Re: Analysis of Japanese characters

2014-04-03 Thread Alexandre Rafalovitch
No specific answers, but have you read the detailed CJK article collection: http://discovery-grindstone.blogspot.ca/ . There is a lot of information there. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr profici

Re: How do I add another unrelated query results to solr index

2014-04-03 Thread Ahmet Arslan
Hi, It is possible but remove rootEntity="true" from entity declarations. On Thursday, April 3, 2014 10:29 PM, sanjay92 wrote: Hi, How do I add two different queries into one single index. e..g. I have added following                                                             But when

Re: Full Indexing is Causing a Java Heap Out of Memory Exception

2014-04-03 Thread Ahmet Arslan
Hi, out of curiosity, why did you set ramBufferSizeMB to 6?  Ahmet On Friday, April 4, 2014 3:27 AM, Candygram For Mongo wrote: *Main issue: Full Indexing is Causing a Java Heap Out of Memory Exception *SOLR/Lucene version: *4.2.1* *JVM version: Java(TM) SE Runtime Environment (build 1.7

Full Indexing is Causing a Java Heap Out of Memory Exception

2014-04-03 Thread Candygram For Mongo
*Main issue: Full Indexing is Causing a Java Heap Out of Memory Exception *SOLR/Lucene version: *4.2.1* *JVM version: Java(TM) SE Runtime Environment (build 1.7.0_07-b11) Java HotSpot(TM) 64-Bit Server VM (build 23.3-b01, mixed mode) *Indexer startup command: set JVMARGS=-XX:MaxPermSize=364

Re: DocValues and StatsComponent

2014-04-03 Thread Ahmet Arslan
Hi Harish, I re-produced your problem with example/default setup. I enabled doc values example fields. ( deleted the original ones) and indexed example documents.       Single valued fields work fine. But stats on multi-valued field cat yields  http://localhost:8983/solr/collection1/select?q=

Re: Filter query with multiple raw/literal ORs

2014-04-03 Thread Yonik Seeley
On Thu, Apr 3, 2014 at 7:04 PM, Gregory Chanan wrote: > {!raw f=foobar v=*} OR {!raw f=foobar v=a} > but this seems to give me the same results as the single-clause query. Right... the query parser is detected as "raw" and the rest of the value is used as the term (or rather it's ambiguous becaus

Re: Solr DataImport Hander

2014-04-03 Thread Ahmet Arslan
Hi Sanjay, 1) Change you SQL from "select ename,job from .." to "select concat(ename,job) as something from" 2) Use TemplateTransformer  http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer                                                On Thursday, April 3, 2014 7:30 PM, sanj

Re: Filter query with multiple raw/literal ORs

2014-04-03 Thread Ahmet Arslan
Hi Greg, One way is to use _query_ hook _query_:"{!raw f=foobar v=*}" OR _query_:"{!raw f=foobar v=a}" http://searchhub.org/2009/03/31/nested-queries-in-solr/ On Friday, April 4, 2014 2:07 AM, Gregory Chanan wrote: I have a string field called foobar.  I'm constructing a FilterQuery with

Filter query with multiple raw/literal ORs

2014-04-03 Thread Gregory Chanan
I have a string field called foobar. I'm constructing a FilterQuery with ORs for some arbitrary values, e.g.: fq=foobar:(a OR b OR c) Now, this works as long as a,b,c aren't "important" in the query language. Let's say c is *, then I get all the documents in the collection, when what I actually

DocValues and StatsComponent

2014-04-03 Thread Harish Agarwal
Is there a known issue using the StatsComponent against fields indexed with docvalues? My setup is currently throwing this error (against the latest nightly build): org.apache.solr.common.Solr*Exception*; org.apache.solr.common.Solr *Exception*: Type mismatch: INTEGER_4 was indexed as SORTED_SET

Re: tf and very short text fields

2014-04-03 Thread Tom Burton-West
Hi Markus and Wunder, I'm missing the original context, but I don't think BM25 will solve this particular problem. The k1 parameter sets how quickly the contribution of tf to the score falls off with increasing tf. It would be helpful for making sure really long documents don't get too high a

How do I add another unrelated query results to solr index

2014-04-03 Thread sanjay92
Hi, How do I add two different queries into one single index. e..g. I have added following But when I query, I dont see any data from salgrade table. I have also tried another document tag but that also did not work as expected. Is i

Re: Analysis of Japanese characters

2014-04-03 Thread Tom Burton-West
Hi Shawn, >>For an input of 田中角栄 the bigram filter works like you described, and what I would expect. If I add a space at the point where the ICU >>tokenizer would have split them anyway, the bigram filter output is very different. If I'm understanding what you are reporting, I suspect this is b

Boosing Basic

2014-04-03 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Hello, I am trying to implement boosting but I am not able to find a good example, Some places asking to add ^10 to boost score in some places it says use bf . I have query with condition (Name OR Description OR ProductType) but I like to show the results first Name and need to boost the condit

Re: how do I get search for "fort st john" to match "ft saint john"

2014-04-03 Thread solr-user
thanks guys. unfortunately the solr that contains this schema/data is in a legacy system that requires the fields to not be changed. we will, hopefully in the near future, be able to look at redesigning the schema. alternatively, I could look at boning up on Java (which I havent used in a long t

Solr DataImport Hander

2014-04-03 Thread sanjay92
Hi, I am writing very simple Dept, Emp Solr DataImport Handler. it is working but when I query using http://localhost:8983/solr/select?q=*:* I see results in XML format . See attached file. deptemp.xml Output from inner query does no

Re: Spatial maxDistErr changes

2014-04-03 Thread Steven Bower
Thanks... I noticed that.. I tried to send a mail to your Mitre address and it got returned... Not sure if you've locked something new down but if you are interested we are looking to hire for our search team at Bloomberg LP steve On Wed, Apr 2, 2014 at 11:20 AM, David Smiley wrote: > Good q

SOLR LUCENE job

2014-04-03 Thread Shahrokh
I am looking for a Solr Lucene consultant to work in Texas. It is a one year Solr Lucene/Java/NO Sql/ Hadoop job. Must have at least 1 solr lucene implementation. Please email me as soon as possible at shahrokh_sha...@oxfordcorp.com if you are at all interested in hearing more. Please provi

Re: Solr interface

2014-04-03 Thread Erick Erickson
This feels like premature optimization. Before going that route, I'd be _very_ sure that my network speed was enough of a problem to warrant the effort. Now, you might have very slow networks or some other unusual setup that justifies the effort, but it would have to be very special, there are a _

Solr interface

2014-04-03 Thread Александр Вандышев
I want to reduce the cost of encoding files for network transmission . Чт. 03 апр. 2014 16:47:21 пользователь Erik Hatcher (erik.hatc...@gmail.com) написал: Yes. But why? DataImportHandler kinda does this (still use http to kick off an indexing job). And there's EmbeddedSolrServer too. Erik

Re: Solr interface

2014-04-03 Thread Erik Hatcher
Yes. But why? DataImportHandler kinda does this (still use http to kick off an indexing job). And there's EmbeddedSolrServer too. Erik > On Apr 3, 2014, at 8:39, Александр Вандышев wrote: > > Is it possible to index files not via HTTP interface?

Re: Luke 4.7.0 released

2014-04-03 Thread Dmitry Kan
welcome! there will be a shell script in the next luke release: https://github.com/DmitryKey/luke/blob/master/luke.sh On Thu, Apr 3, 2014 at 3:39 PM, simon wrote: > adding that worked - thanks. > > > On Thu, Apr 3, 2014 at 4:18 AM, Dmitry Kan wrote: > > > Hi Joshua, Simon, > > > > do you pass

Solr interface

2014-04-03 Thread Александр Вандышев
Is it possible to index files not via HTTP interface?

Re: Luke 4.7.0 released

2014-04-03 Thread simon
adding that worked - thanks. On Thu, Apr 3, 2014 at 4:18 AM, Dmitry Kan wrote: > Hi Joshua, Simon, > > do you pass the -XX:MaxPermSize=512m to your jvm? > > java -XX:MaxPermSize=512m -jar luke-with-deps.jar > > My java runtime environment is of the same version as Simon's: build > 1.7.0_51-b13,

Re: tf and very short text fields

2014-04-03 Thread Michael Sokolov
On 4/3/14 7:46 AM, Michael Sokolov wrote: On 4/1/14 2:32 PM, Walter Underwood wrote: And here is another peculiarity of short text fields. The movie "New York, New York" should not be twice as relevant for the query "new york". Is there a way to use a binary term frequency rather than a count

Re: tf and very short text fields

2014-04-03 Thread Michael Sokolov
On 4/1/14 2:32 PM, Walter Underwood wrote: And here is another peculiarity of short text fields. The movie "New York, New York" should not be twice as relevant for the query "new york". Is there a way to use a binary term frequency rather than a count? wunder -- Walter Underwood wun...@wunderw

Re: sort by an attribute values sequence

2014-04-03 Thread Ahmet Arslan
Hi Santosh, Which field do you sort on? You can multiple sort criteria, like sort=someFied asc, score desc On Thursday, April 3, 2014 9:54 AM, santosh sidnal wrote: Hi Ahmet/All, Thanks for the reply. The Solution of boosting those product type values will work fine if i don't apply any 's

Re: Product index schema for solr

2014-04-03 Thread Ajay Patel
i am posting this question again. as per your suggestion my final schema will be like { id: ... ... [PRODUCT RELATED DATAS] ... ... ... min_qty: 1 max_qty: 50 price: 4 } [OTHER SAME LIKE ABOVE DATA] now i want to create range facet field by

Re: High CPU usage after import

2014-04-03 Thread Jack Krupansky
That could mean that the code is hung somehow. Or, maybe Solr is just working on the commit. Unless you have an explicit commit, the automatic commit will occur some time after the extract request. How much data are we talking about? What does the Solr log say? Compare that to the case where C

Solr join and lucene scoring

2014-04-03 Thread mm
Hello, referencing to this issue: https://issues.apache.org/jira/browse/SOLR-4307 Is it still not possible with the solr query time join to use scoring? Do I still have to write my own plugin or is there a plugin somewhere I could use? I never wrote a plugin for solr before, so I would prefe

Re: Luke 4.7.0 released

2014-04-03 Thread Dmitry Kan
Hi Joshua, Simon, do you pass the -XX:MaxPermSize=512m to your jvm? java -XX:MaxPermSize=512m -jar luke-with-deps.jar My java runtime environment is of the same version as Simon's: build 1.7.0_51-b13, run on ubuntu. Dmitry On Wed, Apr 2, 2014 at 6:54 PM, simon wrote: > Also seeing this on M

Re: Flush buffer exceptions

2014-04-03 Thread ku3ia
OK. I understand. Will review check it. Many thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Flush-buffer-exceptions-tp4128593p4128801.html Sent from the Solr - User mailing list archive at Nabble.com.

High CPU usage after import

2014-04-03 Thread Александр Вандышев
Thanks for the answer. I meant that the CPU does not free after the end of import.Tomtcat or Solr continue use it in max level. . Вт. 01 апр. 2014 20:09:24 пользователь Jack Krupansky (j...@basetechnology.com) написал: Some document types can consume significant CPU resources, such as large PD

Re: sort by an attribute values sequence

2014-04-03 Thread Shawn Heisey
On 4/3/2014 12:53 AM, santosh sidnal wrote: > But my requirement is i want sorting to be applied and boost a > particular/some attribute values (C,B etc) in the sorted result, which is > not working. Looks like sorting will take precedence over boosting. Correct > me if i am wrong. If you use a so