Re: Two unrelated questions

2011-09-21 Thread tamanjit.bin...@yahoo.co.in
For *1* I have faced similar issues, and have realized that it has more to do with the data I am trying to index. In some cases when I run even a full-import with DIH, unless it's a flat table that I am trying to index, there are often issues at the data end when I try to get joins and then index dat

RE: NRT and commit behavior

2011-09-21 Thread Tirthankar Chatterjee
Okay, but is there any threshold, in index size, total docs in the index, or size of physical memory, at which sharding should be considered? I am trying to find the winning combination. Tirthankar -----Original Message----- From: Erick Erickson [mailto:erickerick...@gmail.com]

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Michael Sokolov
I wonder if config-file validation would be helpful here :) I posted a patch in SOLR-1758 once. -Mike On 9/21/2011 6:22 PM, Michael Ryan wrote: I think the problem is that the config needs to be inside of the config, rather than after it as you have. -Michael

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Jason Toy
I am running the sun version: java version "1.6.0_26" Java(TM) SE Runtime Environment (build 1.6.0_26-b03) Java HotSpot(TM) 64-Bit Server VM (build 20.1-b02, mixed mode) I get multiple Out of memory exceptions looking at my application and the solr logs, but my script doesn't get called the first

RE: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Michael Ryan
I think the problem is that the config needs to be inside of the config, rather than after it as you have. -Michael

Re: SOLR error with custom FacetComponent

2011-09-21 Thread Erik Hatcher
Why create a custom facet component for this? Simply add lines like this to your request handler(s): manu_exact either in defaults or appends sections. Erik On Sep 21, 2011, at 14:00 , Ravi Bulusu wrote: > Hi All, > > > I'm trying to write a custom SOLR facet component

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/21/2011 11:18 AM, Shawn Heisey wrote: With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be ignored. I've got both set to 35, but Solr is merging every 10 segments. I haven't tried explicitly setting mergeFactor yet to see if that will make the other settings overrid

SOLR error with custom FacetComponent

2011-09-21 Thread Ravi Bulusu
Hi All, I'm trying to write a custom SOLR facet component and I'm getting some errors when I deploy my code into the SOLR server. Can you please let me know what I'm doing wrong? I appreciate your help on this issue. Thanks. *Issue* I'm getting an error saying "Error instantiating SearchCompone

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/21/2011 3:10 PM, Chris Hostetter wrote: : With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be : ignored. I've got both set to 35, but Solr is merging every 10 segments. I ... : Here's the relevant config pieces. These two sections are in separate files : inc

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Chris Hostetter
: Usually any good piece of java code refrains from capturing Throwable : so that Errors will bubble up unlike exceptions. Having said that, Even if some piece of code catches an OutOfMemoryError, the JVM should have already called the "-XX:OnOutOfMemoryError" hook - Although from what i can te

Re: Two unrelated questions

2011-09-21 Thread Rob Casson
for #1, i don't use DIH, but is there any possibility of that column having duplicate keys, with subsequent docs replacing existing ones? and for #2, for some cases you could use a negative filterquery: http://wiki.apache.org/solr/SimpleFacetParameters#Retrieve_docs_with_facets_missing so
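
To make the negative filter query idea concrete, here is a minimal Python sketch that only builds the request URL. The base URL and the field name `category` are assumptions for illustration, not taken from the thread; the `-field:[* TO *]` form is the one described on the linked wiki page.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

def missing_field_query_url(base_url, field):
    """Build a select URL whose filter query keeps only docs lacking `field`."""
    params = {
        "q": "*:*",
        # Purely negative filter: exclude every doc that has any value in the field.
        "fq": "-%s:[* TO *]" % field,
    }
    return base_url + "?" + urlencode(params)

# Hypothetical host and field name, for illustration only.
url = missing_field_query_url("http://localhost:8983/solr/select", "category")
print(url)
```

The URL can then be fetched with any HTTP client; the filter is URL-encoded by `urlencode`, so no manual escaping is needed.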

RE: Two unrelated questions

2011-09-21 Thread Olson, Ron
Thanks for the reply. As far as #1, my table that I'm indexing via DIH has a PK field, generated by a sequence, so there are records with ID of 1, 2, 3, etc. That same id is the one I use in my unique id field in the document (ID). I've noticed that the table has, say, 10 rows. My index only ha

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Chris Hostetter
: With no mergeFactor defined, maxMergeAtOnce and segmentsPerTier seem to be : ignored. I've got both set to 35, but Solr is merging every 10 segments. I ... : Here's the relevant config pieces. These two sections are in separate files : incorporated into solrconfig.xml using xinclude:

Re: Slow autocomplete(terms)

2011-09-21 Thread Erick Erickson
Think about ngrams if you really need infix searches, you're right that the regex is very probably the root of your problem. The index has to examine *every* term in the field to determine if the regex will match. Best Erick On Tue, Sep 20, 2011 at 12:57 AM, roySolr wrote: > Hello, > > I used th
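
A rough Python sketch of why ngrams help with infix matching: the substrings are generated once at index time, so an infix lookup becomes an exact lookup instead of a scan over every term. This is a toy in-memory index, not how Solr stores terms, and the gram sizes and sample terms are assumptions.

```python
def ngrams(term, min_size=2, max_size=4):
    """Return every substring of `term` whose length is in [min_size, max_size]."""
    grams = []
    for size in range(min_size, max_size + 1):
        for start in range(len(term) - size + 1):
            grams.append(term[start:start + size])
    return grams

# Toy inverted index: gram -> set of original terms containing it.
index = {}
for term in ["amsterdam", "rotterdam"]:
    for gram in ngrams(term):
        index.setdefault(gram, set()).add(term)

# An infix query is now a single dictionary lookup rather than a regex scan.
print(sorted(index.get("dam", set())))
```

The trade-off is a larger index, since each term contributes many grams.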

Re: Two unrelated questions

2011-09-21 Thread Erick Erickson
for <1> I don't quite get what you're driving at. Your DIH query assigns the uniqueKey, it's not like it's something auto-generated. Perhaps a concrete example would help. <2> There's a limit you can adjust that defaults to 1024 (maxBooleanClauses in solrconfig.xml). You can bump this very high,

Implementing a custom ResourceLoader

2011-09-21 Thread Jithin Emmanuel
Hi, As part of writing a solr plugin I need to override the ResourceLoader. My plugin is intended to be a stop word analyzer filter factory, and I need to change the way stop words are fetched. My assumption is that overriding ResourceLoader->getLines() will help me meet my target of fetching stop word

Re: Sort five random "Top Offers" to the top

2011-09-21 Thread Sujit Pal
Hi MOuli, AFAIK (and I don't know that much about Solr), this feature does not exist out of the box in Solr. One way to achieve this could be to construct a DocSet with topoffer:true and intersect it with your result DocSet, then select the first 5 off the intersection, randomly shuffle them, subl
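
The steps described above could look roughly like this in plain Python, working on lists of document ids rather than Solr DocSets. The message is cut off after the shuffle step, so placing the shuffled offers in front of the remaining results is an assumption, as are the function name and the sample ids.

```python
import random

def promote_top_offers(results, top_offer_ids, count=5, rng=None):
    """Move up to `count` shuffled top offers to the front of `results`."""
    rng = rng or random.Random()
    # Intersection of the result list with the top-offer set, in result order.
    top_hits = [doc_id for doc_id in results if doc_id in top_offer_ids]
    picked = top_hits[:count]
    rng.shuffle(picked)
    picked_set = set(picked)
    # Everything that was not promoted keeps its original relative order.
    rest = [doc_id for doc_id in results if doc_id not in picked_set]
    return picked + rest

# Hypothetical ids, for illustration only.
ranked = ["doc-a", "doc-b", "doc-c", "doc-d"]
top_offers = {"doc-b", "doc-d"}
print(promote_top_offers(ranked, top_offers))
```

Passing a seeded `random.Random` as `rng` makes the order repeatable, which is handy when testing.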

Re: strange copied field problem

2011-09-21 Thread Pulkit Singhal
No probs. I would still hope someone would comment on your thread with some expert opinions about making a copy of a copy :) On Wed, Sep 21, 2011 at 1:38 PM, Tanner Postert wrote: > sure enough that worked. could have sworn we had it this way before, but > either way, that fixed it. Thanks. > > On

Production Issue: SolrJ client throwing this error even though field type is not defined in schema

2011-09-21 Thread roz dev
Hi All We are getting this error in our Production Solr Setup. Message: Element type "t_sort" must be followed by either attribute specifications, ">" or "/>". Solr version is 1.4.1 Stack trace indicates that solr is returning malformed document. Caused by: org.apache.solr.client.solrj.SolrSer

Re: Debugging DIH by placing breakpoints

2011-09-21 Thread Pulkit Singhal
Correct! With that additional info, plus http://wiki.apache.org/solr/HowToContribute (ant eclipse), plus a refreshed (close/open) eclipse project ... I'm all set. Thanks Again. On Wed, Sep 21, 2011 at 1:43 PM, Gora Mohanty wrote: > On Thu, Sep 22, 2011 at 12:08 AM, Pulkit Singhal > wrote: >> He

Re: Debugging DIH by placing breakpoints

2011-09-21 Thread Gora Mohanty
On Thu, Sep 22, 2011 at 12:08 AM, Pulkit Singhal wrote: > Hello, > > I was wondering where can I find the source code for DIH? I want to > checkout the source and step through it breakpoint by breakpoint to > understand it better :) Should be under contrib/dataimporthandler in your Solr source t

Re: strange copied field problem

2011-09-21 Thread Tanner Postert
sure enough that worked. could have sworn we had it this way before, but either way, that fixed it. Thanks. On Wed, Sep 21, 2011 at 11:01 AM, Tanner Postert wrote: > i believe that was the original configuration, but I can switch it back and > see if that yields any results. > > > On Wed, Sep 21,

Debugging DIH by placing breakpoints

2011-09-21 Thread Pulkit Singhal
Hello, I was wondering where can I find the source code for DIH? I want to checkout the source and step through it breakpoint by breakpoint to understand it better :) Thanks! - Pulkit

Re: Solr Indexing - Null Values in date field

2011-09-21 Thread Pulkit Singhal
Also you may use the script transformer to explicitly remove the field from the document if the field is null. I do this for all my sdouble and sdate fields ... it's a bit manual and I would like to see Solr enhanced to simply skip stuff like this by having a flag for its DIH code but until then it

Re: add quartz like scheduling capabilities to solr-DIH

2011-09-21 Thread Pulkit Singhal
I think what Ahmet is trying to say is that such functionality does not exist. As the functionality does not exist, there is no procedure or conf file related work to speak of. There has been a request to have this work done and you can vote/watch for it here: https://issues.apache.org/jira/browse/SO

Re: OOM errors and -XX:OnOutOfMemoryError flag not working on solr?

2011-09-21 Thread Pulkit Singhal
Usually any good piece of java code refrains from capturing Throwable so that Errors will bubble up unlike exceptions. Having said that, perhaps someone in the list can help, if you share which particular Solr version you are using where you suspect that the Error is being eaten up. On Fri, Sep 16

Re: strange copied field problem

2011-09-21 Thread Tanner Postert
i believe that was the original configuration, but I can switch it back and see if that yields any results. On Wed, Sep 21, 2011 at 10:54 AM, Pulkit Singhal wrote: > I am NOT claiming that making a copy of a copy field is wrong or leads > to a race condition. I don't know that. BUT did you try to

Re: strange copied field problem

2011-09-21 Thread Pulkit Singhal
I am NOT claiming that making a copy of a copy field is wrong or leads to a race condition. I don't know that. BUT did you try to copy into the text field directly from the genre field? Instead of the genre_search field? Did that yield working queries? On Wed, Sep 21, 2011 at 12:16 PM, Tanner Post

SolrCloud state

2011-09-21 Thread Miguel Coxo
Hi there. I'm starting a new project using solr and i would like to know if solr is able to setup a cluster with fault tolerance. I'm setting up an environment with two shards. Each shard should have a replica. What i would like to know is if a shard master fails will the replica be "promoted" t

Re: FW: MMapDirectory failed to map a 23G compound index segment

2011-09-21 Thread Yongtao Liu
I hit a similar issue recently. I am not sure MMapDirectory is the right way to go. When an index file is mapped into RAM, the JVM calls the OS file-mapping function. The memory used is shared memory, so it may not be counted against the JVM process space. One problem I saw is that if the index file is bigger than physical RAM, an

strange copied field problem

2011-09-21 Thread Tanner Postert
i have 3 fields that I am working with: genre, genre_search and text. genre is a string field which comes from the data source. genre_search is a text field that is copied from genre, and text is a text field that is copied from genre_search and a few other fields. Text field is the default search

Re: Example setting TieredMergePolicy for Solr 3.3 or 3.4?

2011-09-21 Thread Shawn Heisey
On 9/20/2011 4:09 PM, Robert Muir wrote: yes, mergeFactor=10 is interpreted as both segmentsPerTier and maxMergeAtOnce. yes, specifying explicit TieredMP parameters will override whatever you set in mergeFactor (which is basically only interpreted to be backwards compatible) this is why i crea

Re: How to write core's name in log

2011-09-21 Thread Pulkit Singhal
Not sure if this is a good lead for you but when I run out-of-the-box multi-core example-DIH instance of Solr, I often see core name thrown about in the logs. Perhaps you can look there? On Thu, Sep 15, 2011 at 6:50 AM, Joan wrote: > Hi, > > I have multiple core in Solr and I want to write core n

Best Practices for indexing nested XML in Solr via DIH

2011-09-21 Thread Pulkit Singhal
Hello Everyone, I was wondering what are the various best practices that everyone follows for indexing nested XML into Solr. Please don't feel limited by examples, feel free to share your own experiences. Given an xml structure such as the following: cat001 E

Selective values for facets

2011-09-21 Thread ntsrikanth
Hi, The dataset I have is for special offers. We have a lot of offer codes, but I need to create a few facets for specific conditions only. For example, I have the following codes: ABCD, AGTR, KUYH, NEWY, NEWA, NEWB, EAS1, EAS2 And I need to create a facet like 'New Year Offers' mapped with NEW

LocalParams, bq, and highlighting

2011-09-21 Thread Demian Katz
I've run into another strange behavior related to LocalParams syntax in Solr 1.4.1. If I apply Dismax boosts using bq in LocalParams syntax, the contents of the boost queries get used by the highlighter. Obviously, when I use bq as a separate parameter, this is not an issue. To clarify, here

Re: boost a document which has a field not empty

2011-09-21 Thread Zoltan Altfatter
Yes, I am using edismax and the bq parameter did the trick. Thanks a lot. On Wed, Sep 21, 2011 at 3:59 PM, Ahmet Arslan wrote: > > I have one entity called organisation. I am indexing their > > name to be able > > to search afterwards on their name. > > I store also the website of the organisati

Re: MMapDirectory failed to map a 23G compound index segment

2011-09-21 Thread Robert Muir
On Tue, Sep 20, 2011 at 12:32 PM, Michael McCandless wrote: > > Or: is it possible you reopened the reader several times against the > index (ie, after committing from Solr)?  If so, I think 2.9.x never > unmaps the mapped areas, and so this would "accumulate" against the > system limit. In order

Re: boost a document which has a field not empty

2011-09-21 Thread Ahmet Arslan
> I have one entity called organisation. I am indexing their > name to be able > to search afterwards on their name. > I store also the website of the organisation. Some > organisations have a > website some don't. > Can I achieve that when searching for organisations even if > I have a match > on

Sort five random "Top Offers" to the top

2011-09-21 Thread MOuli
Hey Community. I have a Lucene/Solr index with many offers. Some of them are marked as top offers by a flag field "topoffer". Now I want to sort five random ones of these offers to the top. For example: HTC Sensation - topoffer = true HTC Desire - topoffer = false Samsung Galaxy S2 - topoffe

Re: boost a document which has a field not empty

2011-09-21 Thread Alexei Martchenko
Can you assign a doc boost at index time? 2011/9/21 Zoltan Altfatter > Hi, > > I have one entity called organisation. I am indexing their name to be able > to search afterwards on their name. > I store also the website of the organisation. Some organisations have a > website some don't. > Can I ac

Re: Problem using EdgeNGram

2011-09-21 Thread O. Klein
Try using KeywordTokenizerFactory instead of StandardTokenizerFactory to get the results you want.
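
A small Python sketch of why the tokenizer choice matters for edge n-grams. Whitespace splitting stands in for StandardTokenizerFactory here, which is a simplification, and the lowercasing, gram sizes and sample value are assumptions. With per-word tokens, a prefix of any word in the value matches; with the whole value kept as one token, only prefixes of the full value match.

```python
def edge_ngrams(token, min_size=1, max_size=15):
    """Return the prefixes of `token` with lengths in [min_size, max_size]."""
    longest = min(max_size, len(token))
    return [token[:n] for n in range(min_size, longest + 1)]

def index_terms(value, keyword=False):
    """Collect edge n-grams for a field value.

    keyword=True keeps the whole value as one token (keyword-style);
    keyword=False splits on whitespace, a rough stand-in for word tokenizing.
    """
    tokens = [value.lower()] if keyword else value.lower().split()
    terms = set()
    for token in tokens:
        terms.update(edge_ngrams(token))
    return terms

value = "Canon Camera"  # hypothetical record
print("cam" in index_terms(value))                # word tokens: matches mid-value word
print("cam" in index_terms(value, keyword=True))  # single token: no match
print("can" in index_terms(value, keyword=True))  # prefix of the whole value: matches
```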

Re: JSON response with SolrJ

2011-09-21 Thread Parvin Gasimzade
Hi, A similar question was asked before. Maybe it can help. http://lucene.472066.n3.nabble.com/SolrJ-Response-JSON-td1002024.html On Wed, Sep 21, 2011 at 3:01 PM, Kissue Kissue wrote: > Hi, > > I am using solr 3.3 with SolrJ. Does anybody have any idea how i can > retrieve JSON response with SolrJ? Is

JSON response with SolrJ

2011-09-21 Thread Kissue Kissue
Hi, I am using solr 3.3 with SolrJ. Does anybody have any idea how i can retrieve JSON response with SolrJ? Is it possible? It seems to be more focused on XML and Beans. Thanks.

Problem using EdgeNGram

2011-09-21 Thread Kissue Kissue
Hi, I am using solr 3.3 with SolrJ. I am trying to use EdgeNgram to power auto suggest feature in my application. My understanding is that using EdgeNgram would mean that results will only be returned for records starting with the search criteria but this is not happening for me. For example if i

Fuzzy Suggester

2011-09-21 Thread O. Klein
>From http://wiki.apache.org/solr/Suggester: JaspellLookup can provide "fuzzy" suggestions, though this functionality is not currently exposed (it's a one line change in JaspellLookup). Anybody know what change this would have to be? -- View this message in context: http://lucene.472066.n3.nabb

Re: Solr Indexing - Null Values in date field

2011-09-21 Thread Gora Mohanty
On Wed, Sep 21, 2011 at 4:08 PM, mechravi25 wrote: > Hi, > > I have a field in my source with data type as string and that field has NULL > values. I am trying to index this field in solr as a date data type with > multivalued = true. Following is the entry for that field in my schema.xml [...] O

Solr Indexing - Null Values in date field

2011-09-21 Thread mechravi25
Hi, I have a field in my source with data type as string and that field has NULL values. I am trying to index this field in solr as a date data type with multivalued = true. Following is the entry for that field in my schema.xml When I try to index, I get the following exception org.apache.sol

boost a document which has a field not empty

2011-09-21 Thread Zoltan Altfatter
Hi, I have one entity called organisation. I am indexing their name to be able to search afterwards on their name. I store also the website of the organisation. Some organisations have a website some don't. Can I achieve that when searching for organisations even if I have a match on their name I