Is it possible to set a maximum number of indexed documents in Solr?

2010-01-21 Thread Daniel Angelov
Is it possible to set a maximum number of indexed documents in Solr? For example, I want to insert at most 5000 documents into Solr; after that, Solr must refuse further inserts.

Re: Solr query single entity?

2010-01-21 Thread Shalin Shekhar Mangar
On Wed, Jan 20, 2010 at 9:47 PM, fredanthony wrote: > > Hi, I have Solr setup to use a DataImportHandler with my database. In the > data-config.xml file I have one document with two entities as follows: > > >query="SELECT user_id, user_id as pk_field, >

Find newly added documents

2010-01-21 Thread Sandeep Tagore
Hi All, Is it possible to find the newly added documents in Solr (before or after commit)? To make it clear... Let's assume that there are 1000 docs in the index and we add new docs with ids ab12, bc34, cd56 to the index. Is there a way to get the list of these new docs with any query? Thanks in

Solr configuration issue for sorting on title field

2010-01-21 Thread EL KASMI Hicham
Hello again, We have a problem with sorting on title field in Solr instance of our production repository, we get the error message: "HTTP Status 500 - there are more terms than documents in field "titleStr", but it's impossible to sort on tokenized fields". After some googling and searching in

Save a search

2010-01-21 Thread Nicolas Kern
Hi, Is there a way to save a search performed on Solr, like it's possible to do on here : http://www.yankeegroup.com/search.do?searchType=advancedSearch ? I'd like to add the same functions to my company's Solr. Best Regards and thanks, Nicolas

Re: Save a search

2010-01-21 Thread Erik Hatcher
On Jan 21, 2010, at 5:52 AM, Nicolas Kern wrote: Is there a way to save a search performed on Solr, like it's possible to do on here : http://www.yankeegroup.com/search.do?searchType=advancedSearch ? I'd like to add the same functions to my company's Solr. No, Solr doesn't have that capabi

Re: Replication Handler Severe Error: Unable to move index file

2010-01-21 Thread Noble Paul നോബിള്‍ नोब्ळ्
Is it a one-off case? Do you observe this frequently? On Thu, Jan 21, 2010 at 11:26 AM, Otis Gospodnetic wrote: > It's hard to tell without poking around, but one of the first things I'd do > would be to look for /home/solr/cores/core8/index.20100119103919/_6qv.fnm - > does this file/dir rea

servlet forwarding solrj request/response

2010-01-21 Thread Daniel Angelov
Is it possible to make a servlet which collects some information/statistics about SolrJ requests/responses between another web application and the Solr server? For example, I have a JBoss web application for adding/selecting documents from Solr, but I want to collect some information about these operations in another web appl

Re: servlet forwarding solrj request/response

2010-01-21 Thread Erik Hatcher
sure, you could put a servlet filter in Solr's web.xml to capture whatever you like. another option would be to hook into Solr's logging and fire events/ data off elsewhere. or perhaps a proxy in the middle that forwards requests on to Solr, but captures however you like. Erik O

Re: servlet forwarding solrj request/response

2010-01-21 Thread Daniel Angelov
Thanks Erik, "or perhaps a proxy in the middle that forwards requests on to Solr, but captures however you like." That is what I am looking for. How can I implement this kind of proxy? I tried the RequestDispatcher forward method of the servlet API, but when the JBoss application requests docs via SolrJ (from to

Re: Find newly added documents

2010-01-21 Thread Erik Hatcher
You'll be able to find them only after a commit. One way to do this is index a timestamp with every document, and find the latest ones using that field. There's an example of an automatic timestamp field in the example schema. Erik On Jan 21, 2010, at 4:16 AM, Sandeep Tagore wrot
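The timestamp approach described above could be sketched as a small query builder. This is a minimal sketch under stated assumptions: the date field is assumed to be named `timestamp` (as in the Solr example schema), the base URL is hypothetical, and the range uses Solr date math (`NOW-1HOUR`). None of these specifics appear in the original message.

```python
from urllib.parse import urlencode


def recent_docs_url(base_url, field="timestamp", window="1HOUR", rows=10):
    """Build a /select URL for documents indexed within the given window.

    Assumes `field` is a date field populated at index time; the field
    name and window are illustrative, not taken from the thread.
    """
    params = {
        # Range query using Solr date math: everything from NOW-window to NOW.
        "q": f"{field}:[NOW-{window} TO NOW]",
        # Newest documents first.
        "sort": f"{field} desc",
        "rows": rows,
    }
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical Solr location; adjust to the real host and core.
url = recent_docs_url("http://localhost:8983/solr")
print(url)
```

As the reply notes, such a query only sees documents after a commit.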

Re: question

2010-01-21 Thread Shalin Shekhar Mangar
On Thu, Jan 21, 2010 at 1:21 PM, Daniel Angelov < dani.b.ange...@googlemail.com> wrote: > Is it possible to set a maximum number of indexed documents in Solr? For example, I want > to insert at most 5000 documents into Solr; after that, Solr must refuse further inserts. > No but you can do it in your indexing application o
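Enforcing the limit in the indexing application, as suggested above, might look roughly like the following. This is a sketch only: `LimitedIndexer`, `DocumentLimitExceeded`, and the two injected callables are hypothetical names, and the in-memory list stands in for a real Solr client. It also ignores concurrent writers and the fact that re-adding an existing id overwrites a document rather than adding one.

```python
class DocumentLimitExceeded(Exception):
    """Raised when an add would push the index past the configured limit."""


class LimitedIndexer:
    """Client-side guard that refuses adds beyond max_docs.

    count_docs: callable returning the current number of indexed docs
                (for Solr this could be numFound of a *:* query).
    add_docs:   callable that actually sends a list of docs to the index.
    Both are injected so the guard itself has no Solr dependency.
    """

    def __init__(self, count_docs, add_docs, max_docs=5000):
        self.count_docs = count_docs
        self.add_docs = add_docs
        self.max_docs = max_docs

    def add(self, docs):
        docs = list(docs)
        current = self.count_docs()
        if current + len(docs) > self.max_docs:
            raise DocumentLimitExceeded(
                f"{current} docs indexed, adding {len(docs)} exceeds {self.max_docs}"
            )
        self.add_docs(docs)
        return current + len(docs)


# Demo with an in-memory list standing in for the index.
store = []
indexer = LimitedIndexer(lambda: len(store), store.extend, max_docs=3)
indexer.add([{"id": "a"}, {"id": "b"}])
try:
    indexer.add([{"id": "c"}, {"id": "d"}])
except DocumentLimitExceeded as exc:
    print("refused:", exc)
```

The same check could live server-side in a custom update processor, which is the direction the thread goes on to discuss.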

Re: question

2010-01-21 Thread Daniel Angelov
Thanks Shalin, your proposal is good. Could you give me a link where I can read some documentation about your idea? If I write a class extending UpdateRequestProcessor, where do I have to put it so that the requests to Solr go through that new class? Daniel Angelov On Thu, Jan 21, 2010 at 3:05 P

Re: question

2010-01-21 Thread Daniel Angelov
My case is: I have two web applications, the first in JBoss, the second in Tomcat. The second knows the maximum number of docs, but the first creates the new docs, so I wonder how I can control the indexing (from JBoss) through the Tomcat application. The Solr server is in Tomcat. Thanks. On Thu, Jan 21, 2010 at 3:05 PM, Shalin Shekhar Mangar <

Re: filter query granularity

2010-01-21 Thread Andre Parodi
according to the wiki you could avoid having 3 filter queries cached by putting multiple fq parameters: "Given the following three filtering scenarios of (a) x:bla, (b) y:blub, and (c) x:bla AND y:blub, will I end up with two or three distinct filters? In other words, may filters be composites or

Choosing what document to return.

2010-01-21 Thread Noam G.
Hi Guys, I'll start by thanking every one for an amazing search engine! What am I trying to do? Basically what I need is the ability to do some kind of "switch" or an "if-else" flow based on the field value and a parameter that I will pass using the query string. The result will be documents t

Re: Choosing what document to return.

2010-01-21 Thread Smiley, David W.
I don't understand your question. Are myval1...3 fixed and controlled by your interface or are they an arbitrary N number of query words from a user query? In any case, you may need to write a request handler. ~ David Smiley Author: http://www.packtpub.com/solr-1-4-enterprise-search-server/ O

Re: Choosing what document to return.

2010-01-21 Thread Erick Erickson
What's the higher-level task you're trying to accomplish? Because on a quick read it looks like you're trying something that's akin to a join between two indexes, and if that's accurate I have great fears about performance. I guess it boils down to how costly getting your other_index_value is

Re: Choosing what document to return.

2010-01-21 Thread Chantal Ackermann
Hi Noam, if you know about those rules at index time, you should put the information into the index and simply query only for those documents that have the desired value(s). Cheers, Chantal Noam G. wrote: Hi Guys, I'll start by thanking every one for an amazing search engine! What am I

commit fails on weblogic

2010-01-21 Thread Joe Kessel
Using Solr 1.4 and the StreamingUpdateSolrServer on Weblogic 10.3 and get the following error on commit. The data seems to load fine, and the same code works fine with Tomcat. On the client side an Internal Server Error is reported. Thanks, Joe weblogic.utils.NestedRuntimeException:

Solr vs. Compass

2010-01-21 Thread Ken Lane (kenlane)
We are knee-deep in a Solr project to provide a web services layer between our Oracle DB's and a web front end to be named later to supplement our numerous Business Intelligence dashboards. Someone from a peer group questioned why we selected Solr rather than Compass to start development. The real

Re: Choosing what document to return.

2010-01-21 Thread Wangsheng Mei
I am not sure whether you are talking about somewhat complex boolean queries. According to standard boolean logic, a complex if-else can be normalized using union, intersection, and not. You omitted some of your logic; let's assume it is: if( FieldA == myval1) { if(FieldB == 1){ don't return the

Re: Choosing what document to return.

2010-01-21 Thread Noam G.
Hi All (Erick, David...) Thanks for replying. Let me try to elaborate: - Each document holds a string value called name, an int parameter called payout a parameter called network_id. - Name can be any string. - Payout can be any number. - Network_id can be one of maximum 20 available values (fo

Re: Choosing what document to return.

2010-01-21 Thread Noam G.
Hi Mei, The problem is that I need to do a calculation on the field. So I assumed I need to extend one of the classes in order to be able to do a calculation on a field with if-else capabilities. Don't I? Noam. -- View this message in context: http://old.nabble.com/Choosing-what-document-

Hindi language support in solr

2010-01-21 Thread Ranveer kumar
Hi all, I am very new to Solr. I downloaded the latest release, 1.4, and installed it. For indexing and searching I am using the SolrJ API. My question is "How do I enable Solr to search Hindi language text?". Please help me. Thanks, with regards, Ranveer K Kumar

Re: Solr vs. Compass

2010-01-21 Thread Lukáš Vlček
Hi, I think that these products do not compete directly that much; each fits a different business case. Can you tell us more about your specific situation? What do you need to search and where is your data? (DB, Filesystem, Web ...?) Solr provides some specific extensions which are not supported dire

Re: Choosing what document to return.

2010-01-21 Thread Wangsheng Mei
hi, Noam, you can try this one, name:abc AND ((network_id:network1 AND payout:[minval TO *]) OR -network_id:network1) 2010/1/21 Noam G. > > Hi All (Erick, David...) > > Thanks for replying. > > Let me try to elaborate: > - Each document holds a string value called name, an int parameter called >
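The intent of the suggested query can be checked against a few sample documents with a sketch like the one below. The field names come from the thread; the sample documents, the threshold value, and the `matches` helper are hypothetical, and `name` is assumed to be matched exactly. One hedged observation that is not from the thread: the Lucene query parser treats `-` as a prohibition rather than a boolean NOT, so the negative branch may need to be written as `(*:* -network_id:network1)` to behave as intended.

```python
def matches(doc, name, network, min_payout):
    """Mirror the intended logic of:
    name:abc AND ((network_id:network1 AND payout:[minval TO *])
                  OR -network_id:network1)
    """
    if doc["name"] != name:
        return False
    if doc["network_id"] == network:
        # Inside the named network, the payout threshold applies
        # (range bounds with [ ] are inclusive).
        return doc["payout"] >= min_payout
    # Documents from any other network pass regardless of payout.
    return True


# Hypothetical sample documents.
docs = [
    {"id": 1, "name": "abc", "network_id": "network1", "payout": 10},
    {"id": 2, "name": "abc", "network_id": "network1", "payout": 3},
    {"id": 3, "name": "abc", "network_id": "network2", "payout": 1},
    {"id": 4, "name": "xyz", "network_id": "network2", "payout": 50},
]

hits = [d["id"] for d in docs if matches(d, "abc", "network1", 5)]
print(hits)  # prints [1, 3]
```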

Re: Solr vs. Compass

2010-01-21 Thread Uri Boness
In addition, the biggest appealing feature in Compass is that it's transactional and therefore integrates well with your infrastructure (Spring/EJB, Hibernate, JPA, etc...). This obviously is nice for some systems (not very large scale ones) and the programming model is clean. On the other hand

Re: Improvising solr queries

2010-01-21 Thread dipti khullar
Hi Sorry for getting back late on the thread, but we are focusing on configuration of master and slave for improving performance issues. We have observed following trend on production slaves: After every 10 minutes the response time increases considerably. In between all the queries are served by

Re: Solr vs. Compass

2010-01-21 Thread Erick Erickson
SOLR is, first and foremost, a text searching tool that scales. Are you searching lots of text here or not? There are situations in which you need both in order to accomplish your business needs, so asking "which one is best" is tricky to answer FWIW Erick On Thu, Jan 21, 2010 at 10:40 AM, Ke

Re: Improvising solr queries

2010-01-21 Thread Erick Erickson
What this looks like (and I've only glanced) is that your index updates are causing a new searcher to be opened, and the first few queries after the reopen will be slow. Have you tried warmup queries after the reopen? FWIW Erick On Thu, Jan 21, 2010 at 11:48 AM, dipti khullar wrote: > Hi > > So

RE: Solr vs. Compass

2010-01-21 Thread Ken Lane (kenlane)
Uri, Lucas, Thanks for your feedback. To clarify on some specifics, 1. Yes, faceted search and DisMax are very important to this project. 2. Our data is imported from Oracle tables. (Unstructured sources maybe later). We manufacture each document from DB queries. 3. Our platform won't be transac

Re: Choosing what document to return.

2010-01-21 Thread Noam G.
Hi Mei, I replied to you directly by mistake - so here is the message again - almost the same ;-) I think I'm getting it now :-) Let's take your suggestion: name:abc AND ((network_id:network1 AND payout:[minval TO *]) OR -network_id:network1) What I need is not the value of payout but, 50% o

Re: Choosing what document to return.

2010-01-21 Thread Wangsheng Mei
2010/1/22 Noam G. > > Hi Mei, > > I replied to you directly by mistake - so here is the message again - > almost the same ;-) > > I think I'm getting it now :-) > > Let's take your suggestion: > name:abc AND ((network_id:network1 AND payout:[minval TO *]) OR > -network_id:network1) > > What I n

tomcat support

2010-01-21 Thread Adamsky, Robert
Have been running solr 1.3 on tomcat 5.0.28 without issue. Went to use 1.4 and it doesn't load - causing server not to start. It does show a few solr log messages along the way but only info. Does Solr 1.4 still support tomcat 5.0.28? I did try it with tomcat 5.5.28 without issue but the upgrad

RE: Solr vs. Compass

2010-01-21 Thread Minutello, Nick
Not sure how many here have used both ... I've used raw Lucene in the past - and after that, Compass. More recently Solr. Here are some of the things I have noticed: 1) Stating the obvious: Solr has a server capability that Compass/Lucene does not. This means indexing/searching is available to

Re: Hindi language support in solr

2010-01-21 Thread Robert Muir
hello, take a look at field type "textgen" (a general unstemmed text field) the whitespacetokenizer + worddelimiterfilter used by this type will work correctly for hindi tokenization and punctuation. On Thu, Jan 21, 2010 at 10:55 AM, Ranveer kumar wrote: > Hi all, > > I am very new in solr. > I

RE: Solr vs. Compass

2010-01-21 Thread Minutello, Nick
Oops! >> Solr does a number of things that are really nice (that aren't really >> addressed by Solr) I obviously meant: "Solr does a number of things that are really nice (that aren't really addressed by Compass)" -N -Original Message- From: Minutello, Nick Sent: 21 January 2010 1

Beyond Basic Faceted Search (SOLR-236|SOLR-64|SOLR-792)

2010-01-21 Thread Kelly Taylor
I'm currently using the latest SOLR-236 patch (12/24/2009) and field-collapsing seems to be giving me the desired results, but I'm wondering if I should focus more on a tree view of my catalog data instead, as described in "Beyond Basic Faceted Search" Is it possible that either or both of the p

performance issue

2010-01-21 Thread Matthieu Labour
Hi I have been requested to look at a solr instance that has been patched with our own home grown patch to be able to handle 1000 cores on a solr instance The solr instance doesn't perform well. Within 12 hours, I can see the garbage collection taking a lot of time and query & update requests are

RE: Solr vs. Compass

2010-01-21 Thread Adamsky, Robert
> 2) Compass does a number of things really nicely (that afaik, isn't addressed > by Solr) > + Object-search engine mapping (great for structured data - i.e. not just > text documents). I find writing the code that converts to/from a SolrDocument > a bit annoying (but in my current project, the

Re: tomcat support

2010-01-21 Thread David MARTIN
I haven't got any information about the tomcat/solr compatibility matrix, but you can easily have instances of tomcat running together, each having a different version (with different jvm versions too). It may be simpler to fine tune such a dedicated instance. David 2010/1/21, Adamsky, Robert : >

RE: Solr vs. Compass

2010-01-21 Thread Minutello, Nick
Actually, that's true. But IMO it's not that great :) After fighting it for a bit, we gave up on it ... (maybe more of a reflection of our capabilities rather than Solr's - but I'd like to think we are some-way competent) -N -Original Message- From: Adamsky, Robert [mailto:radam...@techt

Re: Choosing what document to return.

2010-01-21 Thread Noam G.
Hi Mei, You are right. Suddenly it looks clear - is it too late? :-) Let me look at it again tomorrow. Thank you very much :-) Noam. Wangsheng Mei wrote: > > 2010/1/22 Noam G. > >> >> Hi Mei, >> >> I replied to you directly by mistake - so here is the message again - >> almost the sam

create requesthandler with default shard parameter for different query parser

2010-01-21 Thread Joe Calderon
hello *, what is the best way to create a requesthandler for distributed search with a default shards parameter but that can use different query parsers thus far i have *,score json host0:8080/solr/core0,host1:8080/solr/core1,host2:8080/solr/core2,localhost:8080

Re: create requesthandler with default shard parameter for different query parser

2010-01-21 Thread Yonik Seeley
On Thu, Jan 21, 2010 at 2:39 PM, Joe Calderon wrote: > hello *, what is the best way to create a requesthandler for > distributed search with a default shards parameter but that can use > different query parsers > > thus far i have > >   >     >     >       *,score >       json >       name="sha

solr 1.4 full index replication not closing previous index file handles on slaves

2010-01-21 Thread Nicholas Letourneau
Description of our setup: We rebuild our entire multi-core index nightly and have it on a master server. Replication is always triggered manually, and a post replication cleaning script is run to remove the previous days index and allow enough drive space for the following day. After the clea

Re: create requesthandler with default shard parameter for different query parser

2010-01-21 Thread Joe Calderon
thx much, i see now, having request handlers with the same name as the query parsers was confusing me, i do however have an additional problem, if i use defType it does indeed use the right query parser but is there a way to not send all the query parameters in the url (qf, pf, bf etc), its the mai

Re: TermsComponent, multiple fields, total count

2010-01-21 Thread Lukas Kahwe Smith
On 20.01.2010, at 15:50, Lukas Kahwe Smith wrote: > > On 19.01.2010, at 22:52, Lukas Kahwe Smith wrote: > I also want to match multiple fields at once. >>> >>> Can you give an example? >> >> >> I enter "Kreuz" but this could either be part of a persons name or of a >> street name, whic

Re: create requesthandler with default shard parameter for different query parser

2010-01-21 Thread Yonik Seeley
On Thu, Jan 21, 2010 at 3:05 PM, Joe Calderon wrote: > thx much, i see now, having request handlers with the same name as the > query parsers was confusing me, i do however have an additional > problem, if i use defType it does indeed use the right query parser > but is there a way to not send all

Fwd: performance issue

2010-01-21 Thread Matthieu Labour
Hi I have been requested to look at a solr instance that has been patched with our own home grown patch to be able to handle 1000 cores on a solr instance The solr instance doesn't perform well. Within 12 hours, I can see the garbage collection taking a lot of time and query & update requests are

Re: Replication Handler Severe Error: Unable to move index file

2010-01-21 Thread Trey
Unfortunately, when I went back to look at the logs this morning, the log file had been blown away... that puts a major damper on my debugging capabilities - so sorry about that. As a double whammy, we optimize nightly, so the old index files have completely changed at this point. I do not rememb

Re: Is it possible to set a maximum number of indexed documents in Solr?

2010-01-21 Thread Lance Norskog
There is no limit on the number of documents. If you want this feature, please open a JIRA ticket requesting it. Managing large indexes would benefit from limiting things like # of documents, # of terms, total disk space, etc. On Thu, Jan 21, 2010 at 12:08 AM, Daniel Angelov wrote: > Is it posib

Re: Replication Handler Severe Error: Unable to move index file

2010-01-21 Thread Lance Norskog
I did not have good luck with super-high-speed polling. You probably need to adjust the various parameters on both sides of the replication. Some sites (LinkedIn for example with Zoie) do not use replication. They have all query servers do their own indexing, so that new content will be available

Re: commit fails on weblogic

2010-01-21 Thread Lance Norskog
There might be a limit in Weblogic on the number or length of parameters allowed in a POST. On Thu, Jan 21, 2010 at 7:37 AM, Joe Kessel wrote: > > Using Solr 1.4 and the StreamingUpdateSolrServer on Weblogic 10.3 and get the > following error on commit.  The data seems to load fine, and the same

using term vectors

2010-01-21 Thread Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS]
Hi, I am trying termVectorComponents in SOLR. Per wiki I am trying to define component and handler. I define it so: true tvComponent ... But when I qu

Re: tomcat support

2010-01-21 Thread Lance Norskog
http://wiki.apache.org/solr/SolrTomcat I'm impressed; I haven't looked at this problem in a long time and the wiki page is nice. On Thu, Jan 21, 2010 at 10:43 AM, David MARTIN wrote: > I haven't got any information about the tomcat/solr compatibility > matrix, but you can easily have instances o

Re: TermsComponent, multiple fields, total count

2010-01-21 Thread Lance Norskog
You can use the spelling dictionary feature for auto-complete. This is actually more useful than facet.prefix because normal people are not perfect spellers. (It would be even more useful based on phonemes, but I haven't figured out how to do that yet.) On Tue, Jan 19, 2010 at 6:28 PM, Erik Hatche

Re: using term vectors

2010-01-21 Thread Koji Sekiguchi
Tim, You should define the search component in solrconfig.xml, not schema.xml. Koji -- http://www.rondhuit.com/en/ Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS] wrote: Hi, I am trying termVectorComponents in SOLR. Per wiki I am trying to define component and handler. I define it so:

Re: performance issue

2010-01-21 Thread Lance Norskog
Which version of Solr? Java? What garbage collection parameters? On Thu, Jan 21, 2010 at 1:03 PM, Matthieu Labour wrote: > Hi > > I have been requested to look at a solr instance that has been patched with > our own home grown patch to be able to handle 1000 cores on a solr instance > > The solr

ArrayIndexOutOfBoundsException when highlighting (Solr 1.4)

2010-01-21 Thread Tim Underwood
I'm seeing an java.lang.ArrayIndexOutOfBoundsException when trying to highlight for certain queries. The error seems to be an issue with the combination of the ShingleFilterFactory, PositionFilterFactory and the LengthFilterFactory. Here's my fieldType definition:

MoreLikeThis - How to pass in external text?

2010-01-21 Thread ldung
How can I have the MoreLikeThis query process a piece of text that is passed into the query? Currently I can only get MoreLikeThis to work for pieces of text that are already indexed by Solr. For example, here is a query that works for using MoreLikeThis for the document with id:134847893. h

Re: Solr vs. Compass

2010-01-21 Thread Uri Boness
There seems to be an implication that Compass won't scale as well as Solr - and I'm not sure that's true at all. They will both scale as well as the underlying Lucene. Lucene doesn't handle distributed search or replication out of the box; you have to implement it using some of its features (d

How to store a dense field value efficiently

2010-01-21 Thread Dallan Quass
Hi, I want to issue queries where queried fields have a specified value or are "missing". I know that I can query missing values using a negated full-range query, but it doesn't seem like that's very efficient (the fields in question have a lot of possible values). So I've opted to store special

Re: MoreLikeThis - How to pass in external text?

2010-01-21 Thread Otis Gospodnetic
Hi, if you have text to pass in, why do you need MoreLikeThis? The text you speak of can be used as a normal query, so pass it in as a regular multi-word query. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: ldung > To: solr-user@lucen

Re: Improvising solr queries

2010-01-21 Thread Otis Gospodnetic
Dipti, If I'm reading that correctly, you are optimizing the index on the master before replicating it? There is no need to do that if you are constantly updating your index and replicating it every 10 minutes. Don't optimize, and you'll replicate smaller portion of an index, and thus you won't

Re: MoreLikeThis - How to pass in external text?

2010-01-21 Thread ldung
I want to use MoreLikeThis since I want to find text in the Solr data that is similar to the input text. I want to see how well this works against just a standard keyword search. I want to do something similar to the article below. http://www.bbc.co.uk/blogs/radiolabs/2008/06/wikipedia_plus_lucen

Re: Solr configuration issue for sorting on title field

2010-01-21 Thread Otis Gospodnetic
Hi, Long message. I skimmed through your configs. It looks like your main question is how can changing the field type (or, really, turning off "multiValued" on a field) cause the number of documents in your index to decrease, right? Well, it can't or shouldn't. I am guessing you simply did s

Re: Solr vs. Compass

2010-01-21 Thread Otis Gospodnetic
Hi Ken, Based on this, Solr sounds like the way to go. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Ken Lane (kenlane) > To: solr-user@lucene.apache.org > Sent: Thu, January 21, 2010 12:07:56 PM > Subject: RE: Solr vs. Compass > > Uri

Re: solr blocking on commit

2010-01-21 Thread Otis Gospodnetic
Hi, I'm missing the bigger context of this thread here, but from the snippet below - sure, commits cause in-memory index to get written to disk, that causes some IO, and that *could* affect search *if* queries are running on the same box. When index and/or query volume is high, one typically p

Re: MoreLikeThis - How to pass in external text?

2010-01-21 Thread Otis Gospodnetic
Hi, Try what I suggested, please. Or, if you want, go to that (or any other) web page, copy a large chunk of its content, and paste it into Google/Yahoo/Bing. I just did that. Google said my query was too long, but Yahoo took it. Guess what hit #1 was? The page I copied the text from! Ver

Re: Contributors - Solr in Action Case Studies

2010-01-21 Thread Otis Gospodnetic
Hi Tom, hi Tom :) Yummy goodness. Lots of data. Big books. Thank you, I will be in touch. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Tom Burton-West > To: solr-user@lucene.apache.org > Sent: Wed, January 20, 2010 5:17:39 PM > Subje

Re: Improvising solr queries

2010-01-21 Thread dipti khullar
Hi Erick, thanks for your reply. I am not sure what exactly you mean by warmup queries. But if it's related to the settings we are using in solrconfig.xml, the following are the configurations for query caching: Also, as we are using snapinstall script on slaves, which eventually calls commit script.

Re: NullPointerException in ReplicationHandler.postCommit + question about compression

2010-01-21 Thread Stephen Weiss
Hi Shalin, Thanks for your reply. Please see below. On Jan 18, 2010, at 4:19 AM, Shalin Shekhar Mangar wrote: On Wed, Jan 13, 2010 at 12:51 AM, Stephen Weiss wrote: ... When we replicate manually (via the admin page) things seem to go well. However, when replication is triggered by a

Solr build error in Eclipse IDE

2010-01-21 Thread dhamu
Hi, I am a newbie to Solr. I have just configured Solr in the Eclipse IDE. While building the Solr project, I got the following errors. How do I fix them? The method getTextContent() is undefined for the type Node TestConfig.java solr/src/test/org/apache/solr/core line 88 Java Problem The method