XML Update overwrite?

2011-05-12 Thread Denis Kuzmenok
Hi. I try to understand the meaning of overwrite="false" in xml that i post with post.jar. I have two possible behaviour: 1) if the document with specified uniquekey exists - it's not updated (even if some fields are changed) 2) if the document with specified uniquekey exists and all

Re: Facet Count Based on Dates

2011-05-12 Thread Otis Gospodnetic
Jasneet, Like in http://wiki.apache.org/solr/FieldCollapsing ? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Jasneet Sabharwal > To: solr-user@lucene.apache.org > Sent: Thu, May 12,

Re: Changing the schema

2011-05-12 Thread Otis Gospodnetic
Brian, Yes, you do need to reindex. We've used Hadoop with Solr to speed up indexing by orders of magnitude for some of our customers. Something to consider. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Origin

Re: Support for huge data set?

2011-05-12 Thread Otis Gospodnetic
With that many documents, I think GSA cost might be in millions of USD. Don't go there. 300 MB docs might be called medium these days. Of course, if those documents themselves are huge, then it's more resource intensive. 10 TB sounds like a lot when it comes to search, but it's hard to tell

Re: Faceting question

2011-05-12 Thread Otis Gospodnetic
Hi, I think there is a bit of a mixup here. Facets are not about which field a match was on, but about what values hits have in one or more fields you facet on. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Ori

Re: K-Stemmer for Solr 3.1

2011-05-12 Thread Bernd Fehling
I backported a Lucid KStemmer version from solr 4.0 which I found somewhere. Just changed from import org.apache.lucene.analysis.util.CharArraySet; // solr4.0 to import org.apache.lucene.analysis.CharArraySet; // solr3.1 Bernd Am 12.05.2011 16:32, schrieb Mark: java.lang.AbstractMethodError:

Fieldcollapsing patch not applied properly

2011-05-12 Thread Isha Garg
Hi kai, As per your previous mails you have already applied the patches with solr 1.4.I followed the steps of your mail accordingly . But During step 9 i got the error # 1 out of 1 hunked failed.When I apply ony SOLR-236-1_4_1-paging-totals-working.patch it build successfully but the c

Fieldcollapsing patxh not applied properly

2011-05-12 Thread Isha Garg
Hi kai, as per your previous mails you have already applied the patches with solr 1.4.I followed the steps of your mail accordingly . But During step 9 i got the error # 1 out of 1 hunked failed.When I apply ony SOLR-236-1_4_1-paging-totals-working.patch it build successfully but the

Faceting question

2011-05-12 Thread Mark
Is there anyway to perform a search that searches across 2 fields yet only gives me facets accounts for documents matching 1 field? For example If I have fields A & B and I perform a search across I would like to match my query across either of these two fields. I would then like facet counts

Re: DIH help request: nested xml entities and xpath

2011-05-12 Thread Ashique
Hi All, I am a Java/J2ee programmer and very new to SOLR. I would like to index a table in a postgresSql database to SOLR. Then searching the records from a GUI (Jsp Page) and showing the results in tabular form. Could any one help me out with a simple sample code. Thank you. Regards, Ashique

solr velocity.log setting

2011-05-12 Thread Yuhan Zhang
hi all, I'm new to solr, and trying to install it on tomcat. however, an exception was reached when the page http://localhost/sorl/browse was visited: *FileNotFoundException: velocity.log (Permission denied) * looks like solr is trying to create a velocity.log file to tomcat root. so, how shoul

DIH help request: nested xml entities and xpath

2011-05-12 Thread Weiss, Eric
Apologies in advance if this topic/question has been previously answered…I have scoured the docs, mail archives, web looking for an answer(s) with no luck. I am sure I am just being dense or missing something obvious…please point out my stupidity as my head hurts trying to get this working. So

Re: Replication Clarification Please

2011-05-12 Thread Ravi Solr
Thank you Mr. Bell and Mr. Kanarsky, as per your advise we have moved from 1.4.1 to 3.1 and have made several changes to configuration. The configuration changes have worked nicely till now and the replication is finishing within the interval and not backing up. The changes we made are as follows

Re: field type=string vs field type=text

2011-05-12 Thread Tomás Fernández Löbbe
Hi, my recommendation: To quickly understand the difference between those two different field types, index one document using string and text fields, then facet on those fields and you will see how the terms were indexed. Using one field type or the other will depend on what you want to do with th

A couple newbie questions

2011-05-12 Thread Stuart Smith
Hello!   I just started using Solr. My general use case is pushing a lot of data from Hbase to solr via an M/R job using Solrj. I have lots of questions, but the ones I'd like to start with are: (1) I noticed this: http://lucene.472066.n3.nabble.com/what-happens-to-docsPending-if-stop-solr-befor

Re: field type=string vs field type=text

2011-05-12 Thread Gora Mohanty
On Thu, May 12, 2011 at 8:23 PM, chetan wrote: > What is the difference between setting a fields type to string vs setting it > to text. > > e.g. > > or > [...] Please take a closer look at the fieldType definitions towards the beginning of the default schema.xml. The "text" type has tokenizers

field type=string vs field type=text

2011-05-12 Thread chetan
What is the difference between setting a fields type to string vs setting it to text. e.g. or -- View this message in context: http://lucene.472066.n3.nabble.com/field-type-string-vs-field-type-text-tp2932083p2932083.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Support for huge data set?

2011-05-12 Thread atreyu
Thanks for the detailed response, Jonathon. I will look into the links and check out SolrCloud and Distributed Search. Load-sharing b/t 2 or 3 servers should not pose a problem, so long as it is robust (or at least not slower), fault-tolerant, and reliable. -- View this message in context: http

Re: What is correct use of HTMLStripCharFilter in Solr 3.1

2011-05-12 Thread Ahmet Arslan
> Wait, then what's the HTMLStripCharFilter for? To remove html tags in the analysis phase. For instance it can be used to display original html documents with search terms highlighted.

Re: What is correct use of HTMLStripCharFilter in Solr 3.1

2011-05-12 Thread Mike Sokolov
It preserves the location of the terms in the original HTML document so that you can highlight terms in HTML. This makes it possible (for instance) to display the entire document, with all the search terms highlighted, or (with some careful surgery) to display formatted HTML (bold, italic, etc

Re: Support for huge data set?

2011-05-12 Thread Jonathan Rochkind
If each document is VERY small, it's actually possible that one Solr server could handle it -- especially if you DON'T try to do facetting or other similar features, but stick to straight search and relevancy. There are other factors too. But # of documents is probably less important than total

Re: What is correct use of HTMLStripCharFilter in Solr 3.1

2011-05-12 Thread Jonathan Rochkind
On 5/12/2011 2:55 PM, Ahmet Arslan wrote: I recently upgraded from Solr 1.3 to Solr 3.1 in order to take advantage of the HTMLStripCharFilter. But it isn't working as I expected. You need to strip html tag before analysis phase. If you are using DIH, you can use stripHTML="true" transformer.

Re: Anyone familiar with Solandra or Lucandra?

2011-05-12 Thread Smiley, David W.
The old name is "Lucandra" not Lucendra. I've changed the subject accordingly. I'm looking forward to responses from people but I'm afraid it appears it has not yet gotten much uptake yet. I think it has enormous potential once it's hardened a bit and there's more documentation. Personally, I've

Re: What is correct use of HTMLStripCharFilter in Solr 3.1

2011-05-12 Thread Ahmet Arslan
> I recently upgraded from Solr 1.3 to Solr 3.1 in order to > take advantage of > the HTMLStripCharFilter. But it isn't working as I > expected. > > I have a text field that may contain HTML tags. I however > would like to > store it in Solr without the HTML tags. And retrieve the > text field for

Re: Anyone familiar with Solandra or Lucendra?

2011-05-12 Thread kenf_nc
I modified the subject to include Lucendra, in case anyone has heard of it by that name. -- View this message in context: http://lucene.472066.n3.nabble.com/Anyone-familiar-with-Solandra-or-Lucendra-tp2927357p2933051.html Sent from the Solr - User mailing list archive at Nabble.com.

What is correct use of HTMLStripCharFilter in Solr 3.1

2011-05-12 Thread nicksnels
Hi, I recently upgraded from Solr 1.3 to Solr 3.1 in order to take advantage of the HTMLStripCharFilter. But it isn't working as I expected. I have a text field that may contain HTML tags. I however would like to store it in Solr without the HTML tags. And retrieve the text field for display and

Re: Document match with no highlight

2011-05-12 Thread Phong Dais
Hi, I read the link provided and I'll need some time to digest what it is saying. Here's my "text" fieldtype. Also, I figured out what value in DOC_TEXT cause this issue to occur. With a DOC_TEXT of (without the quotes): "0176

Re: Support for huge data set?

2011-05-12 Thread Darren Govoni
Ok, thanks. Yeah, I'm in the same boat and want to know what others have done with document numbers that large. I know there is SolrCloud that can federate numerous solr instances and query across them, so I suspect some solution with 100's of M's of docs would require a federation. If anyone has

Re: Support for huge data set?

2011-05-12 Thread atreyu
Oh, my fault. No, I am not using Solr yet - just evaluating it. The current implementation is a combination of Sphinx and Oracle Text, but I have not been involved with any of the integration - I'm more of an outside analyst looking in, but will probably be involved in the integration of any new

Re: Support for huge data set?

2011-05-12 Thread Darren Govoni
I have the same questions. But from your message, I couldn't tell. Are you using Solr now? Or some other indexing server? Darren On Thu, 2011-05-12 at 09:59 -0700, atreyu wrote: > Hi, > > I have about 300 million docs (or 10TB data) which is doubling every 3 > years, give or take. The data mo

Support for huge data set?

2011-05-12 Thread atreyu
Hi, I have about 300 million docs (or 10TB data) which is doubling every 3 years, give or take. The data mostly consists of Oracle records, webpage files (HTML/XML, etc.) and office doc files. There are b/t two and four dozen concurrent users, typically. The indexing server has > 27 GB of RAM,

RE: Document match with no highlight

2011-05-12 Thread Pierre GOSSE
> Since you're using the standard "text" field, this should NOT be you're case. Sorry, for the missing NOT in previous phrase. You should have the same issue given what you said, but still, it sound very similar. Are you sure your fieldtype "text" has nothing special ? a tokenizer or filter th

RE: Document match with no highlight

2011-05-12 Thread Pierre GOSSE
> In fact if I did "3 1 15"~1 I do get snipet also. Strange, I had a very similar problem, but with overlapping tokens. Since you're using the standard "text" field, this should be you're case. Maybe you could have a look at this issue, since it sound very familiar to me : https://issues.apache

Changing the schema

2011-05-12 Thread Brian Lamb
If I change the field type in my schema, do I need to rebuild the entire index? I'm at a point now where it takes over a day to do a full import due to the sheer size of my application and I would prefer not having to reindex just because I want to make a change somewhere. Thanks, Brian Lamb

Re: Document match with no highlight

2011-05-12 Thread Phong Dais
Hi, I use DOC_ID in schema.xml I think this is the default unique id that is used for matching. Someone correct me if I am wrong. P. On Thu, May 12, 2011 at 11:01 AM, Bob Sandiford < bob.sandif...@sirsidynix.com> wrote: > Don't you need to include your unique id field in your 'fl' parameter

Re: Document match with no highlight

2011-05-12 Thread Phong Dais
Hi, The type "text" is the default one that came with the default solr 1.4 install w.o any modifications. If I remove the quotes I do get snipets. In fact if I did "3 1 15"~1 I do get snipet also. Hope that helps. P. On Thu, May 12, 2011 at 9:09 AM, Ahmet Arslan wrote: > > URL: > > > htt

Re: Result docs missing only when shards parameter present in query?

2011-05-12 Thread mrw
Does this seem like it would be a configuration issue, an indexed data issue, or something else? Thanks mrw wrote: > > We have two Solr nodes, each with multiple shards. If we query each shard > directly (no shards parameter), we get the expected results: > > response >lst name="response

TrieIntField for "short" values

2011-05-12 Thread Juan Antonio Farré Basurte
Hello, I'm quite a beginner in solr and have many doubts while trying to learn how everything works. I have only a slight idea on how TrieFields work. The thing is I have an integer value that will always be in the range 0-1000. A short field would be enough for this, but there is no such TrieSho

RE: Document match with no highlight

2011-05-12 Thread Bob Sandiford
Don't you need to include your unique id field in your 'fl' parameter? It will be needed anyways so you can match up the highlight fragments with the result docs once highlighting is working... Bob Sandiford | Lead Software Engineer | SirsiDynix P: 800.288.8020 X6943 | bob.sandif...@sirsidynix.

MoreLikeThis PDF search

2011-05-12 Thread Brian Lamb
Hi all, I've become more and more familiar with the MoreLikeThis handler over the last several months. I'm curious whether it is possible to do a MoreLikeThis search by uploading a PDF? I looked at the ExtractingRequestHandler and that looks like it that is used to process PDF files and the like b

Re: K-Stemmer for Solr 3.1

2011-05-12 Thread Mark
java.lang.AbstractMethodError: org.apache.lucene.analysis.TokenStream.incrementToken()Z Would you mind explaining your modifications? Thanks On 5/11/11 11:14 PM, Bernd Fehling wrote: Am 12.05.2011 02:05, schrieb Mark: It appears that the older version of the Lucid Works KStemmer is incompati

Re: Facet Count Based on Dates

2011-05-12 Thread Jasneet Sabharwal
Or is it possible to use a Group By query in Solr 3.1 like we do in SQL ? On 12-05-2011 19:37, Jasneet Sabharwal wrote: Is it possible to use the features of 3.1 by default for my query ? On 12-05-2011 13:38, Grijesh wrote: You can apply patch for Hierarchical faceting on Solr 3.1 - Thanx:

Re: Facet Count Based on Dates

2011-05-12 Thread Jasneet Sabharwal
Is it possible to use the features of 3.1 by default for my query ? On 12-05-2011 13:38, Grijesh wrote: You can apply patch for Hierarchical faceting on Solr 3.1 - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-Count-Based-on-

Re: Document match with no highlight

2011-05-12 Thread Ahmet Arslan
> URL: > http://localhost:8983/solr/select?indent=on&version=2.2&q=DOC_TEXT%3A%223+1+15%22&fq=&start=0 > &rows=10&fl=DOC_TEXT%2Cscore&qt=standard&wt=standard&debugQuery=on&explainOther=&hl=on&hl.fl=DOC_TEXT&hl.maxAnalyzedChars=-1 > > XML: > > >   >     0 >     19 >     >       >       name="

Re: Coord in queryExplain

2011-05-12 Thread Gabriele Kahlout
You are right! On Thu, May 12, 2011 at 2:54 PM, Ahmet Arslan wrote: > > I'm wondering why the results of coord() are not displayed > > when debugging > > query results, as described in the > > wiki[1< > http://wiki.apache.org/solr/SolrRelevancyFAQ#Why_does_id:archangel_come_before_id:hawkgirl_wh

Re: Coord in queryExplain

2011-05-12 Thread Ahmet Arslan
> I'm wondering why the results of coord() are not displayed > when debugging > query results, as described in the > wiki[1]. > I'd like to see it. > Could someone point to how

Coord in queryExplain

2011-05-12 Thread Gabriele Kahlout
Hello, I'm wondering why the results of coord() are not displayed when debugging query results, as described in the wiki[1]. I'd like to see it. Could someone point to how to m

Re: how to update database record after indexing

2011-05-12 Thread vrpar...@gmail.com
actually every hour some records are inserted into database, so every hour solr indexing will be called with delta import, notes: records and data are very large (in GBs) so each time to find all solr index and update database records process will be slow. is there any eventlistners or snapshoot

Spellcheck: Two dictionaries

2011-05-12 Thread roySolr
Hello, I have 2 fields: what and where. For both of the field i want some spellchecking. I have 2 dictionary's in my config: ws what what spellchecker_what where wh

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread cnyee
Oh I see I was wrong in using (pdf or txt). It worked, but have different meanings altogether from (pdf OR txt). Thanks a lot for your help. Best regards, Yee -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2931347.htm

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread Grijesh
"or" is not any operator "OR", "AND", "NOT" all are caps should be used as operator - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2931318.html Sent from the Solr - User mailin

Re: Document match with no highlight

2011-05-12 Thread Phong Dais
Hi, Ok, here it is. Please note that I had to type everything. I did double and triple check for typos. I do not use term vectors. I also left out the "timing" section. Thanks for all the help. P. URL: http://localhost:8983/solr/select?indent=on&version=2.2&q=DOC_TEXT%3A%223+1+15%22&fq=&start

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread rajini maski
The input parameter assigning to the field "tint" is type string ("or"). It is trying to assign tint=or which is incorrect. So the respective exception has occurred. On Thu, May 12, 2011 at 4:10 PM, cnyee wrote: > The exception says: > > java.lang.NumberFormatExcepton: for input string "or" > >

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread cnyee
The exception says: java.lang.NumberFormatExcepton: for input string "or" The field type is: -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2931282.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Spatial search - SOLR 3.1

2011-05-12 Thread roySolr
Hello David, It's easy to calculate it by myself but it was nice if SOLR returns distance in the response. I can sort on distance and calculate the distance with PHP to show it to the users. -- View this message in context: http://lucene.472066.n3.nabble.com/Spatial-search-SOLR-3-1-tp2927579p293

Re: Facet Count Based on Dates

2011-05-12 Thread Grijesh
You can apply patch for Hierarchical faceting on Solr 3.1 - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-Count-Based-on-Dates-tp2922371p2930924.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread Grijesh
No, OR operator should work for any data type - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2930915.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Facet Count Based on Dates

2011-05-12 Thread Jasneet Sabharwal
But Pivot Faceting is a feature of Solr 4.0 and I am using 3.1 as that is a stable built and cant use a a Nightly Build. The question was: - I have a schema which has field Polarity which is of type "text" and it can have three values 0,1 or -1 and CreatedAt which is of type "date". *How can

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread cnyee
I have another facet that is of type integer and it gave an exception. Is it true that the field has to be of type string or text for the OR expression to work? -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2930863.html

Re: Facet filter: how to specify OR expression?

2011-05-12 Thread cnyee
It works. Many thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-filter-how-to-specify-OR-expression-tp2930570p2930783.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Is it possible to build Solr as a maven project?

2011-05-12 Thread Gabriele Kahlout
On Tue, May 10, 2011 at 3:56 PM, Gabriele Kahlout wrote: > > > On Tue, May 10, 2011 at 3:50 PM, Steven A Rowe wrote: > >> Hi Gabriele, >> >> There are some Maven instructions here (not in Lucene/Solr 3.1 because I >> just wrote the file a couple of days ago): >> < >> http://svn.apache.org/repos/a