RE: solr Invalid Date in Date Math String/Invalid Date String

2011-05-27 Thread Ellery Leung
Thank you Mike. So I understand that now. But what about the other items that have values on both size? They don't work at all. -Original Message- From: Mike Sokolov [mailto:soko...@ifactory.com] Sent: 2011年5月27日 10:23 下午 To: solr-user@lucene.apache.org Cc: alucard001 Subject: Re: sol

LucidWorks source

2011-05-27 Thread Mark
Is LucidWorks source no longer available? In earlier versions their source code was available but after the latest install I can not seem to find it?

Re: K-Stemmer for Solr 3.1

2011-05-27 Thread Mark
Where can one find the KStemmer source for 4.0? On 5/12/11 11:28 PM, Bernd Fehling wrote: I backported a Lucid KStemmer version from solr 4.0 which I found somewhere. Just changed from import org.apache.lucene.analysis.util.CharArraySet; // solr4.0 to import org.apache.lucene.analysis.CharArra

Re: Spellcheck Phrases

2011-05-27 Thread Tanner Postert
are there any updates on this? any third party apps that can make this work as expected? On Wed, Feb 23, 2011 at 12:38 PM, Dyer, James wrote: > Tanner, > > Currently Solr will only make suggestions for words that are not in the > dictionary, unless you specifiy "spellcheck.onlyMorePopular=true".

Custom Scoring relying on another server.

2011-05-27 Thread arian487
I know this question has been asked before but I think my situation is a little different. Basically I need to do custom scores that the traditional function queries simply won't allow me to do. I actually need to hit another server from Java (passing in a bunch of things mostly relying on how to

Re: Nested grouping/field collapsing

2011-05-27 Thread Martijn Laarman
Thanks Mike, I've opened https://issues.apache.org/jira/browse/SOLR-2553 for this. It's exciting to hear a workable implementation might be possible! On Fri, May 27, 2011 at 6:23 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > Can you open a Lucene issue (against the new grouping m

Re: very slow commits and overlapping commits

2011-05-27 Thread Bill Au
I managed to get a thread dump during a slow commit: "resin-tcp-connection-*:5062-129" Id=12721 in RUNNABLE total cpu time=391530.ms user time=390620.ms at java.lang.String.intern(Native Method) at org.apache.lucene.util.SimpleStringInterner.intern(SimpleStringInterner.java:74) at org.apac

problem getting Solr to commit

2011-05-27 Thread David Hill
We verified with the fiddler proxy server that when we use the Java CommonsHttpSolrServer to communicate with our Solr server we are not able to get the client to post a message back to Solr. The result is that we can't force the tail end of a batch job to commit after it has run and we can't

Re: copyField of dates unworking?

2011-05-27 Thread Jack Repenning
On May 27, 2011, at 1:04 AM, Ahmet Arslan wrote: > The letter f should be capital Hah! Well-spotted! Thanks. -==- Jack Repenning Technologist Codesion Business Unit CollabNet, Inc. 8000 Marina Boulevard, Suite 600 Brisbane, California 94005 office: +1 650.228.2562 twitter: http://twitter.com/jr

Re: Pivot with Stats (or Stats with Pivot)

2011-05-27 Thread eduardo
Nobody? Please, help edua...@calandra.com.br 17/05/2011 16:13 Please respond to solr-user@lucene.apache.org To solr-user@lucene.apache.org cc Subject Pivot with Stats (or Stats with Pivot) Hi All, Is it possible to get stats (like Stats Component: min ,max, sum, count, miss

Result Grouping always returns grouped output

2011-05-27 Thread kare...@gmail.com
Hello, I am using the latest nightly build of Solr 4.0 and I would like to use grouping/field collapsing while maintaining compatibility with my current parser. I am using the regular webinterface to test it, the same commands like in the wiki, just with the field names matching my dataset. Grou

Re: Nested grouping/field collapsing

2011-05-27 Thread Michael McCandless
Can you open a Lucene issue (against the new grouping module) for this? I think this is a compelling use case that we should try to support. In theory, with the "general" two-pass grouping collector, this should be possible, but will require three passes, and we also must generalize the 2nd pass

Re: Similarity per field

2011-05-27 Thread Brian Lamb
I'm still not having any luck with this. Has anyone actually gotten this to work so far? I feel like I've followed the directions to the letter but it just doesn't work. Thanks, Brian Lamb On Wed, May 25, 2011 at 2:48 PM, Brian Lamb wrote: > I looked at the patch page and saw the files that wer

Re: Edgengram

2011-05-27 Thread Brian Lamb
For this, I ended up just changing it to string and using "abcdefg*" to match. That seems to work so far. Thanks, Brian Lamb On Wed, May 25, 2011 at 4:53 PM, Brian Lamb wrote: > Hi all, > > I'm running into some confusion with the way edgengram works. I have the > field set up as: > > position

Re: what is the need of setting in solrconfig.xml

2011-05-27 Thread Yury Kats
On 5/27/2011 6:48 AM, Romi wrote: > What is the benifit of setting autocommit in solrconfig.xml. > i read somewhere that these settings control how often pending updates will > be automatically pushed to the index. > does it mean if solr server is running then it automaticaly starts indexing > pro

Re: Nested grouping/field collapsing

2011-05-27 Thread Bill Bell
Did you try pivot? Bill Bell Sent from mobile On May 27, 2011, at 4:13 AM, Martijn Laarman wrote: > Hi, > > I was wondering if this issue had already been raised. > > We currently have a use case where nested field collapsing would be really > helpful > > I.e Collapse on field X then Collap

Nested grouping/field collapsing

2011-05-27 Thread Martijn Laarman
Hi, I was wondering if this issue had already been raised. We currently have a use case where nested field collapsing would be really helpful I.e Collapse on field X then Collapse on Field Y within the groups returned by field X The current behavior of specifying multiple fields seem to be retu

RE: Spellcheck: Two dictionaries

2011-05-27 Thread Dyer, James
You're up against a couple of real limitations with Solr's spell checking. The first limitation is that you can only use 1 dictionary per query. The second limitation is that if a word is in the dictionary it never tries to correct it. This will happen even if you *don't* combine your two di

Re: highlighting in multiValued field

2011-05-27 Thread Jeffrey Chang
Hi Bob, Hmm... I don't think this approach will scale with bigger and more documents :( Thanks for your help though; I think I should take a look at customizing highlight component to achieve this... Thanks, Jeff On May 27, 2011, at 12:24 PM, Bob Sandiford wrote: > The only thing I can th

Re: solr Invalid Date in Date Math String/Invalid Date String

2011-05-27 Thread Mike Sokolov
The "*" endpoint for range terms wasn't implemented yet in 1.4.1 As a workaround, we use very large and very small values. -Mike On 05/27/2011 12:55 AM, alucard001 wrote: Hi all I am using SOLR 1.4.1 (according to solr info), but no matter what date field I use (date or tdate) defined in def

Splitting fields

2011-05-27 Thread Joe Fitzgerald
Hello, I am in an odd position. The application server I use has built-in integration with SOLR. Unfortunately, its native capabilities are fairly limited, specifically, it only supports a standard/pre-defined set of fields which can be indexed. As a result, it has left me kludging how I wor

Re: Comma delemitered words shawn in terms like one word.

2011-05-27 Thread abhay kumar
Thanks I was looking exactly for this. I needed to spli tokens based on comma. On Fri, Jun 18, 2010 at 10:12 PM, Joe Calderon wrote: > set generateWordParts=1 on wordDelimiter or use > PatternTokenizerFactory to split on commas > > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#sol

Re: Nested grouping/field collapsing

2011-05-27 Thread Juan Antonio Farré Basurte
I've found the same issue. As long as I know, the only solution is to create a copy field which combines both-fields values and facet on this field. If one of the fields has a set of distinct values known in advance and its cardinality c is not too big, it isn't a great problem: you can do with c

Re: Nested grouping/field collapsing

2011-05-27 Thread Martijn Laarman
Hi, I was wondering if this issue had already been raised. We currently have a use case where nested field collapsing would be really helpful I.e Collapse on field X then Collapse on Field Y within the groups returned by field X The current behavior of specifying multiple fields seem to be retur

Re: DIH render html entities

2011-05-27 Thread anass talby
Sorry my question was not clear. when I get data from database, some field contains some html special chars, and what i want to do is just convert them automatically. On Fri, May 27, 2011 at 1:00 PM, Gora Mohanty wrote: > On Fri, May 27, 2011 at 3:50 PM, anass talby > wrote: > > Is there any wa

Re: Issue while extracting content from MS Excel 2007 file using TikaEntityProcessor

2011-05-27 Thread Gora Mohanty
On Thu, May 26, 2011 at 6:52 PM, Rahul Warawdekar wrote: > Hi All, > > I am using Solr 3.1 for one of our search based applications. > We are using DIH to index our data and TikaEntityProcessor to index > attachments. > Currently we are running into an issue while extracting content from one of >

Re: Documents update

2011-05-27 Thread Denis Kuzmenok
I'm using 3.1 now. Indexing lasts for a few hours, and have big plain size. Getting all documents would be rather slow :( > Not with 1.4, but apparently there is a patch for trunk. Not > sure if it is in 3.1. > If you are on 1.4, you could first query Solr to get the data > for the document

Re: Documents update

2011-05-27 Thread Gora Mohanty
2011/5/27 Denis Kuzmenok : > Hi. > > I  have  and  indexed  database  which  is indexed few times a day and > contain  tinyint  flag  (like is_enabled, is_active, etc), and content > isn't changed too often, but flags are. > So  if i index via post.jar only flags then entire document is deleted > a

Documents update

2011-05-27 Thread Denis Kuzmenok
Hi. I have and indexed database which is indexed few times a day and contain tinyint flag (like is_enabled, is_active, etc), and content isn't changed too often, but flags are. So if i index via post.jar only flags then entire document is deleted and there's only unique key and flags. Is

Re: DIH render html entities

2011-05-27 Thread Gora Mohanty
On Fri, May 27, 2011 at 3:50 PM, anass talby wrote: > Is there any way to render html entities in DIH for a specific field? [...] This does not make too much sense: What do you mean by "rendering HTML entities". DIH just indexes, so where would it render HTML to, even if it could? Please take a

what is the need of setting in solrconfig.xml

2011-05-27 Thread Romi
What is the benifit of setting autocommit in solrconfig.xml. i read somewhere that these settings control how often pending updates will be automatically pushed to the index. does it mean if solr server is running then it automaticaly starts indexing process if it finds any updates in database???

DIH render html entities

2011-05-27 Thread anass talby
Is there any way to render html entities in DIH for a specific field? Thanks -- Anass

what is the need of setting writeLockTimeout and commitLockTimeout in solrconfig.xml

2011-05-27 Thread Romi
I wanted to have the basic idea of setting these parameters in solrconfig.xml what actually writeLockTimeout and commitLockTimeout indicates here. - Thanks & Regards Romi -- View this message in context: http://lucene.472066.n3.nabble.com/what-is-the-need-of-setting-writeLockTimeout-and-c

Re: How to disable QueryElevationComponent

2011-05-27 Thread Romi
i removed string elevate.xml from solrconfig.xml but it is showing the following exception: java.lang.NullPointerException at org.apache.solr.handler.dataimport.DataImporter.identifyPk(DataImporter.java:152) at org.apache.solr.handler.dataimport.DataImporter.(DataI

Re: How to disable QueryElevationComponent

2011-05-27 Thread Markus Jelsma
Remove the component configuration from your solrconfig. > Hi, in my indexed document i do not want a uniqueKey field, but when i do > not give any uniqueKey in schema.xml then it shows an exception > org.apache.solr.common.SolrException: QueryElevationComponent requires the > schema to have a uni

test

2011-05-27 Thread Romi
test - Thanks & Regards Romi -- View this message in context: http://lucene.472066.n3.nabble.com/test-tp2992199p2992199.html Sent from the Solr - User mailing list archive at Nabble.com.

How to disable QueryElevationComponent

2011-05-27 Thread Romi
Hi, in my indexed document i do not want a uniqueKey field, but when i do not give any uniqueKey in schema.xml then it shows an exception org.apache.solr.common.SolrException: QueryElevationComponent requires the schema to have a uniqueKeyField. it means QueryElevationComponent requires a uniqueKey

Re: Returning documents using multi-valued field

2011-05-27 Thread Kurt Sultana
Thanks for you answer James :) For guys who would meet up with this problem, http://markmail.org/thread/xce4qyzs5367yplo also speaks about this, and reaches James' conclusion too. On Thu, May 26, 2011 at 10:19 PM, Dyer, James wrote: > This is a limitation of Lucene/Solr in that there is no way t

RE: HTMLStripTransformer will remove the content in XML??

2011-05-27 Thread Ellery Leung
Got it. Actually I use solr.MappingCharFilterFactory to replace the to empty first, and use HTMLStripCharFilterFactory to get "hello" and "solr". For future reference, here is part of schema.xml ... In mappings.txt (2 lines) "" => "" Restart Solr

TermFreqVector Problem

2011-05-27 Thread deniz
Hi all here is what I have been trying and the problem I am trying to see how many times a single word appears in a field. Basically, I have a field called universal, and lets say the field is like this: "car house road age sex school education education tree garden" and I am searching useing t

frange vs TrieRange

2011-05-27 Thread Juan Antonio Farré Basurte
Hello, I have to perform range queries agains a date field. It is a TrieDateField, and I'm already using it for sorting. Hence, there will be already en entry in the FieldCache for it. According to: http://www.lucidimagination.com/blog/2009/07/06/ranges-over-functions-in-solr-14/ frange queries

Re: HTMLStripTransformer will remove the content in XML??

2011-05-27 Thread bryan rasmussen
I would expect that it doesn't understand CDATA and thinks of everything between < and > as a 'tag'. Best Regards, Bryan Rasmussen On Fri, May 27, 2011 at 9:41 AM, Ellery Leung wrote: > I have an XML string like this: > > > > encoding="UTF-8"?> > > > > By using HTMLStripTransformer, I expect to

RE: Spellcheck: Two dictionaries

2011-05-27 Thread roySolr
That uber dictionary is not what i want. I get also suggestions form the where in the what. An example: what where chelseaLondon Soccerclub Bondon London When i type "soccerclub london" i want the suggestion from the wha

Re: copyField of dates unworking?

2011-05-27 Thread Ahmet Arslan
> >  The letter f should be capital. copyfield =>copyField

Re: Facet Query

2011-05-27 Thread Juan Antonio Farré Basurte
Are you talking about a facet query or a facet field? If it's a facet query, I don't get what's going on. If it's a facet field... well, if it's a fixed set of words you're interested in, filter the query to only those words and you'll get counts only for them. If you just need to filter out comm

Re: Problem with spellchecking, dont want multiple request to SOLR

2011-05-27 Thread roySolr
mm ok. I configure 2 spellcheckers: spell_what spell_what true spellchecker_what spell_where spell_where true spellchecker_where H

Re: Facet Query

2011-05-27 Thread Chandan Tamrakar
which analyzer do you use for indexing ? You could exclude those stop words during indexing http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters On Fri, May 27, 2011 at 1:36 PM, Jasneet Sabharwal < jasneet.sabhar...@ngicorporation.com> wrote: > Hi > > When I do a facet query on my data

Facet Query

2011-05-27 Thread Jasneet Sabharwal
Hi When I do a facet query on my data, it shows me a list of all the words present in my database with their count. Is it possible to not get the results of common words like a, an, the, http and so one but only get the count of stuff we need like microsoft, ipad, solr, etc. -- Thanx& Regar

Re: Query regarding Solr-2242 patch for getting distinct facet counts.

2011-05-27 Thread rajini maski
No such issues . Successfully integrated with 1.4.1 and it works across single index. for f.2.facet.numFacetTerms=1 parameter it will give the distinct count result for f.2.facet.numFacetTerms=2 parameter it will give counts as well as results for facets. But this is working only across singl

HTMLStripTransformer will remove the content in XML??

2011-05-27 Thread Ellery Leung
I have an XML string like this: By using HTMLStripTransformer, I expect to get 'hello,solr'. But actual this transformer will remove ALL THE TEXT INSIDE! Did I do something silly, or is it a bug? Thank you

Re: Query regarding Solr-2242 patch for getting distinct facet counts.

2011-05-27 Thread Bill Bell
I am pretty sure it does not yet support distributed shards.. But the patch was written for 4.0... So there might be issues with running it on 1.4.1. On 5/26/11 11:08 PM, "rajini maski" wrote: > The patch solr 2242 for getting count of distinct facet terms doesn't >work for distributedProce