Re: Synonym format not working

2008-10-16 Thread prerna07
Actual synonym : ccc => 1,2 ccc=>3 The result when i added &dubugQuery=true is: - - 0 15 - 10 0 on ccc true 2.2 - ccc ccc MultiPhraseQuery(all:" (1 ) (2 ccc ) 3") all:" (1 ) (2 ccc ) 3" OldLuceneQParse

Re: Synonym format not working

2008-10-16 Thread Otis Gospodnetic
I can't see the problem at the moment. What do you see when you use &debugQuery=true in the URL? Do you see the query that includes synonyms? Can you give us the actual query and actual synonyms? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message

Re: Reduction of open files

2008-10-16 Thread Otis Gospodnetic
Out of curiosity, how many files are held open when you hit the limit? What does ulimit show? And what does lsof show? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Paul deGrandis <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org >

Re: Different XML format for multi-valued fields?

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
The component that writes out the values do not know if it is multivalued or not. So if it finds only a single value it writes it out as such On Thu, Oct 16, 2008 at 10:52 PM, oleg_gnatovskiy <[EMAIL PROTECTED]> wrote: > > Hello. I have an index built in Solr with several multi-value fields. When

Re: RegexTransformer debugging (DIH)

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
If it is a normal exception it is logged with the number of document where it failed and you can put it on debugger with start=&rows=1 We do not catch a throwable or Error so it gets slipped through. if you are adventurous enough wrap the RegexTranformer with your own and apply that say transform

Re: dataimport, both splitBy and dateTimeFormat

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
Thanks David, I have updated the wiki documentation http://wiki.apache.org/solr/DataImportHandler#transformer The default transformers do not have any special privilege it is like any normal user provided transformer.We just identified some commonly found usecases and added transformers for that.

Re: Tree Faceting Component

2008-10-16 Thread Jeremy Hinegardner
After a bit more investigating, it appears that any facet tree where the first item is numerical or boolean or some non-textual type does not produce any secondary facets. This includes sint, sfloat, boolean and such. For instance, on the sample index: facet.tree=sku,cat => works facet.tree=

Re: error with delta import

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
the last-index_time is available only from second time onwards that is . It expects a full-import to be done first It knows that by the presence of dataimport.properties in the config directory. Did you check if it is present? On Thu, Oct 16, 2008 at 5:33 PM, Florian Aumeier <[EMAIL PROTECTED]>

Synonym format not working

2008-10-16 Thread prerna07
Hi, I am facing issue in synonym search of solr. The synonym.txt contain the format: ccc => 1,2,ccc ccc => 3 I am not getting any search result for ccc. I have created indexes with string value. Do i need to change anything in schema .xml ? String tag from Schema.xml :

RE: error with delta import

2008-10-16 Thread Lance Norskog
If you make a database view with the query, it is easy to examine the data you want to index. Then, your solr import query would just pull the view. The Solr setup file is much simpler this way. -Original Message- From: Noble Paul നോബിള്‍ नोब्ळ् [mailto:[EMAIL PROTECTED] Sent: Wednesda

Re: Reduction of open files

2008-10-16 Thread Paul deGrandis
My biggest concern is why do the remaining files stay open even if my mergeFactor is 2. I would expect to see one or two segment files and one or two sets of accompanying file (.nrm, .frq, etc), based on the documentation. Paul On Thu, Oct 16, 2008 at 4:23 PM, Paul deGrandis <[EMAIL PROTECTED]>

Re: Reduction of open files

2008-10-16 Thread Paul deGrandis
I currently am not. The document collection is highly volatile (3000 modifications a minute) and from reading thought it would be too much of a performance penalty but never tested it. What behavior in terms of file creation and open fd is seen when useCompoundFile is set to true? Paul On Thu,

Re: Reduction of open files

2008-10-16 Thread Grant Ingersoll
Are you using the compound file format? -Grant On Oct 16, 2008, at 3:28 PM, Paul deGrandis wrote: I have been working with SOLR for a few months now. According to some documentation I read, segment files only have one set of all the other lingustic module type of stuff (normalization, frequen

Reduction of open files

2008-10-16 Thread Paul deGrandis
I have been working with SOLR for a few months now. According to some documentation I read, segment files only have one set of all the other lingustic module type of stuff (normalization, frequency), is there a way to remove/reduce the files not associated with a segment besides optimizing the ind

Re: dataimport, both splitBy and dateTimeFormat

2008-10-16 Thread David Smiley @MITRE.org
The wiki didn't mention I can specify multiple transformers. BTW, it's "transformer" (singular), not "transformers". I did mean both NFT and DFT because I was speaking of the general case, not just mine in particular. I thought that the built-in transformers were always in-effect and so I expec

Different XML format for multi-valued fields?

2008-10-16 Thread oleg_gnatovskiy
Hello. I have an index built in Solr with several multi-value fields. When the multi-value field has only one value for a document, the XML returned looks like this: 5693 However, when there are multiple values for the field, the XMl looks like this: arr name="someIds"> 11199 1722 Is there a

Re: Tree Faceting Component

2008-10-16 Thread Jeremy Hinegardner
Erik, After some more experiments, I can get it to perform incorrectly using the sample solr data. The example query from SOLR-792 ticket: http://localhost:8983/solr/select?q=*:*&rows=0&facet=on&facet.field=cat&facet.tree=cat,inStock&wt=json&indent=on Make a few altertions to the query: 1) s

RegexTransformer debugging (DIH)

2008-10-16 Thread Jon Baer
Is there a way to prevent this from occurring (or a way to nail down the doc which is causing it?): INFO: [news] webapp=/solr path=/admin/dataimport params={command=status} status=0 QTime=0 Exception in thread "Thread-14" java.lang.StackOverflowError at java.util.regex.Pattern$Single

Re: updating documents in solr 1.3.0

2008-10-16 Thread Bill Au
This is being worked on for Solr 1.4: https://issues.apache.org/jira/browse/SOLR-139 Bill On Wed, Oct 15, 2008 at 7:47 PM, Walter Underwood <[EMAIL PROTECTED]>wrote: > Neither Solr no Lucene support partial updates. "Update" means > "add or replace". --wunder > > On 10/15/08 4:23 PM, "[EMAIL PR

Re: How to retrieve all field names of index of one type

2008-10-16 Thread Otis Gospodnetic
Hi, I don't have the sources handy, but look at the Luke request handler in Solr sources and you'll see how it can be done. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: prerna07 <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Se

Re: Advice on analysis/filtering?

2008-10-16 Thread Andrzej Bialecki
Jarek Zgoda wrote: Wiadomość napisana w dniu 2008-10-16, o godz. 16:21, przez Grant Ingersoll: I'm trying to create a search facility for documents in "broken" Polish (by broken I mean "not language rules compliant"), Can you explain what you mean here a bit more? I don't know Polish, Hi

Re: How Synonyms work in Solr

2008-10-16 Thread Otis Gospodnetic
Hi, It looks like you have not seen a pretty detailed page on Synonyms on the Solr wiki. Have a look, I think you'll find answers to your questions there. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: payalsharma <[EMAIL PROTECTED]> >

Re: snapshooter and spellchecker

2008-10-16 Thread Otis Gospodnetic
Geoff, maybe this will help: https://issues.apache.org/jira/browse/SOLR-433 Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Geoffrey Young <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Thursday, October 16, 2008 10:34:40 AM

Re[2]: How to change a port?

2008-10-16 Thread Aleksey Gogolev
o solr examples simultaneously. >> >> -- >> Aleksey Gogolev >> developer, >> dev.co.ua >> Aleksey >> RM> __ NOD32 3528 (20081016) Information __ RM> This message was checked by NOD32 antivirus system. RM> http://www.eset.com -- Aleksey Gogolev developer, dev.co.ua Aleksey mailto:[EMAIL PROTECTED]

Re: Advice on analysis/filtering?

2008-10-16 Thread Jarek Zgoda
Wiadomość napisana w dniu 2008-10-16, o godz. 16:21, przez Grant Ingersoll: I'm trying to create a search facility for documents in "broken" Polish (by broken I mean "not language rules compliant"), Can you explain what you mean here a bit more? I don't know Polish, but most spoken langu

Re: How to change a port?

2008-10-16 Thread Ryan McKinley
that will depend on your servlet container. (jetty, resin, tomcat, etc...) If you are running jetty from the example, you can change the port by adding -Djetty.port=1234 to the command line. The port is configured in example/etc/jetty.xml the relevant line is: > ryan On Oct 16, 2

snapshooter and spellchecker

2008-10-16 Thread Geoffrey Young
hi all :) I was surprised to find that snapshooter didn't account for the spellcheck dictionary. but then again, since you can call it whatever you want I guess it couldn't. so, how are people distributing their dictionaries across their slaves? since it takes so long to generate, I can't see i

How to change a port?

2008-10-16 Thread Aleksey Gogolev
Hello. Is there a way to change the port (8983) of solr example? I want to run two solr examples simultaneously. -- Aleksey Gogolev developer, dev.co.ua Aleksey

Re: Advice on analysis/filtering?

2008-10-16 Thread Grant Ingersoll
On Oct 16, 2008, at 3:07 AM, Jarek Zgoda wrote: Hello, group. I'm trying to create a search facility for documents in "broken" Polish (by broken I mean "not language rules compliant"), Can you explain what you mean here a bit more? I don't know Polish, but most spoken languages can't be

Re: Advice on analysis/filtering?

2008-10-16 Thread Erick Erickson
You're welcome. I should have pointed out that I was responding mostly to the "false hits are not acceptable" portion, which I don't think is achievable Best Erick 2008/10/16 Jarek Zgoda <[EMAIL PROTECTED]> > Wiadomość napisana w dniu 2008-10-16, o godz. 15:54, przez Erick Erickson: > > Wel

Re: Advice on analysis/filtering?

2008-10-16 Thread Jarek Zgoda
Wiadomość napisana w dniu 2008-10-16, o godz. 15:54, przez Erick Erickson: Well, let me see. Your customers are telling you, in essence, "for any random input, you cannot return false positives". Which is nonsense, so I'd say you need to negotiate with your customers. I flat guarantee that, fo

How Synonyms work in Solr

2008-10-16 Thread payalsharma
Hi, Please explain that how the below mentioned synonyms patterns work in Solr Search as there exists several seperators for synonym patterns: 1. #Explicit mappings match any token sequence on the LHS of "=>" #and replace with all alternatives on the RHS. These types of mappings #ignore the ex

Re: Advice on analysis/filtering?

2008-10-16 Thread Erick Erickson
Well, let me see. Your customers are telling you, in essence, "for any random input, you cannot return false positives". Which is nonsense, so I'd say you need to negotiate with your customers. I flat guarantee that, for any algorithm you try, you can write a counter-example in, oh, 15 seconds or s

Re: dataimport, both splitBy and dateTimeFormat

2008-10-16 Thread Shalin Shekhar Mangar
Hi David, I think you meant RegexTransformer instead of NumberFormatTransformer. Anyhow, the order in which the transformers are applied is the same as the order in which you specify them. So make sure your entity has transformers="RegexTransformer,DateFormatTransformer". On Thu, Oct 16, 2008 at

dataimport, both splitBy and dateTimeFormat

2008-10-16 Thread David Smiley @MITRE.org
I'm trying out the dataimport capability. I have a column that is a series of dates separated by spaces like so: "1996-00-00 1996-04-00" And I'm trying to import it like so: However this fails and the stack trace suggests it is first trying to apply the dateTimeFormat before splitBy. I think t

Re: snapcleaner >> problem solr 1.3

2008-10-16 Thread Chris Haggstrom
On Oct 16, 2008, at 4:29 AM, sunnyfr wrote: still nothing changed : It looks like it worked better to me, in that it resulted in a valid find command for any snapshots with an -mtime of +1: ++ find /data/solr/video/data -maxdepth 1 -name 'snapshot.*' -mtime +1 -print instead of showi

Re: error with delta import

2008-10-16 Thread Florian Aumeier
Noble Paul നോബിള്‍ नोब्ळ् schrieb: Well, when doing the way you described below (full-import with the delta query), the '${dataimporter.last_index_time}' timestamp is empty: I guess this was fixed post 1.3 . probably you can take dataimporthandler.jar from a nightly build (you may also need

Re: snapcleaner >> problem solr 1.3

2008-10-16 Thread sunnyfr
still nothing changed : [EMAIL PROTECTED]:/data/solr/video# ./bin/snapcleaner -V -D 1 + [[ -z 1 ]] + fixUser -V -D 1 + [[ -z '' ]] ++ whoami + user=root ++ whoami + [[ root != root ]] ++ who -m ++ cut '-d ' -f1 ++ sed '-es/^.*!//' + oldwhoami=root + [[ root == '' ]] + [[ -z /data/solr/video/data

Re: snapcleaner >> problem solr 1.3

2008-10-16 Thread Chris Haggstrom
On Oct 16, 2008, at 3:10 AM, sunnyfr wrote: I've a wierd problem when I try to fire snapcleaner manually : Already : is it correct : [EMAIL PROTECTED]:/data/solr/video# ./bin/snapcleaner -V -D-1 To remove every snapshot older than one day. You need to change "-D -1" to "-D 1". Otherwise, y

Re: Solr search not displaying all the indexed values.

2008-10-16 Thread con
Yes. something similar to :

Re: Solr search not displaying all the indexed values.

2008-10-16 Thread con
Yes. something similar to : But the searching will not give all the results even if there is only one result. whereas indexing is fine. Thanks con Noble Paul നോബിള്‍ नोब्ळ् wrote: > > do you have 2 queries in 2 different entities? >

snapcleaner >> problem solr 1.3

2008-10-16 Thread sunnyfr
Hi guys, I've a wierd problem when I try to fire snapcleaner manually : Already : is it correct : [EMAIL PROTECTED]:/data/solr/video# ./bin/snapcleaner -V -D-1 To remove every snapshot older than one day. It doesn't remove older than one day obviously and debugger show me : + [[ -z -1 ]] + fix

Re: Solr search not displaying all the indexed values.

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
do you have 2 queries in 2 different entities? On Thu, Oct 16, 2008 at 3:17 PM, con <[EMAIL PROTECTED]> wrote: > > I have two queries in my data-config.xml which takes values from multiple > tables, like: > select * from EMPLOYEE, CUSTOMER where EMPLOYEE.prod_id= CUSTOMER.prod_id. > > When i do a

Solr search not displaying all the indexed values.

2008-10-16 Thread con
I have two queries in my data-config.xml which takes values from multiple tables, like: select * from EMPLOYEE, CUSTOMER where EMPLOYEE.prod_id= CUSTOMER.prod_id. When i do a full-import it is indexing all the rows as expected. But when i search it with a *:* , it is not displaying all the value

Re: error with delta import

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Thu, Oct 16, 2008 at 2:08 PM, Florian Aumeier <[EMAIL PROTECTED]> wrote: > Noble Paul നോബിള്‍ नोब्ळ् schrieb: >> >> The delta implementation is a bit fragile in DIH for complex queries >> >> > > that's too bad. It's a nice interface and less complex to configure than to > go the XML /update way.

Re: error with delta import

2008-10-16 Thread Florian Aumeier
Noble Paul നോബിള്‍ नोब्ळ् schrieb: The delta implementation is a bit fragile in DIH for complex queries that's too bad. It's a nice interface and less complex to configure than to go the XML /update way. Well, when doing the way you described below (full-import with the delta query), the

Advice on analysis/filtering?

2008-10-16 Thread Jarek Zgoda
Hello, group. I'm trying to create a search facility for documents in "broken" Polish (by broken I mean "not language rules compliant"), searchable by terms in "broken" Polish, but broken in many other ways than documents. See this example: document text: "włatcy móch" (in proper Polish t