Re: Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread Shalin Shekhar Mangar
Hi Mike, Just curious to know the use-case here. Why do you want to limit updates to 100 instead of importing all documents? On Tue, Jun 24, 2008 at 10:23 AM, mike segv <[EMAIL PROTECTED]> wrote: > > That fixed it. > > If I'm inserting millions of documents, how do I control docs/update? E.g. >

Re: Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread Noble Paul നോബിള്‍ नोब्ळ्
Just extend XPathEntityProcessor and override nextRow() to return null after 100 rows. Use it as your processor. --Noble On Tue, Jun 24, 2008 at 10:45 AM, Noble Paul നോബിള്‍ नोब्ळ् <[EMAIL PROTECTED]> wrote: > Just extend XPathEntityProcessor override nextRow() after 100 . Use > it as your processor > retu
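
The suggestion above amounts to wrapping the row source so that it reports "no more rows" once a cap is reached. The following is only a sketch of that logic in Python, not the Solr API: the real change would be a Java subclass of XPathEntityProcessor overriding nextRow(), and the names RowSource, CappedRowSource, next_row and drain are hypothetical stand-ins.

```python
from typing import Dict, Iterable, Iterator, List, Optional


class RowSource:
    """Hypothetical stand-in for an entity processor.

    next_row() returns one row per call, or None when the input is exhausted.
    """

    def __init__(self, rows: Iterable[Dict[str, str]]):
        self._rows: Iterator[Dict[str, str]] = iter(rows)

    def next_row(self) -> Optional[Dict[str, str]]:
        return next(self._rows, None)


class CappedRowSource(RowSource):
    """Returns None early once `limit` rows have been handed out."""

    def __init__(self, rows: Iterable[Dict[str, str]], limit: int = 100):
        super().__init__(rows)
        self._limit = limit
        self._count = 0

    def next_row(self) -> Optional[Dict[str, str]]:
        if self._count >= self._limit:
            return None  # cap reached: signal end of input
        row = super().next_row()
        if row is not None:
            self._count += 1
        return row


def drain(source: RowSource) -> List[Dict[str, str]]:
    """Collect rows until the source signals the end with None."""
    rows = []
    while (row := source.next_row()) is not None:
        rows.append(row)
    return rows
```

With a cap of 100, draining a source of 250 rows yields 100 rows, and a source with fewer rows than the cap is returned unchanged.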

Re: Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread Noble Paul നോബിള്‍ नोब्ळ्
Just extend XPathEntityProcessor override nextRow() after 100 . Use it as your processor return null; On Tue, Jun 24, 2008 at 10:23 AM, mike segv <[EMAIL PROTECTED]> wrote: > > That fixed it. > > If I'm inserting millions of documents, how do I control docs/update? E.g. > if there are 50K docs

Re: Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread mike segv
That fixed it. If I'm inserting millions of documents, how do I control docs/update? E.g. if there are 50K docs per file, I'm thinking that I should probably code up my own DataSource that allows me to stipulate docs/update. Like say, 100 instead of 50K. Does this make sense? Mike Noble Pau

Re: Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread Noble Paul നോബിള്‍ नोब्ळ्
Hi, you have not registered any datasources. The second entity needs a datasource. Remove the dataSource="null" and add a name for the second entity (good practice). There is no need for a baseDir attribute on the second entity. See the modified xml added below --Noble On Tue, Jun 24, 2008

Re: Cost of having fieldTypes defined but not used

2008-06-23 Thread Mike Klaas
On 23-Jun-08, at 7:05 PM, Norberto Meijome wrote: Hi all, I'm curious: what is the cost (memory / processing time at load? performance hit?) of having several unused fieldTypes defined in schema.xml? Affects startup time only, likely non-measurable. -Mike

Cost of having fieldTypes defined but not used

2008-06-23 Thread Norberto Meijome
Hi all, I'm curious: what is the cost (memory / processing time at load? performance hit?) of having several unused fieldTypes defined in schema.xml? cheers, B _ {Beto|Norberto|Numard} Meijome "Egotism is the anesthetic that dulls the pain of stupidity." Frank Leahy

Attempting dataimport using FileListEntityProcessor

2008-06-23 Thread mike segv
I'm trying to use the FileListEntityProcessor to add some XML documents to a Solr index. I'm running a nightly version of solr-1.3 with SOLR-469 and SOLR-563. I've been able to successfully run the slashdot httpDataSource example. My data-config.xml file loads without errors. When I attempt the

Re: adding documents with json post

2008-06-23 Thread Cam Bazz
thanks a bunch. On Mon, Jun 23, 2008 at 4:39 AM, Otis Gospodnetic < [EMAIL PROTECTED]> wrote: > Hi Cam, > > Yes, the various other formats are for responses only, as far as I'm aware. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > - Original Message > > F

Re: n-Gram, only works with queries of 2 letters

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 05:33:49 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi, > > > When you add &debugQuery=true to the request, what does your query look like > after parsing? > > BTW, I've tested same data + similar config using EdgeNGramTokenizer and this works properly -

Re: Wildcard search question

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 14:23:14 -0700 Jon Drukman <[EMAIL PROTECTED]> wrote: > ok well let's say that i can live without john/jon in the short term. > what i really need today is a case insensitive wildcard search with > literal matching (no fancy stemming. bobby is bobby, not bobbi.) > > what ar

Re: SpellCheckComponent: No file-based suggestions + Location issue

2008-06-23 Thread Grant Ingersoll
Hmm, if I add: solr.FileBasedSpellChecker file spellings.txt UTF-8 ./spellcheckerFile to the example's declaration, with spellings.txt in the working directory, it works for me. Did you see any exceptions in your log? On Jun 21, 2008, at 5:28 PM, Ronal

Re: SpellCheckComponent: No file-based suggestions + Location issue

2008-06-23 Thread Grant Ingersoll
Inline below On Jun 21, 2008, at 5:28 PM, Ronald K. Braun wrote: 2. I'm having difficulty getting the FileBasedSpellChecker to work -- probably something silly on my part but given the above I thought there might be container differences that haven't been vetted. Here is my config:

Re: n-Gram, only works with queries of 2 letters

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 05:33:49 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi, > > > When you add &debugQuery=true to the request, what does your query look like > after parsing? Hi Otis, sorry, i should have sent this before too. With minGramSize = 3 , same data, clean server star
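
To make the minGramSize discussion concrete, here is a minimal sketch of what edge n-grams of a single token look like. It is an illustration only, not Lucene's EdgeNGramTokenizer; the function name edge_ngrams and the max_gram parameter are assumptions made for this example.

```python
from typing import List


def edge_ngrams(token: str, min_gram: int, max_gram: int) -> List[str]:
    """Return the prefixes of `token` whose length lies in [min_gram, max_gram]."""
    upper = min(max_gram, len(token))
    return [token[:n] for n in range(min_gram, upper + 1)]


# With a minimum gram size of 3, only prefixes of length 3 or more are produced.
grams = edge_ngrams("bobby", min_gram=3, max_gram=5)

# A token shorter than the minimum gram size produces no grams at all.
short = edge_ngrams("bo", min_gram=3, max_gram=5)
```

Here grams is ["bob", "bobb", "bobby"] and short is an empty list, so comparing the output of debugQuery=true with the grams stored at index time shows whether the two sides can line up.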

Re: Wildcard search question

2008-06-23 Thread Jon Drukman
Erik Hatcher wrote: No, because the original data is Bobby Gaza, so Bobby* would match, but not bobby*. "string" type (in the example schema, to be clear) does effectively no analysis, leaving the original string indexed as-is, case and all. [...] stemming and wildcard term queries aren't

Re: Wildcard search question

2008-06-23 Thread Erik Hatcher
On Jun 23, 2008, at 4:45 PM, Jon Drukman wrote: Erik Hatcher wrote: Jon, You provided a lot of nice details, thanks for helping us help you :) The one missing piece is the definition of the "text" field type. In Solr's _example_ schema, "bobby" gets analyzed (stemmed) to "bobbi"[1]. Whe

Re: Wildcard search question

2008-06-23 Thread Jon Drukman
Erik Hatcher wrote: Jon, You provided a lot of nice details, thanks for helping us help you :) The one missing piece is the definition of the "text" field type. In Solr's _example_ schema, "bobby" gets analyzed (stemmed) to "bobbi"[1]. When you query for bobby*, the query parser is not ru

Re: Wildcard search question

2008-06-23 Thread Erik Hatcher
Jon, You provided a lot of nice details, thanks for helping us help you :) The one missing piece is the definition of the "text" field type. In Solr's _example_ schema, "bobby" gets analyzed (stemmed) to "bobbi"[1]. When you query for bobby*, the query parser is not running an analyzer
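
The mismatch Erik describes can be reproduced with a toy model: terms are stemmed at index time, but a wildcard (prefix) query is matched against the stored terms without analysis. This is a sketch under loud assumptions: toy_stem is a one-rule stand-in (lowercase, trailing "y" becomes "i"), not the stemmer used by Solr's example schema, and build_index and prefix_search are hypothetical helpers.

```python
from typing import Iterable, List, Set


def toy_stem(term: str) -> str:
    """Toy analysis step: lowercase, then turn a trailing 'y' into 'i'."""
    term = term.lower()
    return term[:-1] + "i" if term.endswith("y") else term


def build_index(values: Iterable[str]) -> Set[str]:
    """Index every whitespace-separated word in its analyzed (stemmed) form."""
    return {toy_stem(word) for value in values for word in value.split()}


def prefix_search(index: Set[str], raw_prefix: str) -> List[str]:
    """Wildcard-style lookup: the prefix is used verbatim, with no analysis."""
    return sorted(term for term in index if term.startswith(raw_prefix))


index = build_index(["Bobby Gaza", "Steve Ballmer", "Steve Jobs"])

term_hit = toy_stem("bobby") in index           # analyzed query term: matches "bobbi"
wildcard_bobby = prefix_search(index, "bobby")  # unanalyzed prefix: no indexed term starts with it
wildcard_steve = prefix_search(index, "steve")  # "steve" is unchanged by the toy stemmer
```

In this model q=bobby finds the record because the query term is stemmed to "bobbi", bobby* finds nothing because the index holds "bobbi" rather than "bobby", and steve* still matches.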

Wildcard search question

2008-06-23 Thread Jon Drukman
When I search with q=bobby I get the following record: 2008-06-23T07:06:40Z http://farm1.static.flickr.com/117/... 9 Bobby Gaza [EMAIL PROTECTED] When I search with bobby* I get nothing. When I search with steve* I get "Steve Ballmer" and "Steve Jobs"... What's going on?

Re: never desallocate RAM...during search

2008-06-23 Thread Erik Hatcher
On Jun 23, 2008, at 8:16 AM, <[EMAIL PROTECTED]> wrote: I was doing something similar to your solution to have better searching times. I downloaded your patch but I have a problem in one class. I'm not sure if I'm doing something wrong, but if I want to compile the project I must change in In

"document commit" possible?

2008-06-23 Thread Brian Whitman
Could the commit operation be adapted to just have the searchers aware of new stored content in a particular document? e.g. With the understanding that queries for newly indexed fields in this document will not return this newly added document, but a query for the document by its id will r

Re: n-Gram, only works with queries of 2 letters

2008-06-23 Thread Otis Gospodnetic
Hi, When you add &debugQuery=true to the request, what does your query look like after parsing? Otis-- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Norberto Meijome <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, June 23

RE: never desallocate RAM...during search

2008-06-23 Thread r.nieto
Hi Nicolas, I was doing something similar to your solution to have better searching times. I downloaded your patch but I have a problem in one class. I'm not sure if I'm doing something wrong, but if I want to compile the project I must change in IndexSchema: //private Similarity similarity;

Re: Can I specify the default operator at query time ?

2008-06-23 Thread Nikhil Chhaochharia
Thanks, I had tried mm (through SolrJ) some time ago but it did not seem to work. I will look into it more carefully - the problem may be related to SolrJ. Nikhil - Original Message From: Erik Hatcher <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Monday, 23 June, 2008 5:15:

Re: Can I specify the default operator at query time ?

2008-06-23 Thread Erik Hatcher
dismax is a different sort of parser, where AND/OR doesn't really make sense exactly. The mm (minimum match) parameter for dismax can be used to set things to be fully OR (mm=0) or fully AND (mm=100%) Erik On Jun 23, 2008, at 6:27 AM, Nikhil Chhaochharia wrote: I think this works
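
As a rough illustration of the mm values mentioned above, the sketch below builds the query-string portion of a dismax request. Only the mm parameter and its two values come from the message; the helper name dismax_params is hypothetical, and how the dismax handler is selected is left out because it depends on the local solrconfig.xml.

```python
from urllib.parse import urlencode


def dismax_params(q: str, mm: str) -> str:
    """Encode a user query plus a minimum-match setting as a query string."""
    return urlencode({"q": q, "mm": mm})


all_terms = dismax_params("solr lucene", "100%")  # every term required, AND-like
any_term = dismax_params("solr lucene", "0")      # no term required, OR-like
```

Note that the percent sign has to be URL-encoded: all_terms is "q=solr+lucene&mm=100%25". An unencoded "%" in the request is one thing worth checking when mm appears to have no effect.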

Re: Can I specify the default operator at query time ?

2008-06-23 Thread Nikhil Chhaochharia
I think this works only for StandardRequestHandler. Any way of doing it in DisMaxRequestHandler ? Nikhil - Original Message From: Erik Hatcher <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Monday, 23 June, 2008 3:24:13 PM Subject: Re: Can I specify the default operator at

RE: Can I specify the default operator at query time ?

2008-06-23 Thread Pierre-Yves LANDRON
Thanks !!! it'll save me great time ! > From: [EMAIL PROTECTED] > To: solr-user@lucene.apache.org > Subject: Re: Can I specify the default operator at query time ? > Date: Mon, 23 Jun 2008 05:54:13 -0400 > > > On Jun 23, 2008, at 5:42 AM, Pierre-Yves LANDRON wrote: > > Is there a way to trans

Re: Can I specify the default operator at query time ?

2008-06-23 Thread Erik Hatcher
On Jun 23, 2008, at 5:42 AM, Pierre-Yves LANDRON wrote: Is there a way to transgress the default operator defined in solrConfig for a particular query, and for this query only (a query-specific option)? Yes, you can specify the q.op=AND or q.op=OR per Solr request. Erik
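
A per-request override could look like the sketch below, which only builds the request URL. The q.op parameter and its AND/OR values come from the reply; the base URL and the helper name select_url are assumptions for illustration.

```python
from typing import Optional
from urllib.parse import urlencode

# Hypothetical endpoint; adjust to the actual Solr host and path.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def select_url(q: str, default_operator: Optional[str] = None) -> str:
    """Build a select URL, optionally overriding the default operator for this request only."""
    params = {"q": q}
    if default_operator is not None:
        if default_operator not in ("AND", "OR"):
            raise ValueError("default_operator must be 'AND' or 'OR'")
        params["q.op"] = default_operator
    return SOLR_SELECT_URL + "?" + urlencode(params)


and_url = select_url("solr lucene", "AND")
plain_url = select_url("solr lucene")
```

Requests built without default_operator carry no q.op parameter, so they fall back to whatever default the server configuration defines.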

Can I specify the default operator at query time ?

2008-06-23 Thread Pierre-Yves LANDRON
Hello, Is there a way to transgress the default operator defined in solrConfig for a particular query, and for this query only (a query-specific option) ? Thanks, P-Y Landron