Re: 99.9% uptime requirement

2009-08-04 Thread Norberto Meijome
On Mon, 3 Aug 2009 13:15:44 -0700 "Robert Petersen" wrote: > Thanks all, I figured there would be more talk about daemontools if there > were really a need. I appreciate the input and for starters we'll put two > slaves behind a load balancer and grow it from there. > Robert, not taking away f

Re: weird problem with letters S and T

2009-10-28 Thread Norberto Meijome
On Wed, 28 Oct 2009 19:20:37 -0400 Joel Nylund wrote: > Well I tried removing those 2 letters from stopwords, didnt seem to > help, I also tried changing the field type to "text_ws", didnt seem to > work. Any other ideas? Hi Joel, if your stop word filter was applied on index, you will have

n-Gram, only works with queries of 2 letters

2008-06-22 Thread Norberto Meijome
hi there, my use case : I want to be able to match documents when only a partial word is provided. ie, searching for 'roc' or 'ock' should match documents containing 'rock'. As I understand, the way to solve this problem is to use the nGram tokenizer @ index time and the nGram analyser @ sear

Re: n-Gram, only works with queries of 2 letters

2008-06-22 Thread Norberto Meijome
On Mon, 23 Jun 2008 16:23:55 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > hi there, > my use case : I want to be able to match documents when only a partial word > is provided. ie, searching for 'roc' or 'ock' should match documents > containing &#

Re: n-Gram, only works with queries of 2 letters

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 05:33:49 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi, > > > When you add &debugQuery=true to the request, what does your query look like > after parsing? Hi Otis, sorry, i should have sent this before too. With minGramSize = 3 , same data, clean server star

Re: Wildcard search question

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 14:23:14 -0700 Jon Drukman <[EMAIL PROTECTED]> wrote: > ok well let's say that i can live without john/jon in the short term. > what i really need today is a case insensitive wildcard search with > literal matching (no fancy stemming. bobby is bobby, not bobbi.) > > what ar

Re: n-Gram, only works with queries of 2 letters

2008-06-23 Thread Norberto Meijome
On Mon, 23 Jun 2008 05:33:49 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi, > > > When you add &debugQuery=true to the request, what does your query look like > after parsing? > > BTW, I've tested same data + similar config using EdgeNGramTokenizer and this works properly -

Cost of having fieldTypes defined but not used

2008-06-23 Thread Norberto Meijome
Hi all, I'm curious , what is the cost (memory / processing time @ load? performance hit ? ) of having several unused fieldTypes defined in schema.xml ? cheers, B _ {Beto|Norberto|Numard} Meijome "Egotism is the anesthetic that dulls the pain of stupidity." Frank Leahy

several tokenizers in one field type

2008-06-24 Thread Norberto Meijome
hi all, ( I'm using 1.3 nightly build from 15th June 08.) Is there some documentation about how analysers + tokenizers are applied in fields ? In particular, my question : - If I define 2 tokenizers in a fieldtype, only the first one is applied, the other is ignored. Is that because the 2nd tok

(Edge)NGram tokenizer interaction with other filters

2008-06-24 Thread Norberto Meijome
hi everyone, if I define a field as

Re: several tokenizers in one field type

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 00:14:57 -0700 Ryan McKinley <[EMAIL PROTECTED]> wrote: > best docs are here: > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters yes, I've been reading that already , thanks :) > > > - If I define 2 tokenizers in a fieldtype, only the first one is > > applied, t

Re: (Edge)NGram tokenizer interaction with other filters

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 04:54:46 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > One tokenizer is followed by filters. I think this all might be a bit > clearer if you read the chapter about Analyzers in Lucene in Action if you > have a copy. I think if you try to break down that "the re

Otis : Re: n-Gram, only works with queries of 2 letters

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 09:10:58 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > On Mon, 23 Jun 2008 05:33:49 -0700 (PDT) > Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > > > Hi, > > > > > > When you add &debugQuery=true to the request, what

Re: SOLR-139 (Support updateable/modifiable documents)

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 16:04:24 +0100 Dave Searle <[EMAIL PROTECTED]> wrote: > At the moment I have an index of forum messages (each message being a > separate doc). Results are displayed on a per message basis, however, I would > like to group the results via their thread. Apart from using a facet

Re: SOLR-139 (Support updateable/modifiable documents)

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 16:34:44 +0100 Dave Searle <[EMAIL PROTECTED]> wrote: > I am currently storing the thread id within the message index, however, > although this would allow me to sort, it doesn't help with the grouping of > threads based on relevancy. See the idea is to index message data in

How to debug ?

2008-06-24 Thread Norberto Meijome
hi, I'm trying to understand why a search on a field tokenized with the nGram tokenizer, with minGramSize=n and maxGramSize=m doesn't find any matches for queries of length (in characters) of n+1..m (n works fine). analysis.jsp shows that it SHOULD match, but /select doesn't bring anything back. (

Re: How to debug ?

2008-06-24 Thread Norberto Meijome
On Tue, 24 Jun 2008 19:17:58 -0700 Ryan McKinley <[EMAIL PROTECTED]> wrote: > also, check the LukeRequestHandler > > if there is a document you think *should* match, you can see what > tokens it has actually indexed... right, I will look into that a bit more. I am actually using the lukeall.

Re: How to debug ?

2008-06-24 Thread Norberto Meijome
On Wed, 25 Jun 2008 08:37:35 +0200 Brian Carmalt <[EMAIL PROTECTED]> wrote: > There is a plugin for jetty: http://webtide.com/eclipse. Insert this as > and update site and let eclipse install the plugin for you You can then > start the jetty server from eclipse and debug it. Thanks Brian, good i

Re: How to debug ?

2008-06-25 Thread Norberto Meijome
On Tue, 24 Jun 2008 19:17:58 -0700 Ryan McKinley <[EMAIL PROTECTED]> wrote: > also, check the LukeRequestHandler > > if there is a document you think *should* match, you can see what > tokens it has actually indexed... > hi Ryan, I can't see the tokens generated using LukeRequestHandler. I c

Lucene 2.4-dev source ?

2008-06-25 Thread Norberto Meijome
Hi, where can I find these sources? I have the binary jars included with the nightly builds,but I'd like to look @ the code of some of the objects. In particular, http://svn.apache.org/viewvc/lucene/java/ doesnt have any reference to 2.4, and http://svn.apache.org/viewvc/lucene/java/trunk/src/

Re: Lucene 2.4-dev source ?

2008-06-25 Thread Norberto Meijome
On Wed, 25 Jun 2008 20:22:06 -0400 Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Note, also, that the Manifest file in the JAR has information about > the exact SVN revision so that you can check it out from there. > > > On Jun 25, 2008, at 12:37 PM, Yonik Seeley wrote: > > > trunk is the late

Re: NGramTokenizer issue

2008-06-25 Thread Norberto Meijome
On Wed, 25 Jun 2008 15:37:09 -0300 "Jonathan Ariel" <[EMAIL PROTECTED]> wrote: > I've been trying to use the NGramTokenizer and I ran into a problem. > It seems like solr is trying to match documents with all the tokens that the > analyzer returns from the query term. So if I index a document with

Re: NGramTokenizer issue

2008-06-25 Thread Norberto Meijome
On Thu, 26 Jun 2008 10:44:32 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > On Wed, 25 Jun 2008 15:37:09 -0300 > "Jonathan Ariel" <[EMAIL PROTECTED]> wrote: > > > I've been trying to use the NGramTokenizer and I ran into a problem. > > It s

Re: NGramTokenizer issue

2008-06-25 Thread Norberto Meijome
On Thu, 26 Jun 2008 01:15:34 -0300 "Jonathan Ariel" <[EMAIL PROTECTED]> wrote: > Ok. Played a bit more with that. > So I had a difference between my unit test and solr. In solr I'm actually > using a solr.RemoveDuplicatesTokenFilterFactory when querying. Tried to add > that to the test, and it fai

SpellCheckerRequestHandler & qt parameter

2008-06-26 Thread Norberto Meijome
Hi there, Short and sweet : Is SCRH intended to honour qt= ? longer... I'm testing the newest SCRH ( SOLR-572), using last night's nightly build. I have defined a 'dismax' request handler which searches across a number of fields. When I use the SCRH in a query, and I pass the qt=dismax param

SpellCheckComponent = choosing which one...

2008-06-26 Thread Norberto Meijome
Hi there, I am using the an almost default, config of spellcheck component ( details @ very end of email). I have the 3 spellcheckers defined, 'default', 'jarowinkler' and 'file'. I tried adding spellcheck.name=jarowinkler&spellcheck.build=true , and with spellcheck.reload=true as well , but

Re: SpellCheckComponent = choosing which one...

2008-06-26 Thread Norberto Meijome
On Fri, 27 Jun 2008 01:44:38 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > I am using the an almost default, config of spellcheck component ( details @ > very end of email). I have the 3 spellcheckers defined, 'default', > 'jarowinkler' and 'fil

Re: SpellCheckerRequestHandler & qt parameter

2008-06-26 Thread Norberto Meijome
On Thu, 26 Jun 2008 16:25:46 -0500 (CDT) "Geoffrey Young" <[EMAIL PROTECTED]> wrote: > > it seems like it ought to work as a component of your dismax handler. this > works for me: > [] ah i see now. cool. too bad about the crash. I don't know what the policy is for opening bugs in JIRA...

Re: SpellCheckerRequestHandler & qt parameter

2008-06-26 Thread Norberto Meijome
On Thu, 26 Jun 2008 16:25:46 -0500 (CDT) "Geoffrey Young" <[EMAIL PROTECTED]> wrote: > well *almost* - it works most excellently with q=$term but when I add > spellchecker.q=$term things implode: > > HTTP Status 500 - null java.lang.NullPointerException at > org.apache.solr.handler.component.Spel

Re: Problems with Stored Field

2008-06-29 Thread Norberto Meijome
On Sun, 29 Jun 2008 19:40:44 -0300 "Hugo Barauna" <[EMAIL PROTECTED]> wrote: > I am having problems with a stored field. The problem is that field is not > being stored as I need it to be. It has a tokenizer > class="solr.HTMLStripWhitespaceTokenizerFactory", but when it is stored, > that tokenize

analyzer index vs query vs {missing}

2008-06-30 Thread Norberto Meijome
hi there, when defining a field type, i understand the meaning of 'analyzer type="index"' , or type="query". What does it mean when the type is missing? does it apply at both index and query ? This can be found in the example's schema.xml :

Re: analyzer index vs query vs {missing}

2008-06-30 Thread Norberto Meijome
On Mon, 30 Jun 2008 05:52:33 -0400 Erik Hatcher <[EMAIL PROTECTED]> wrote: > Yes, that's exactly what it means. > > Erik great, thanks for the clarification. B _ {Beto|Norberto|Numard} Meijome "A dream you dream together is reality." John Lennon I speak for mys

Re: Best practices for permissions in DistrobutionScripts

2008-07-01 Thread Norberto Meijome
On Tue, 01 Jul 2008 17:04:07 +0530 Jacob Singh <[EMAIL PROTECTED]> wrote: > a). > Add jetty to a group called jetty > Somehow get jetty6 to use that group > Create another user (solr) and add it to the group jetty > Let it run the snapshooter This seems the best option. B ___

Re: Some non-standard implementations

2008-07-04 Thread Norberto Meijome
On Fri, 4 Jul 2008 10:39:28 -0300 "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > 3. "Did you mean" feature > 3.1. Does Solr implements that? http://wiki.apache.org/solr/SpellCheckComponent _ {Beto|Norberto|Numard} Meijome "And that's one reason we like to believe

Re: Pre-processor for stored fields

2008-07-08 Thread Norberto Meijome
On Tue, 8 Jul 2008 10:20:15 -0300 "Hugo Barauna" <[EMAIL PROTECTED]> wrote: > Hi, > > I already haved aked this, but I didn't get any good answer, so I will try > again. I need to pre-process a stored field before it is saved. Just like a > field that is gonna be indexed. I would be good to apply

Re: problems with SpellCheckComponent

2008-07-08 Thread Norberto Meijome
On Tue, 8 Jul 2008 21:10:51 +0530 "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > Also note that you'll need to specify spellcheck.build=true only on the > first request when it will build the spell check index. The subsequent > requests need not have spellcheck.build=true. as a matter of fa

Re: Automated Index Creation

2008-07-08 Thread Norberto Meijome
On Tue, 8 Jul 2008 12:05:45 -0400 Willie Wong <[EMAIL PROTECTED]> wrote: > I think the snapshooter will work fine for creating the indexes and then I > can use the multicore capabilities to make them available to users one > final question though, after snapshot has been created is there a w

Re: Automated Index Creation

2008-07-09 Thread Norberto Meijome
On Wed, 9 Jul 2008 08:48:35 +0530 "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > Yes, SOLR-350 added that capability. Look at > http://wiki.apache.org/solr/MultiCore for details. ahh loving SOLR more every day :P thx _ {Beto|Norberto|Numard} Meijome I used to hate

Re: Indexing xml data

2008-07-09 Thread Norberto Meijome
On Wed, 9 Jul 2008 19:51:45 +0530 "Noble Paul _ __" <[EMAIL PROTECTED]> wrote: > You can put it into a 'string' field directly if we refer to the default string field , you won't be able to search for the contents of the XML (unless you search for the whole t

Re: tagging application, best way to architect?

2008-07-09 Thread Norberto Meijome
On Thu, 10 Jul 2008 09:36:01 +0530 "Noble Paul _ __" <[EMAIL PROTECTED]> wrote: > > 2. We're assuming we'll have thousands of users with independent data; any > > good way to partition multiple indexes with solr? With Lucene we could > > just save those in ind

Re: Search slow on a field with many unique values (date)

2008-07-11 Thread Norberto Meijome
On Thu, 10 Jul 2008 17:55:55 -0600 "Galen Pahlke" <[EMAIL PROTECTED]> wrote: > Could this perhaps be because a date field has so many possible unique > values? I don't know how to find out exactly, but I'd guess there are > at least a few million unique dates in the index. Would increasing the >

Re: Wiki for 1.3

2008-07-11 Thread Norberto Meijome
On Fri, 11 Jul 2008 15:22:35 + sundar shankar <[EMAIL PROTECTED]> wrote: > I recently was looking to find details of 1.3 specific analysers and filters > in the solr wiki and was unable to do so. Could anyone please point me to a > place where I can find some documentation of the same. >

Re: Wiki for 1.3

2008-07-14 Thread Norberto Meijome
On Mon, 14 Jul 2008 15:52:35 + sundar shankar <[EMAIL PROTECTED]> wrote: > Hi Hoss, > I was talking about classes like EdgeNGramFilterFactory, > PatterReplaceFilterfactory etc. I didnt find these in the 1.2 Jar. Where do I > find wiki for these and Specific classes introduced fo

Re: Wiki for 1.3

2008-07-14 Thread Norberto Meijome
On Mon, 14 Jul 2008 23:25:25 + sundar shankar <[EMAIL PROTECTED]> wrote: >Thanks for your patient response. I dont wanna know the classes changed, > but I wanna get a hand on the wiki page for the same. I tried to search for > these classes in the solr wiki. I was getting a page does not

Re: Duplicate content

2008-07-15 Thread Norberto Meijome
On Tue, 15 Jul 2008 13:15:41 +0530 "Sunil" <[EMAIL PROTECTED]> wrote: > 1) I don't want duplicate content. SOLR uses the field you define as the unique field to determine whether a document should be replaced or added. The rest of the fields are in your hands. You could devise a setup whereby the

Re: Duplicate content

2008-07-15 Thread Norberto Meijome
On Tue, 15 Jul 2008 10:48:14 +0200 Jarek Zgoda <[EMAIL PROTECTED]> wrote: > >> 2) I don't want to overwrite old content with new one. > >> > >> Means, if I add duplicate content in solr and the content already > >> exists, the old content should not be overwritten. > > > > before inserting a n

Re: Filter by Type increases search results.

2008-07-15 Thread Norberto Meijome
On Tue, 15 Jul 2008 18:07:43 +0530 "Preetam Rao" <[EMAIL PROTECTED]> wrote: > When I say filter, I meant q=fish&fq=type:idea btw, this *seems* to only work for me with standard search handler. dismax and fq: dont' seem to get along nicely... but maybe, it is just late and i'm not testing it pro

Re: Out of memory on Solr sorting

2008-07-23 Thread Norberto Meijome
On Tue, 22 Jul 2008 20:19:49 + sundar shankar <[EMAIL PROTECTED]> wrote: > Thanks for the explanation mark. The reason I had it as 512 max was cos > earlier the data file was just about 30 megs and it increased to this much > for of the usage of EdgeNGramFactoryFilter for 2 fields. Thats gre

Re: performance implications on using lots of values in fq

2008-07-24 Thread Norberto Meijome
On Wed, 23 Jul 2008 11:28:49 -0700 (PDT) briand <[EMAIL PROTECTED]> wrote: > I have documents in SOLR such that each document contains one to many points > (latitude and longitudes). Currently we store the multiple points for a > given document in the db and query the db to find all of the docum

Re: nested data structure definition

2008-07-29 Thread Norberto Meijome
On Mon, 28 Jul 2008 23:37:22 -0700 (PDT) matt connolly <[EMAIL PROTECTED]> wrote: > In my site, I have a document, which may have multiple comments. > > For each comment, I would like to know several pieces of information, like: > text, author, and date. Matt, please read the documentation abou

Re: Sum of one field

2008-08-04 Thread Norberto Meijome
On Mon, 04 Aug 2008 14:19:45 -0300 Leonardo Dias <[EMAIL PROTECTED]> wrote: > Everyone exhibits "your search for x has returned y results" on the top > of the results page, but we need something else, which would be > something like "your search for x returned y results in z records", > being z

Re: Solr Logo thought

2008-08-04 Thread Norberto Meijome
On Mon, 4 Aug 2008 09:29:30 -0700 Ryan McKinley <[EMAIL PROTECTED]> wrote: > > > > If there is a still room for new log design for Solr and the > > community is > > open for it then I can try to come up with some proposal. Doing logo > > for > > Mahout was really interesting experience. > > >

Re: solr 1.3 ??

2008-08-04 Thread Norberto Meijome
On Mon, 4 Aug 2008 21:13:09 -0700 (PDT) Vicky_Dev <[EMAIL PROTECTED]> wrote: > Can we get solr 1.3 release as soon as possible? Otherwise some interim > release (1.2.x) containing DataImportHandler will also a good option. > > Any Thoughts? have you tried one of the nightly builds? I've been f

Re: Diagnostic tools

2008-08-05 Thread Norberto Meijome
On Tue, 5 Aug 2008 11:43:44 -0500 "Kashyap, Raghu" <[EMAIL PROTECTED]> wrote: > Hi, Hi Kashyap, please don't hijack topic threads. http://en.wikipedia.org/wiki/Thread_hijacking thanks!! B _ {Beto|Norberto|Numard} Meijome Software QA is like cleaning my cat's litter box:

Re: unique key

2008-08-05 Thread Norberto Meijome
On Tue, 5 Aug 2008 14:41:08 -0300 "Scott Swan" <[EMAIL PROTECTED]> wrote: > I currently have multiple documents that i would like to index but i would > like to combine two fields to produce the unique key. > > the documents either have 1 or the other fields so by combining the two > fields i w

Re: Sum of one field

2008-08-05 Thread Norberto Meijome
On Tue, 05 Aug 2008 18:58:42 -0300 Leonardo Dias <[EMAIL PROTECTED]> wrote: > So I'm looking for a Ferrari. CarStore says that there are 5 ads for > Ferrari, but one ad has 2 Ferraris being sold, the other ad has 3 > Ferraris and all the others have 1 Ferrari each, meaning that there are > 5 ad

Re: Solr Logo thought

2008-08-06 Thread Norberto Meijome
On Tue, 05 Aug 2008 16:02:51 -0400 Stephen Weiss <[EMAIL PROTECTED]> wrote: > My issue with the logos presented was they made solr look like a > school project instead of the powerful tool that it is. The tricked > out font or whatever just usually doesn't play well with the business > type

Re: case preserving for data but not for indexing

2008-08-06 Thread Norberto Meijome
On Wed, 6 Aug 2008 20:21:28 -0400 "Ian Connor" <[EMAIL PROTECTED]> wrote: > In order to preserve case for the data, but not for indexing, I have > created two fields. One is type Author that is defined as: > > sortMissingLast="true" omitNorms="true"> > >

Re: case preserving for data but not for indexing

2008-08-07 Thread Norberto Meijome
On Wed, 6 Aug 2008 21:35:47 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > > > > > 2 Tokenizers? i wondered about that too, but didn't have the time to test... B _ {Beto|Norberto|Numard} Meijome "Always listen to experts. They'll t

Re: HTML Standard Strip filter word boundary bug

2008-08-07 Thread Norberto Meijome
On Thu, 7 Aug 2008 00:50:59 -0700 (PDT) matt connolly <[EMAIL PROTECTED]> wrote: > Where do I file a bug report? https://issues.apache.org/jira thanks! B _ {Beto|Norberto|Numard} Meijome Contrary to popular belief, Unix is user friendly. It just happens to be very sele

Re: Still no results after removing from stopwords

2008-08-11 Thread Norberto Meijome
On Sun, 10 Aug 2008 19:58:24 -0700 (PDT) SoupErman <[EMAIL PROTECTED]> wrote: > I needed to run a search with a query containing the word "not", so I removed > "not" from the stopwords.txt file. Which seemed to work, at least as far as > parsing the query. It was now successfully searching for tha

Re: unique key

2008-08-11 Thread Norberto Meijome
On Wed, 6 Aug 2008 12:25:34 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > On Tue, 5 Aug 2008 14:41:08 -0300 > "Scott Swan" <[EMAIL PROTECTED]> wrote: > > > I currently have multiple documents that i would like to index but i would > > like to c

Re: Can't Delete Record

2008-08-11 Thread Norberto Meijome
On Mon, 11 Aug 2008 06:48:05 -0700 (PDT) Vj Ali <[EMAIL PROTECTED]> wrote: > i also sends tag as well. maybe you need instead of ? _ {Beto|Norberto|Numard} Meijome "With sufficient thrust, pigs fly just fine. However, this is not necessarily a good idea. It is h

adds / delete within same 'transaction'..

2008-08-11 Thread Norberto Meijome
Hello :) I *think* i know the answer, but i'd like to confirm : Say I have 1old already indexed and commited (ie, 'live' ) What happens if I issue: 1 1new will delete happen first, and then the add, or could it be that the add happens before delete, in which case i end up with no more doc

Problems using saxon for XSLT transforms

2008-08-12 Thread Norberto Meijome
hi :) I'm trying to use SAXON instead of the default XSLT parser. I was pretty sure i had it running fine on 1.2, but when I repeated the same steps (as per the wiki) on latest nightly build, i cannot see any sign of it being loaded or use, although the classpath seems to be pointing to them (see b

Re: adds / delete within same 'transaction'..

2008-08-12 Thread Norberto Meijome
On Tue, 12 Aug 2008 11:21:50 -0700 Mike Klaas <[EMAIL PROTECTED]> wrote: > > will delete happen first, and then the add, or could it be that the > > add happens before delete, in which case i end up with no more doc > > id=1 ? > > As long as you are sending these requests on the same thread

Re: adds / delete within same 'transaction'..

2008-08-12 Thread Norberto Meijome
On Tue, 12 Aug 2008 20:53:12 -0400 "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On Tue, Aug 12, 2008 at 1:48 AM, Norberto Meijome <[EMAIL PROTECTED]> wrote: > > What happens if I issue: > > > > 1 > > 1new > > > > > > will

Re: Best way to index without diacritics

2008-08-12 Thread Norberto Meijome
On Tue, 12 Aug 2008 11:44:42 -0400 "Steven A Rowe" <[EMAIL PROTECTED]> wrote: > Solr is Unicode aware. The ISOLatin1AccentFilterFactory handles diacritics > for the ISO Latin-1 section of the Unicode character set. UTF (do you mean > UTF-8?) is a (set of) Unicode serialization(s), and once Sol

Re: Searching Questions

2008-08-13 Thread Norberto Meijome
On Tue, 12 Aug 2008 13:26:26 -0700 "Jake Conk" <[EMAIL PROTECTED]> wrote: > 1) I want to search only within a specific field, for instance > `category`. Is there a way to do this? of course. Please see http://wiki.apache.org/solr/SolrQuerySyntax (in particular, follow the link to Lucene syntax..

Re: Spellcheker and Dismax both

2008-08-14 Thread Norberto Meijome
On Thu, 14 Aug 2008 12:21:13 +0530 "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > The SpellCheckerRequestHandler is now deprecated with Solr 1.3 and it has > been replaced by SpellCheckComponent. > > http://wiki.apache.org/solr/SpellCheckComponent which works quite well with dismax. B __

Re: Best way to index without diacritics

2008-08-14 Thread Norberto Meijome
( 2 in 1 reply) On Wed, 13 Aug 2008 09:59:21 -0700 Walter Underwood <[EMAIL PROTECTED]> wrote: > Stripping accents doesn't quite work. The correct translation > is language-dependent. In German, o-dieresis should turn into > "oe", but in English, it shoulde be "o" (as in "co__perate" or > "M__tle

Re: Best way to index without diacritics

2008-08-14 Thread Norberto Meijome
On Thu, 14 Aug 2008 11:34:47 -0400 "Steven A Rowe" <[EMAIL PROTECTED]> wrote: [...] > The kind of filter Walter is talking about - a generalized language-aware > character normalization Solr/Lucene filter - does not yet exist. My guess is > that if/when it does materialize, both the Solr and th

[SOLVED...]Re: Problems using saxon for XSLT transforms

2008-08-17 Thread Norberto Meijome
On Tue, 12 Aug 2008 23:36:32 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > hi :) > I'm trying to use SAXON instead of the default XSLT parser. I was pretty sure > i > had it running fine on 1.2, but when I repeated the same steps (as per the > wiki) on latest n

DataImportHandler : more forgiving initialisation possible?

2008-08-17 Thread Norberto Meijome
hi guys, First of all, thanks for DIH - it's great :) One thing I noticed during my tests ( nightly, 2008-08-16) is that, if the DB is not available during SOLR startup time, the whole core won't initialise .- the error is shown below. I was wondering, 1) would it be possible to have DIH bomb o

DIH - calling spellchecker rebuild...

2008-08-17 Thread Norberto Meijome
Guys + gals, just a question of form - would DIH itself be the right place to implement a "URLS to call after successfully completing a DIH full or partial load" - for example, to rebuild spellchecker when new items have been added? Or should that be part of my external process (cron -> shell

Re: DIH - calling spellchecker rebuild...

2008-08-17 Thread Norberto Meijome
On Sun, 17 Aug 2008 20:22:26 +0530 "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > If it is only SpellCheckComponent that you are interested in, then see > SOLR-622. > > You can add this to your SCC config to rebuild SCC after every commit: > true ah great stuff , thanks Shalin. B

DIH - commit / optimize

2008-08-17 Thread Norberto Meijome
Hi again, I see in the DIH wiki page : [...] full-import [..] commit: (default 'true'). Tells whether to commit+optimize after the operation [...] but nothing for delta-import... I think it would be useful , a 'commit' (default=true) , 'optimize' (default=false) for the delta-import - these sh

Re: DIH - commit / optimize

2008-08-17 Thread Norberto Meijome
On Mon, 18 Aug 2008 10:14:32 +0800 "finy finy" <[EMAIL PROTECTED]> wrote: > i use solr for 3 months, and i find some question follow: Please do not hijack mail threads. http://en.wikipedia.org/wiki/Thread_hijacking _ {Beto|Norberto|Numard} Meijome "Ask not what's inside

Re: DIH - commit / optimize

2008-08-17 Thread Norberto Meijome
On Mon, 18 Aug 2008 09:34:56 +0530 "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > Actually we have commit and optimize as separate request parameters > defaulting to true for both full-import and delta-import. You can add a > request parameter optimize=false for delta-import if you want to c

Re: hello, a question about solr.

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 15:33:02 +0800 "finy finy" <[EMAIL PROTECTED]> wrote: > the name field is text,which is analysed, i use the query > "name:ibmT63notebook" why do you search with no spaces? is this free text entered by a user, or is it part of a link which you control ? PS: please dont top-po

.wsdl for example....

2008-08-18 Thread Norberto Meijome
hi :) does anyone have a .wsdl definition for the example bundled with SOLR? if nobody has it, would it be useful to have one ? cheers, B _ {Beto|Norberto|Numard} Meijome Intelligence: Finding an error in a Knuth text. Stupidity: Cashing that $2.56 check you got. I spe

Re: hello, a question about solr.

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 23:07:19 +0800 "finy finy" <[EMAIL PROTECTED]> wrote: > because i use chinese character, for example "ibm___" > solr will parse it into a term "ibm" and a phraze "_ __" > can i use solr to query with a term "ibm" and a term "_" and a term > "__

Re: .wsdl for example....

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 19:08:24 -0300 "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > Do you wanna a full web service for SOLR example? How a .wsdl will help you? > Why don't you use the HTTP interface SOLR provides? > > Anyways, if you need to develop a web service (SOAP compliant) to access

Re: .wsdl for example....

2008-08-18 Thread Norberto Meijome
On Tue, 19 Aug 2008 11:23:48 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > On Mon, 18 Aug 2008 19:08:24 -0300 > "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > > > Do you wanna a full web service for SOLR example? How a .wsdl will help you? >

Re: Clarification on facets

2008-08-18 Thread Norberto Meijome
On Tue, 19 Aug 2008 10:18:12 +1200 "Gene Campbell" <[EMAIL PROTECTED]> wrote: > Is this interpreted as meaning, there are 10 documents that will match > with 'car' in the title, and likewise 6 'boat' and 2 'bike'? Correct. > If so, is there any way to get counts for the *number times* a value >

Re: hello, a question about solr.

2008-08-20 Thread Norberto Meijome
On Wed, 20 Aug 2008 10:58:50 -0300 "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > A tiny but really explanation can be found here > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters thanks Alexander - indeed, quite short, and focused on shingles ... which , if I understand corr

Re: Querying Question

2008-08-21 Thread Norberto Meijome
On Thu, 21 Aug 2008 18:09:11 -0700 "Jake Conk" <[EMAIL PROTECTED]> wrote: > I thought if I used to copy my string field to a text > field then I can search for words within it and not limited to the > entire content. Did I misunderstand that? but you need to search on the fields that are defined

Re: "Multicore" and snapshooter / snappuller

2008-08-25 Thread Norberto Meijome
On Fri, 22 Aug 2008 12:21:53 -0700 "Lance Norskog" <[EMAIL PROTECTED]> wrote: > Apparently the ZFS (Silicon Graphics > originally) is great for really huge files. hi Lance, You may be confusing Sun's ZFS with SGI's XFS. The OP referred, i think, to ZFS. B _ {Beto|Nor

Re: dataimporthandler and mysql connector jar

2008-08-25 Thread Norberto Meijome
On Mon, 25 Aug 2008 17:11:47 +0200 Walter Ferrara <[EMAIL PROTECTED]> wrote: > Launching a multicore solr with dataimporthandler using a mysql driver, > (driver="com.mysql.jdbc.Driver") works fine if the mysql connector jar > (mysql-connector-java-5.0.7-bin.jar) is in the classpath, either jdk > c

Re: Question about search suggestion

2008-08-26 Thread Norberto Meijome
On Tue, 26 Aug 2008 15:15:21 +0300 Aleksey Gogolev <[EMAIL PROTECTED]> wrote: > > Hello. > > I'm new to solr and I need to make a search suggest (like google > suggestions). > Hi Aleksey, please search the archives of this list for subjects containing 'autocomplete' or 'auto-suggest'. that sho

Re: Storing two different files

2008-08-28 Thread Norberto Meijome
On Thu, 28 Aug 2008 02:01:05 -0700 (PDT) sanraj25 <[EMAIL PROTECTED]> wrote: > I want to index two different files in solr.(for ex) I want to store > two tables like, job_post and job_profile in solr. But now both are stored > in same place in solr.when i get data from job_post, data come from >

Re: Regarding Indexing

2008-08-29 Thread Norberto Meijome
On Fri, 29 Aug 2008 00:31:13 -0700 (PDT) sanraj25 <[EMAIL PROTECTED]> wrote: > But still i cant maintain two index. > please help me how to create two cores in solr What specific problem do you have ? B _ {Beto|Norberto|Numard} Meijome "Always listen to experts. They'll

Re: Regarding Indexing

2008-08-29 Thread Norberto Meijome
On Fri, 29 Aug 2008 02:37:10 -0700 (PDT) sanraj25 <[EMAIL PROTECTED]> wrote: > I want to store two independent datas in solr index. so I decided to create > two index.But that's not possible.so i go for multicore concept in solr > .can u give me step by step procedure to create multicore in solr

Re: about boost weight

2008-09-14 Thread Norberto Meijome
On Sat, 13 Sep 2008 16:17:12 + zzh <[EMAIL PROTECTED]> wrote: >I think this is a stupid method, because the search conditions is too > long, and the search efficiency will be low, we hope you can help me to solve > this problem. Hi, IMHO,a long set of conditions doesn't make it stupid. Yo

Re: Special character matching 'x' ?

2008-09-18 Thread Norberto Meijome
On Thu, 18 Sep 2008 10:53:39 +0530 "Sanjay Suri" <[EMAIL PROTECTED]> wrote: > One of my field values has the name "R__ikk__nen" which contains a special > characters. > > Strangely, as I see it anyway, it matches on the search query 'x' ? > > Can someone explain or point me to the solution/doc

Re: Any way to extract most used keywords from an index (or a random set)

2008-09-22 Thread Norberto Meijome
On Mon, 22 Sep 2008 15:46:54 +0530 "Jacob Singh" <[EMAIL PROTECTED]> wrote: > Hi, > > I'm trying to write a testing suite to gauge the performance of solr > searches. To do so, I'd like to be able to find out what keywords > will get me search results. Is there anyway to programaticaly do this

Re: Using Shingles to Increase Phrase Search Performance

2008-09-24 Thread Norberto Meijome
On Sat, 16 Aug 2008 15:39:44 -0700 "Chris Harris" <[EMAIL PROTECTED]> wrote: [...] > So finally I modified the Lucene ShingleFilter class to add an > "outputUnigramIfNoNgram option". Basically, if you set that option, > and also set outputUnigrams=false, then the filter will tokenize just > as in

Dismax , "query phrases"

2008-09-24 Thread Norberto Meijome
Hello, I've seen references to this in the list, but not completely explained...my apologies if this is FAQ (and for the length of the email). I am using dismax across a number of fields on an index with data about music albums & songs - the fields are quite full of stop words. I am trying to

Re: help required: how to design a large scale solr system

2008-09-24 Thread Norberto Meijome
On Wed, 24 Sep 2008 07:46:57 -0400 Mark Miller <[EMAIL PROTECTED]> wrote: > Yes. You will def see a speed increasing by avoiding http (especially > doc at a time http) and using the direct csv loader. > > http://wiki.apache.org/solr/UpdateCSV and the obvious reason that if, for whatever reason,

Re: Dismax , "query phrases"

2008-09-24 Thread Norberto Meijome
On Wed, 24 Sep 2008 08:34:57 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > What happens if you change ps from 100 to 1 and comment out that ord function? > > > Otis Hi Otis, no luck - without " " : smashing pumpkins smashing pumpkins +((DisjunctionMaxQuery((genre:smash^0.2 | title

  1   2   3   >