RE: encoding problem

2009-09-01 Thread Bernadette Houghton
Finally resolved the problem! The solution was three-pronged on my Windows PC. Added to my.ini under mysqld: default-character-set=utf8 collation_server=utf8_unicode_ci character_set_server=utf8 skip-character-set-client-handshake. Added to the JAVA_OPTS environment variable: -Dfile.encoding=UTF-8. Add

Re: Monitoring split time for fq queries when filter cache is used

2009-09-01 Thread Rahul R
Thank you Martijn. On Tue, Sep 1, 2009 at 8:07 PM, Martijn v Groningen < martijn.is.h...@gmail.com> wrote: > Hi Rahul, > > Yes, your understanding is correct, but it is not possible to > monitor these actions separately with Solr. > > Martijn > > 2009/9/1 Rahul R : > > Hello, > > I am trying t

Searching for a set of keywords /phrases in a document

2009-09-01 Thread matchan
I have a large document with various sections. Each section has a list of keywords /phrases of interest. I have a master list of keywords/phrases stored as a String array. How can I use Solr or Lucene to search each section document for all keywords and basically give me which keywords were found
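A minimal sketch of the matching step described above, in plain Python rather than Solr or Lucene: each section is checked against the master keyword list, and the keywords found are reported per section. The function name `find_keywords`, the section names, and the sample text are all hypothetical; matching is case-insensitive on whole words, which is an assumption about what the poster wants.

```python
import re


def find_keywords(sections, keywords):
    """Return {section name: [keywords/phrases found in that section]}."""
    found = {}
    for name, text in sections.items():
        # Lowercase and collapse whitespace so phrases match across line breaks.
        normalized = " ".join(text.lower().split())
        hits = []
        for keyword in keywords:
            words = [re.escape(w) for w in keyword.lower().split()]
            # Require word boundaries on both sides of the keyword or phrase.
            pattern = r"(?<!\w)" + r"\s+".join(words) + r"(?!\w)"
            if re.search(pattern, normalized):
                hits.append(keyword)
        found[name] = hits
    return found


# Hypothetical sample data.
sections = {
    "intro": "Solr builds on Apache Lucene.",
    "setup": "Edit the filter   cache settings.",
}
keywords = ["lucene", "filter cache", "payload"]
result = find_keywords(sections, keywords)
```

For a large document or keyword list, indexing each section as its own Solr document and issuing one phrase query per keyword would scale better than this linear scan; the sketch only shows the shape of the result.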

Re: Why dismax isn't the default with 1.4 and why it doesn't support fuzzy search ?

2009-09-01 Thread Chris Hostetter
: The wiki says "As of Solr 1.3, the DisMaxRequestHandler is simply the : standard request handler with the default query parser set to the : DisMax Query Parser (defType=dismax).". I just made a checkout of svn : and dismax doesn't seem to be the default as : that paragraph doesn't say that dism

Re: Using Lucene's payload in Solr

2009-09-01 Thread Chris Hostetter
: Is it possible to have the copyField strip off the payload while it is : copying since doing it in the analysis phase is too late? Or should I : start looking into using UpdateProcessors as Chris had suggested? "nope" and "yep" I've had an idea in the back of my mind for a while now about add

Re: Adding new docs, but duplicating instead of updating

2009-09-01 Thread Chris Hostetter
: specified (again, using schema browser). The unique key field is marked as : type textTight. Your uniqueKey field needs to be something where every doc is only going to produce a single token. If you are using textTight and sending product SKU type data (as mentioned in another message in this t

Re: Date Faceting and Double Counting

2009-09-01 Thread Chris Hostetter
: When I added numerical faceting to my checkout of solr (solr-1240) I basically : copied date faceting and modified it to work with numbers instead of dates. : With numbers I got a lot of double-counted values as well. So to fix my : problem I added an extra parameter to number faceting where you

Re: Date Faceting and Double Counting

2009-09-01 Thread Chris Hostetter
: Is this a known behavior people are happy with, or should I file an issue : asking for ranges in date-facets to be constructed to subtract one second : from the end of each range (so that the effective range queries for my case It's a known annoyance, but not something that seems to annoy people e
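The double counting discussed in this thread can be illustrated with a small sketch in plain Python (not Solr code). It assumes each facet range is inclusive at both ends, as the thread implies, so a value that falls exactly on a boundary lands in two adjacent ranges; a half-open range counts it once. The function name `count_ranges` and the sample dates are hypothetical.

```python
from datetime import datetime, timedelta


def count_ranges(values, start, gap, n, inclusive_end=True):
    """Count values in n consecutive ranges of width gap, beginning at start."""
    counts = []
    for i in range(n):
        lo = start + i * gap
        hi = lo + gap
        if inclusive_end:
            # [lo, hi]: a value equal to hi is also counted by the next range.
            counts.append(sum(lo <= v <= hi for v in values))
        else:
            # [lo, hi): each value belongs to exactly one range.
            counts.append(sum(lo <= v < hi for v in values))
    return counts


# Hypothetical sample: the second value sits exactly on a range boundary.
values = [
    datetime(2009, 9, 1),
    datetime(2009, 9, 2),
    datetime(2009, 9, 2, 12),
]
start = datetime(2009, 9, 1)
gap = timedelta(days=1)

inclusive = count_ranges(values, start, gap, 2, inclusive_end=True)
half_open = count_ranges(values, start, gap, 2, inclusive_end=False)
```

With the inclusive ranges the per-range counts sum to more than the number of values, while the half-open counts sum to exactly the number of values.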

Re: Can solr do the equivalent of "select distinct(field)"?

2009-09-01 Thread Chris Hostetter
: let's say you filter your query on something and want to know how many : distinct "categories" that your results comprise. : then you can facet on the category field and count the number of facet : values that are returned, right? if you count the number of facet values returned you are getting

Re: extended documentation on analyzers

2009-09-01 Thread Chris Hostetter
: is there an online resource or a book that contains a thorough list of : tokenizers and filters available and their functionality? : : http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters ...from the intro on that page... For a more complete list of what Tokenizers and TokenFilters co

RE: SOLR vs SQL

2009-09-01 Thread Fuad Efendi
"No results found for 'surface area 377', displaying all properties." - why do we need SOLR then...

SOLR vs SQL

2009-09-01 Thread Fuad Efendi
RE: http://www.mysecondhome.eu I am browsing this website again (I have a similar challenge at http://www.casaGURU.com but still prefer database-SQL to search Professional by service type). I don't think SOLR is applicable in this specific case. I think standard DB queries with predefined d

RE: Adding new docs, but duplicating instead of updating

2009-09-01 Thread Christopher Baird
Hi Tim, The value I'm using is a product SKU. A sample would be like: L49-4251. Thanks -Chris -Original Message- From: Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS] [mailto:timothy.j.har...@nasa.gov] Sent: Tuesday, September 01, 2009 12:52 PM To: solr-user@lucene.apache.org; cba...@cardin

RE: Adding new docs, but duplicating instead of updating

2009-09-01 Thread Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS]
What is the value of your uniqueKey? -Original Message- From: Christopher Baird [mailto:cba...@cardinalcommerce.com] Sent: Tuesday, September 01, 2009 8:20 AM To: solr-user@lucene.apache.org Subject: RE: Adding new docs, but duplicating instead of updating Hi Tim, I appreciate the sugge

Re: Adding docs from MySQL and php

2009-09-01 Thread Pablo Ferrari
wow, it looks like DIH already works with relational databases... thanks again! 2009/9/1 Pablo Ferrari > Thanks Aakash! > > I've looked at it and it looks very interesting, the problem is that my > database is a relational model, therefore I don't have a table with all the > information, but man

Re: Adding docs from MySQL and php

2009-09-01 Thread Pablo Ferrari
Thanks Aakash! I've looked at it and it looks very interesting, the problem is that my database is a relational model, therefore I don't have a table with all the information, but many tables related to each other by their ids (primary keys and foreign keys). I've been thinking about using DataIm

Why dismax isn't the default with 1.4 and why it doesn't support fuzzy search ?

2009-09-01 Thread Erwin
Hello, Solr is great software, but I have some questions: The wiki says "As of Solr 1.3, the DisMaxRequestHandler is simply the standard request handler with the default query parser set to the DisMax Query Parser (defType=dismax).". I just made a checkout of svn and dismax doesn't s

Re: Adding docs from MySQL and php

2009-09-01 Thread Aakash Dharmadhikari
hi Pablo, DataImportHandler might be the best option for you. check this link http://wiki.apache.org/solr/DataImportHandler regards, aakash On Tue, Sep 1, 2009 at 9:18 PM, Pablo Ferrari wrote: > Hello all, > > I'm new to the list and new to Solr. My name is Pablo, I'm from Spain and > I'm dev

Re: solrj - Log4j and slf4j integration - java.lang.IllegalStateException thrown

2009-09-01 Thread Smiley, David W.
Are you running the latest versions of these logging libraries? I see nothing in the 1.5.8 SLF4J Log4j adapter that would cause this. ~ David Smiley Author: http://www.packtpub.com/solr-1-4-enterprise-search-server On 9/1/09 10:49 AM, "Villemos, Gert" wrote: We are using solrj 1.3 (with slf

RE: Adding new docs, but duplicating instead of updating

2009-09-01 Thread Christopher Baird
Hi Tim, I appreciate the suggestions. I can tell you that the document I ran the second time was the same document run the first time -- so any questions of field value shouldn't be a concern. Thanks -Chris -Original Message- From: Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS] [mailto:timo

solrj - Log4j and slf4j integration - java.lang.IllegalStateException thrown

2009-09-01 Thread Villemos, Gert
We are using solrj 1.3 (with slf4j) in a client also using Aperture (with log4j 1.2.14). When executing a query I get the error shown below. The request is never received by the server, i.e. the exception is thrown before the request is issued. I think I'm running into a compatibility issue bet

Re: Is caching worth it when my whole index is in RAM?

2009-09-01 Thread Michael
Thanks, Avlesh! I'll try the filter cache. Anybody familiar enough with the caching implementation to chime in? Michael On Mon, Aug 31, 2009 at 10:02 PM, Avlesh Singh wrote: > Good question! > The application level cache, say filter cache, would still help because it > not only caches values b

RE: Adding new docs, but duplicating instead of updating

2009-09-01 Thread Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS]
I could be off base here; maybe using textTight as the unique key is a common Solr practice, I don't know. But it would seem to me that using any field type that transforms a value (even if it is just whitespace removal) could be problematic. Maybe not the source of your issue here, but I'd be wo

Re: Monitoring split time for fq queries when filter cache is used

2009-09-01 Thread Martijn v Groningen
Hi Rahul, Yes, your understanding is correct, but it is not possible to monitor these actions separately with Solr. Martijn 2009/9/1 Rahul R : > Hello, > I am trying to measure the benefit that I am getting out of using the filter > cache. As I understand, there are two major parts to an fq qu

Adding new docs, but duplicating instead of updating

2009-09-01 Thread Christopher Baird
Hi All, I'm running Solr in a multicore setup. I've set one of the cores to have a specific field as the unique key (marked as the uniqueKey in the document and the field is defined as required). I'm sending an add command with all the docs using a multipart post. After running the add file, I

RE: Error while indexing using SmartChineseAnalyzer

2009-09-01 Thread Jana, Kumar Raja
Thanks for the reply Shalin. Posted the stack trace on the Jira issue SOLR-1336. -Kumar -Original Message- From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com] Sent: Tuesday, September 01, 2009 4:56 PM To: solr-user@lucene.apache.org Subject: Re: Error while indexing using SmartCh

Re: Error while indexing using SmartChineseAnalyzer

2009-09-01 Thread Shalin Shekhar Mangar
On Tue, Sep 1, 2009 at 4:37 PM, Jana, Kumar Raja wrote: > Hi, > > I tried using the patch provided for Solr-1336 JIRA issue for > integrating Lucene's SmartChineseAnalyzer with Solr and tried testing it > out but I faced the AbstractMethodError during indexing as well as > Searching (stack trace

Error while indexing using SmartChineseAnalyzer

2009-09-01 Thread Jana, Kumar Raja
Hi, I tried using the patch provided for Solr-1336 JIRA issue for integrating Lucene's SmartChineseAnalyzer with Solr and tried testing it out but I faced the AbstractMethodError during indexing as well as Searching (stack trace below). There seems to be something wrong during the tokenization of

Re: Drill down into hierarchical facet : how to?

2009-09-01 Thread Uri Boness
Hi, You know the level you're currently in: America/USA. You have the values for the location facet in the form: America/USA/NYC/Chelsea (3), America/USA/NYC/East Village (2), America/USA/San Francisco/Haight-Ashbury (5), America/USA/Los A
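One way the drill-down step might look on the client side, as a sketch in plain Python: given the current level and the path-style facet values with their counts, collect the distinct children one level down and sum their counts. The function name `next_level` is hypothetical, the "Europe/France/Paris" entry is invented to show filtering, and summing the counts assumes each document carries only one leaf path.

```python
from collections import Counter


def next_level(facet_counts, current_path, sep="/"):
    """Aggregate facet counts for the direct children of current_path."""
    prefix = current_path.rstrip(sep) + sep
    children = Counter()
    for value, count in facet_counts.items():
        if value.startswith(prefix):
            # Keep only the first path segment below the current level.
            child = value[len(prefix):].split(sep, 1)[0]
            if child:
                children[child] += count
    return dict(children)


facet_counts = {
    "America/USA/NYC/Chelsea": 3,
    "America/USA/NYC/East Village": 2,
    "America/USA/San Francisco/Haight-Ashbury": 5,
    "Europe/France/Paris": 4,  # hypothetical value outside the current level
}
children = next_level(facet_counts, "America/USA")
```

Restricting the facet request to values under the current path (for example with a facet prefix) would reduce the amount of data returned; the sketch only covers the grouping once the values are on the client.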

Re: How to set similarity to catch more results ?

2009-09-01 Thread Kaoul
Thank you, all three, for your answers. After more research, I think I need to use fuzzy search as I already know Levenshtein Distance and I don't want to manage a list of synonyms manually. So a "manual" spell check isn't for me. Thanks a lot. On Tue, Sep 1, 2009 at 1:15 AM, Avlesh Singh wrote: >> >> I want
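For readers unfamiliar with the Levenshtein distance mentioned above, here is a short reference sketch in plain Python. It is the textbook dynamic-programming formulation, not the code Solr or Lucene uses for fuzzy queries; the function name `levenshtein` is just illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter string in the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]
```

A fuzzy match then amounts to accepting terms whose distance from the query term is small relative to the term length, which is why no hand-maintained synonym list is needed.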

Re: Date Faceting and Double Counting

2009-09-01 Thread gwk
Hi Stephen, When I added numerical faceting to my checkout of solr (solr-1240) I basically copied date faceting and modified it to work with numbers instead of dates. With numbers I got a lot of double-counted values as well. So to fix my problem I added an extra parameter to number faceting