Re: how to do auto-suggest case-insensitive match and return original case field values

2009-12-08 Thread hermida
Uri Boness wrote: > > Just updated SOLR-1625 to support regexp hints. > > https://issues.apache.org/jira/browse/SOLR-1625 > > Cheers, > Uri > This is perfect, exactly what is needed to make this functionality possible. Is the patch already in trunk? thanks, leandro

RE: do copyField's need to exist as Fields?

2009-12-08 Thread Jaco Olivier
Hi Regan, Something I noticed on your setup... The ID field in your setup I assume to be your uniqueID for the book or journal (The ISSN or something) Try making this a string as TEXT is not the ideal field to use for unique IDs Congrats on figuring out SOLR fields - I suggest getting the SOLR

Re: Enumerating wildcard terms

2009-12-08 Thread Erik Hatcher
Mark, The TermsComponent should do the trick for you. http://wiki.apache.org/solr/TermsComponent Erik On Dec 9, 2009, at 7:46 AM, Mark N wrote: Is it possible to enumerate all terms that match the specified wildcard filter term, similar to the Lucene WildcardTermEnum API? For exa
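A minimal sketch of what a TermsComponent request for the abc* case might look like, written in Python only to assemble the URL. The parameters terms, terms.fl, terms.prefix and terms.limit are TermsComponent parameters; the host, the /terms handler path (as registered in the example solrconfig.xml) and the field name "name" are assumptions for illustration.

```python
from urllib.parse import urlencode


def terms_url(base_url, field, prefix, limit=10):
    """Build a TermsComponent URL listing indexed terms that start with `prefix`."""
    params = [
        ("terms", "true"),         # switch the component on
        ("terms.fl", field),       # field whose indexed terms are enumerated
        ("terms.prefix", prefix),  # only terms beginning with this string
        ("terms.limit", str(limit)),
    ]
    return base_url + "?" + urlencode(params)


# Hypothetical host, handler path and field name.
url = terms_url("http://localhost:8983/solr/terms", "name", "abc")
print(url)
```

Note that the terms returned are the indexed tokens, so on a lowercased field they come back lowercased.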

Enumerating wildcard terms

2009-12-08 Thread Mark N
Is it possible to enumerate all terms that match the specified wildcard filter term, similar to the Lucene WildcardTermEnum API? For example, if I search abc* then I should be able to access all the terms abc1, abc2, abc3... that exist in the index. What would be a better approach to meet this functi

Re: Replicating multiple cores

2009-12-08 Thread Jason Rutherglen
> Complexity is the main problem I agree, replicating multiple cores otherwise means multiple rsyncd processes, and true enough that management of shell scripts multiplies in complexity. 2009/12/8 Noble Paul നോബിള്‍ नोब्ळ् : > On Wed, Dec 9, 2009 at 6:14 AM, Jason Rutherglen > wrote: >>> Yes. I

Re: Replicating multiple cores

2009-12-08 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Dec 9, 2009 at 6:14 AM, Jason Rutherglen wrote: >> Yes. I'd highly recommend using the Java replication though. > > Is there a reason?  I understand it's new etc, however I think one > issue with it is it's somewhat non-native access to the filesystem. > Can you illustrate a real world adv

Re: search on tomcat server

2009-12-08 Thread Lance Norskog
Solr comes with an example solr installation in the example/ directory. Run this, look at the README.txt file, index the xml files in example/exampledocs, and do queries like 'disk' and 'memory'. And read example/conf/schema.xml and example/conf/solrconfig.xml. Most of the details of what solr doe

Re: Solr Search in stemmed and non stemmed mode

2009-12-08 Thread Lance Norskog
Short answer: the standard query handler is right for carefully designing queries. The dismax query handler is right for putting a 'search box' in a web page for regular users. On 12/7/09, khalid y wrote: > Thanks, > > I'll read the mail archive. > > Your suggestion is like mine but without the D

Re: question about schemas

2009-12-08 Thread Lance Norskog
I don't know. The common way to do this in Solr is the full denormalization technique, but that blows up in this case. This is not an easy problem space to implement in Solr. Data warehousing & star schema techniques may be more appropriate. On 12/7/09, solr-user wrote: > > > Lance Norskog-2 wrot

Re: Multiindexing

2009-12-08 Thread Lance Norskog
A "core" is one index. I think you mean: 3-5 indexes in different cores. Since you want to search across them, they should have the same schema. There is a feature called Distributed Search that searches across multiple indexes. There is no administration support for indexing parts of one data set

bool default - if missing when "updating" uses current or default value?

2009-12-08 Thread regany
hello, if I have a boolean fieldType (solr.BoolField) with a default value of "true", and I insert a new document I understand that the boolean value will be set to TRUE. But if I update an existing document, and I don't pass in a value for the boolean field, will Solr keep the existing boolean

Solr Cell and Spellchecking.

2009-12-08 Thread Michael Boyle
Following Erik Hatcher's post about using SolrCell and acts_as_solr { http://www.lucidimagination.com/blog/2009/02/17/acts_as_solr_cell/ }, I have been able to index a rich document stream and retrieve its id. No worries. However, I have the SpellCheckComponent set up to build on commit (buil

Re: Replicating multiple cores

2009-12-08 Thread Jason Rutherglen
> Yes. I'd highly recommend using the Java replication though. Is there a reason? I understand it's new etc, however I think one issue with it is it's somewhat non-native access to the filesystem. Can you illustrate a real world advantage other than the enhanced admin screens? On Mon, Dec 7, 200

Packaging & installing SOLR on linux

2009-12-08 Thread insaneyogi3008
Hello, At the risk of asking a highly general question, can anybody give me pointers or best practices on how best one can package SOLR & its associated files as a Linux RPM file, so that this core/instance can be ported on multiple instances? If anybody has experience working on such a system

Multiple Facet prefixes on the same facet field in one request?

2009-12-08 Thread Robert Purdy
Hey all, Is there anyway in Solr 1.4/1.5 to perform multiple facet prefixes on the same facet field in one request? Ex. On field 'Foo' I want to perform a facet prefix of A* and B* so I can get a facet response of all terms prefixed with A and all terms prefixed with B, either grouped togethe

Solr usage with Auctions/Classifieds?

2009-12-08 Thread regany
hello! just wondering if anyone is using Solr as their search for an auction / classified site, and if so how have you managed your setup in general? ie. searching against listings that may have expired etc. regan

Re: do copyField's need to exist as Fields?

2009-12-08 Thread regany
regany wrote: > > Is there a different way I should be setting it up to achieve the above?? > Think I figured it out. I set up the so they are present, but get ignored except for the "text" field which gets indexed... and then copyField the first 4 fields to the "text" field:

Re: java.lang.NumberFormatException: For input string: ""

2009-12-08 Thread Chris Hostetter
: its strange i had a dismaxhandler and it had an empty value for ps field : i added a default value like 100 and the error disappeared. I really wish the java compiler had an option so we could say "when compiling our code, treat this list of unchecked exceptions like checked exceptions" so we

do copyField's need to exist as Fields?

2009-12-08 Thread regany
Hello! (solr newbie alert) I want to pass 4 fields into Solr 1. id (unique) 2. title 3. subtitle 4. body but only want to index and store 2: 1. id (unique) 2. text (copyField of id, title, subtitle, body). The search then searches on "text", and returns only matching "id's". When I set up t

Re: how to do auto-suggest case-insensitive match and return original case field values

2009-12-08 Thread hermida
Hello, Thanks for the reply (see below) hossman wrote: > > The type of approach you are describing (doing a prefix based query for > autosuggest) probably won't work very well unless your index is 100% > designed just for the autosuggest ... if it's an index about products, and > you're jus

Re: About fsv (sort field falues)

2009-12-08 Thread Yonik Seeley
On Tue, Dec 8, 2009 at 4:04 PM, Marc Sturlese wrote: > I am tracing QueryComponent.java and would like to know the purpose of doFSV > function. Don't understand what fsv are for. > Have tried some queries with fsv=true and some extra info appears in the > response: > > It's currently an internal

About fsv (sort field falues)

2009-12-08 Thread Marc Sturlese
I am tracing QueryComponent.java and would like to know the purpose of the doFSV function. Don't understand what fsv are for. Have tried some queries with fsv=true and some extra info appears in the response: But don't know what it is for and can't find much info out there. I read: // The query

Re: # in query

2009-12-08 Thread Erick Erickson
Sorry, I usually think of things in Lucene land and reflexively think of the fat client. Anyway, here's your problem I think... WordDelimiterFilterFactory. See: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory It's losing the # altogether, as indica

Re: Case Insensitive search not working

2009-12-08 Thread Tom Hill
Did you rebuild the index? Changing the analyzer for the index doesn't affect already indexed documents. Tom On Tue, Dec 8, 2009 at 11:57 AM, insaneyogi3008 wrote: > > Hello, > > I tried to force case insensitive search by having the following setting in > my schema.xml file which I guess is st
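The point about rebuilding can be illustrated with a toy inverted index. This is a sketch in plain Python, not Solr code: the analyze function stands in for an analyzer chain with or without a lowercase filter, and ToyIndex is a hypothetical stand-in for the index. Tokens written with the old analyzer stay in the index unchanged, so a query analyzed with the new lowercasing chain cannot match them until the documents are indexed again.

```python
def analyze(text, lowercase):
    """Split on whitespace and optionally lowercase, like a very small analyzer chain."""
    tokens = text.split()
    return [t.lower() for t in tokens] if lowercase else tokens


class ToyIndex:
    """Hypothetical in-memory inverted index: token -> set of document ids."""

    def __init__(self):
        self.postings = {}

    def add(self, doc_id, text, lowercase):
        for token in analyze(text, lowercase):
            self.postings.setdefault(token, set()).add(doc_id)

    def search(self, query, lowercase):
        hits = None
        for token in analyze(query, lowercase):
            docs = set(self.postings.get(token, set()))
            hits = docs if hits is None else hits & docs
        return hits or set()


# Indexed before the schema change: tokens stored as "San", "Jose".
old_index = ToyIndex()
old_index.add(1, "San Jose", lowercase=False)
stale_hits = old_index.search("san jose", lowercase=True)

# Re-indexed after the schema change: tokens stored as "san", "jose".
new_index = ToyIndex()
new_index.add(1, "San Jose", lowercase=True)
fresh_hits = new_index.search("SAN Jose", lowercase=True)
```

Here stale_hits is empty while fresh_hits contains document 1, which is the behaviour the reply describes.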

Re: how to do auto-suggest case-insensitive match and return original case field values

2009-12-08 Thread Uri Boness
Just updated SOLR-1625 to support regexp hints. https://issues.apache.org/jira/browse/SOLR-1625 Cheers, Uri Chris Hostetter wrote: : In my web application I want to set up auto-suggest as you type : functionality which will search case-insensitively yet return the original : case terms. It do

Case Insensitive search not working

2009-12-08 Thread insaneyogi3008
Hello, I tried to force case insensitive search by having the following setting in my schema.xml file which I guess is standard for Case sensitive searches : However when I perform searches on "San Jose" & "san jose" , I ge

Re: WELCOME to solr-user@lucene.apache.org

2009-12-08 Thread Chris Hostetter
(FYI: in the future please start a new thread with an appropriate subject line when you ask questions -- you probably would have gotten a lot more responses from people interested in Tika and SolrCell if they could tell that this email was about SolrCell) : I found that Tika read the html and ex

Re: # in query

2009-12-08 Thread Joel Nylund
ok, I just realized I was using the luke handler, didnt know there was a fat client, I assume thats what you are talking about. I downloaded the lukeall.jar, ran it, pointed to my index, found the document in question, didn't see how it was tokenized, but I clicked the "reconstruct & edit"

Re: Solr Admin XPath

2009-12-08 Thread Chris Hostetter
Wild shots in the dark: * remove the white space around the "=" characters * replace the single-quote characters with double quote characters : XPathExpression reqPerSec = xpath.compile("/solr/solr-info/QUERYHANDLER/entry[name = 'dismax']/stats/st...@name = 'avgRequestsPerSecond

indexing XML with solr example webapp - out of java heap space

2009-12-08 Thread Feroze Daud
Hi! I downloaded SOLR and am trying to index an XML file. This XML file is huge (500M). When I try to index it using the "post.jar" tool in example\exampledocs, I get a "out of java heap space" error in the SimplePostTool application. Any ideas how to fix this? Passing in "-Xms1024M" doe
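One general way around a heap limit with a very large update file is to post it in smaller pieces. The sketch below is an assumption-heavy illustration, not a diagnosis of this report: it assumes the file is in Solr's add format (an add element wrapping doc elements), and iter_batches is a hypothetical helper name. It streams the file and yields smaller add documents that could each be posted separately. Separately, on the JVM side, -Xms sets only the initial heap size; the maximum heap is controlled by -Xmx.

```python
import io
import xml.etree.ElementTree as ET


def iter_batches(source, batch_size):
    """Yield <add> documents (bytes), each holding up to batch_size <doc> elements."""
    batch = []
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == "doc":
            batch.append(ET.tostring(elem))
            elem.clear()  # drop the parsed children so memory use stays low
            if len(batch) == batch_size:
                yield b"<add>" + b"".join(batch) + b"</add>"
                batch = []
    if batch:
        yield b"<add>" + b"".join(batch) + b"</add>"


# Small in-memory stand-in for the large file.
sample = (
    b'<add>'
    b'<doc><field name="id">1</field></doc>'
    b'<doc><field name="id">2</field></doc>'
    b'<doc><field name="id">3</field></doc>'
    b'</add>'
)
batches = list(iter_batches(io.BytesIO(sample), batch_size=2))
doc_counts = [len(ET.fromstring(b).findall("doc")) for b in batches]
```

For a real file, source would be an open file object or a path, and each yielded batch would be written out or posted on its own.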

Re: java.lang.NoSuchMethodError: org.apache.commons.httpclient.HttpConnectionManager.getParams()Lorg/apache/commons/httpclient/params/HttpConnectionManagerParams;

2009-12-08 Thread Chris Hostetter
: Strangely i dont get this error when i execute this code from command line. : This error only occurs when i access it from a web application. Secondly, : this same method works fine with another web application. Both web ... : java.lang.NoSuchMethodError: : org.apache.commons.httpclient

Re: how to do auto-suggest case-insensitive match and return original case field values

2009-12-08 Thread Chris Hostetter
: In my web application I want to set up auto-suggest as you type : functionality which will search case-insensitively yet return the original : case terms. It doesn't seem like TermsComponent can do this as it can only : return the lowercase indexed terms your are searching against, not the

Re: KStem download

2009-12-08 Thread darniz
Hi Guys, I still have this problem. I got the fresh release of Apache Solr 1.4, I added the declaration of KStemmer in my schema.xml and put the two jar files under the \example\lib folder. I somehow think it's not able to find the solr home, looking at the error. If I make a nightly distribution build

RE: Facet query with special characters

2009-12-08 Thread Chris Hostetter
: Note that I am (supposed to be) indexing/searching without analysis : tokenization (if that's the correct term) - i.e. field values like : 'pds-comp.domain' shouldn't be (and I believe aren't) broken up as in : 'pds', 'comp' 'domain' etc. (e.g. using the 'text_ws' fieldtype). ... : Wh

Re: Exception encountered during replication on slave....Any clues?

2009-12-08 Thread William Pierce
Hi, Noble: When I hit the masterUrl from the slave box at http://localhost:8080/postingsmaster/replication I get the following xml response: - - 0 0 OK No command And then when I look in the logs, I see the exception that I mentioned. Wha

Re: # in query

2009-12-08 Thread Erick Erickson
In Luke, there's a tab that will let you go to a document ID. From there you can see all the fields in a particular document, and examine what the actual tokens stored are. Until and unless you know what tokens are being indexed, you simply can't know what your queries should look like... *Ass

Re: Tika and DIH integration (https://issues.apache.org/jira/browse/SOLR-1358)

2009-12-08 Thread Noble Paul നോബിള്‍ नोब्ळ्
we are very close to resolving SOLR-1358 . So you may be able to use it On Tue, Dec 8, 2009 at 5:32 PM, Jorg Heymans wrote: > Hi, > > I am looking into using Solr for indexing a large database that has > documents (mostly pdf and msoffice) stored as CLOBs in several tables. > It is my understandi

Re: # in query

2009-12-08 Thread Joel Nylund
Thanks Erick, I looked more into this, but still stuck: I have this field indexed using text_rev I looked at the luke analysis for this field, but I'm unsure how to read it. When I query the field by the id I get: − 5405255 ###'s test blog If I try to query even multiple ### I get n

Re: how to do auto-suggest case-insensitive match and return original case field values

2009-12-08 Thread hermida
Hi again, Just pinging again to any Solr experts out there... sorry that my previous message was a bit long (wanted to fully explain what I've already done and where the exact difficulty arises)... but to summarize: Does anyone know how to use Solr querying with faceting to do an auto-suggest th

Re: How to setup dynamic multicore replication

2009-12-08 Thread Thijs
Hi, Thanks. That was my second option. But I was hoping that the master and slaves could find that out for themselves. As now I have to also have my 'updater software' know about all the slaves (and maybe even their state). Which it previously had no idea about. This way I can't just plugin

Re: Multiple default search fields or catchall field?

2009-12-08 Thread Erick Erickson
See below. On Tue, Dec 8, 2009 at 3:48 AM, Thomas Koch wrote: > Hi, > > I'm indexing feeds and websites referenced by the feeds. So I have as text > fields: > title - from the feed entries title > description - from the feed entries description > text - the websites text > > When the user doesn'

RE: How to setup dynamic multicore replication

2009-12-08 Thread Joe Kessel
Hi, In my environment I create cores on the fly, then replicate the core to all of the slaves. I first create the core on the master and persist the solr.xml via the CoreAdmin API. I then do the same on each of my slaves. After loading / committing / optimizing the data on the master I sen
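As a rough sketch of the first step described above, the same CoreAdmin CREATE request can be built once and addressed to the master and to every slave. The action, name and instanceDir parameters belong to the CoreAdmin API; the host names, the /solr/admin/cores path and the core name are assumptions for illustration, and create_core_urls is a hypothetical helper.

```python
from urllib.parse import urlencode


def create_core_urls(hosts, core_name, instance_dir):
    """Return one CoreAdmin CREATE URL per host for the same core definition."""
    query = urlencode(
        {"action": "CREATE", "name": core_name, "instanceDir": instance_dir}
    )
    return [f"http://{host}/solr/admin/cores?{query}" for host in hosts]


# Hypothetical master and slave hosts.
hosts = ["master:8983", "slave1:8983", "slave2:8983"]
urls = create_core_urls(hosts, "books", "cores/books")
for u in urls:
    print(u)
```

Issuing the requests and persisting solr.xml are left out here, since how that is done depends on the deployment.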

RE: Facet query with special characters

2009-12-08 Thread Peter 4U
Hello Hoss, Many thanks for your answer. That's very interesting. So, are you saying this is an issue on the index side, rather than the query side? Note that I am (supposed to be) indexing/searching without analysis tokenization (if that's the correct term) - i.e. field values like 'pds

Re: edismax using bigrams instead of phrases?

2009-12-08 Thread Bill Dueber
On Mon, Dec 7, 2009 at 5:45 PM, Chris Hostetter wrote: > > it would be a mistake to have a "pf1" field that was an alias for "pf" ... > as it stands the "pf" parm in dismax is analogous to a "pf*" or > "pf-Infinity" > Of course -- I was... well, let's just pretend I was drunk. How about pfInf or

Tika and DIH integration (https://issues.apache.org/jira/browse/SOLR-1358)

2009-12-08 Thread Jorg Heymans
Hi, I am looking into using Solr for indexing a large database that has documents (mostly pdf and msoffice) stored as CLOBs in several tables. It is my understanding that the DIH as provided in Solr 1.4 cannot index these CLOBs yet, and that SOLR-1358 should provide exactly this. So i was wonderin

test

2009-12-08 Thread Thomas Senior
test

Re: How to setup dynamic multicore replication

2009-12-08 Thread Thijs
But the slave never gets the message that a core is created... at least not in my setup... So it never starts replicating... On 8-12-2009 12:13, Noble Paul നോബിള്‍ नोब्ळ् wrote: On Tue, Dec 8, 2009 at 2:43 PM, Thijs wrote: Hi I need some help setting up dynamic multicore replication. We ar

Re: How to setup dynamic multicore replication

2009-12-08 Thread Thijs
If I for example do: /select?q=type:book&facet=true&facet.mincount=0&facet.field=title the titles that are returned for the facet query also contains titles that are of type dvd. While I only want the unique titles for type book. On 8-12-2009 12:09, Shalin Shekhar Mangar wrote: On Tue, Dec 8

Re: Apache solr for multiple searches

2009-12-08 Thread Shalin Shekhar Mangar
On Tue, Dec 8, 2009 at 2:28 PM, regany wrote: > > > Bhuvi HN wrote: > > > > Can we have one single instance of the Apache Solr running for both the > > search like Job search and resume search. > > > > Yes, you want to run a multicore (multiple index) setup - see: > http://wiki.apache.org/solr/Co

Re: How to setup dynamic multicore replication

2009-12-08 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Tue, Dec 8, 2009 at 2:43 PM, Thijs wrote: > Hi > > I need some help setting up dynamic multicore replication. > > We are changing our setup from a replicated single core index with multiple > document types, as described on the wiki[1], to a dynamic multicore setup. > We need this so that we ca

Re: How to setup dynamic multicore replication

2009-12-08 Thread Shalin Shekhar Mangar
On Tue, Dec 8, 2009 at 2:43 PM, Thijs wrote: > Hi > > I need some help setting up dynamic multicore replication. > > We are changing our setup from a replicated single core index with multiple > document types, as described on the wiki[1], to a dynamic multicore setup. > We need this so that we c

RE: why no results?

2009-12-08 Thread Jaco Olivier
Hi, Try changing your TEXT field to type "text" (without the of course :)) That is your problem... also use the "text" type as per default examples with SOLR distro :) Jaco Olivier -Original Message- From: regany [mailto:re...@newzealand.co.nz] Sent: 08 December 2009 05:

RE: why no results?

2009-12-08 Thread Jaco Olivier
Hi Regan, I am using STRING fields only for values that in most cases will be used to FACET on.. I suggest using TEXT fields as per the default examples... ALSO, remember that if you do not specify the " solr.LowerCaseFilterFactory " that your search has just become case sensitive.. I struggled w

How to setup dynamic multicore replication

2009-12-08 Thread Thijs
Hi I need some help setting up dynamic multicore replication. We are changing our setup from a replicated single core index with multiple document types, as described on the wiki[1], to a dynamic multicore setup. We need this so that we can display facets with a zero count that are unique to

Re: Apache solr for multiple searches

2009-12-08 Thread regany
Bhuvi HN wrote: > > Can we have one single instance of the Apache Solr running for both the > search like Job search and resume search. > Yes, you want to run a multicore (multiple index) setup - see: http://wiki.apache.org/solr/CoreAdmin

Multiple default search fields or catchall field?

2009-12-08 Thread Thomas Koch
Hi, I'm indexing feeds and websites referenced by the feeds. So I have as text fields: title - from the feed entries title description - from the feed entries description text - the websites text When the user doesn't define a default search field, then all three fields should be used for searc