Re: Status of Spelt integration

2009-12-07 Thread freerk55
The standard spell checker of Thunderbird works in eGroupware. But not in Felamimail!!?? Why not? How can I get it working as it does in the rest of eGroupware? Freerk Jongsma Toby Cole-2 wrote: > > Hi Andrew, > We ended up abandoning the spelt integration as the built in solr > spell

Re: Status of Spelt integration

2009-12-07 Thread Toby Cole
I'm pretty sure this isn't a Solr related question. Have you tried asking on the eGroupware mailing lists? http://sourceforge.net/mail/?group_id=78745 Toby. On 7 Dec 2009, at 08:52, freerk55 wrote: The standard spell checker of Thunderbird works in eGroupware. But not in Felamimail!!?? Why no

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Koji Sekiguchi
Robin Wojciki wrote: Koji, I was able to create a minimal replication. Attached zip has solr.xml, solrconf.xml and Main.java. I was able to replicate the issue by replacing the conf files in apache-solr-1.4.0/example/solr/conf and running the class Main. Could please confirm if this replication

Re: Question about the message "Indexing failed. Rolled back all changes."

2009-12-07 Thread yountod
That was it! Thank you for the tip. To clarify for other beginners: Create a blank file called dataimport.properties in your conf directory and don't forget to make sure the system has write access to it. Lance Norskog-2 wrote: > > This is definitely a bug. Please open a JIRA issue for thi

Multiindexing

2009-12-07 Thread Jörg Agatz
Hi Users.. i need help with Multiindexing in Solr, i want one Core, and 3 to 5 diferent indizes. So i can search in simultan in all or in some of them. i find the Help im WIKI.. but it dosent Help. http://wiki.apache.org/solr/MultipleIndexes?highlight=%28multi%29 there stand nothing about Multiin

DIH Updating

2009-12-07 Thread Lee Smith
Hello All Sorry newbie Q. Im looking at using the Data Import Handler to add my data to solr. But I am a little confused how I go about updating the index. I understand there is no update index so just a delete replace but how will solr know what to remove and add ? Also hope someone does

Solr Search in stemmed and non stemmed mode

2009-12-07 Thread khalid y
Hi !! I'm looking for a way to have two index in solr one stemmed and another non stemmed. Why ? It's simple :-) My users can do query for : - banking marketing => it return all document matches bank*** and market*** - "banking" marketing => it return all document matches "banking" and market*

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
Koji, In the sample I sent, the exception comes only if the HTMLStripCharFilter is there. However, your test case seems to capture the essence. Sorry if I sent you on a wild goose chase. Thanks for taking the time! I will log a ticket. Robin On Mon, Dec 7, 2009 at 5:09 PM, Koji Sekiguchi wrote

RE: search on tomcat server

2009-12-07 Thread Jill Han
In fact, I just followed the instructions titled as Tomcat On Windows. Here are the updates on my computer 1. -Dsolr.solr.home=C:\solr\example 2. change dataDir to C:\solr\example\data in solrconfig.xml at C:\solr\example\conf 3. created solr.xml at C:\Tomcat 5.5\conf\Catalina\localhost I r

Re: question about schemas

2009-12-07 Thread solr-user
Lance Norskog-2 wrote: > > You can make a separate facet field which contains a range of "buckets": > 10, 20, 50, or 100 means that the field has a value 0-10, 11-20, 21-50, or > 51-100. You could use a separate filter query with values for these > buckets. Filter queries are very fast in Solr 1

Re: DIH Updating

2009-12-07 Thread Erick Erickson
The short form is that you must provide and identify a unique key (analogous to a database PK). See http://wiki.apache.org/solr/UniqueKey There's an example here: http://wiki.apache.org/solr/DataImportHandler Bu

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
Logged a ticket for Solr: https://issues.apache.org/jira/browse/SOLR-1630 Thanks, Robin On Mon, Dec 7, 2009 at 9:36 PM, Robin Wojciki wrote: > Koji, > > In the sample I sent, the exception comes only if the > HTMLStripCharFilter is there. > > However, your test case seems to capture the essence.

Re: Solr Search in stemmed and non stemmed mode

2009-12-07 Thread Erick Erickson
Try searching the mail archive for stemmer exact match or similar, this has been discussed multiple times and you'll get more complete discussions wy faster One suggestion is to use two fields, one for the stemmed version and one for the original, then use whichever field you need to via D

Re: Solr Search in stemmed and non stemmed mode

2009-12-07 Thread khalid y
Thanks, I'll read the mail archive. Your suggestion is like mine but whitout the DisMax handler. I'm going to read what is this handler. I have one field text and another text_unstemmed where I copy all others fields. I'm writing my custom query handler who check if quotes exists and switch betwe

Re: question about schemas (and SOLR-1131?)

2009-12-07 Thread solr-user
wojtekpia wrote: > > Could this be solved with a multi-valued custom field type (including a > custom comparator)? The OP's situation deals with multi-valuing products > for each customer. If products contain strictly numeric fields then it > seems like a custom field implementation (or extensio

RE: Embedded for write, HTTP for read - cache aging

2009-12-07 Thread Peter 4U
Hi Erik, Thanks for your answer. Yes, I've done an /update to the http server, which certainly works as far as the 'reading' goes. This sends the update to the back-end index though, which essentially defeats the purpose of having the embedded instance do the write (as writes are always

Re: Multiple Solr Instances - Multiple Jetty Instances

2009-12-07 Thread Smiley, David W.
If you have many documents (say > 10M documents, probably a larger threshold) then you will benefit from sharding your index, i.e. splitting your index up into multiple cores and using distributed searches. You could use one VM and multiple cores just fine, assuming you have multiple CPUs. If

RE: search on tomcat server

2009-12-07 Thread Sascha Szott
Hi Jill, just to make sure your index contains at least one document, what is the output of Best, Sascha Jill Han wrote: > In fact, I just followed the instructions titled as Tomcat On Windows. > Here are the updates on my

Re: Multiple Solr Instances - Multiple Jetty Instances

2009-12-07 Thread Amit Nithian
David thanks for your response. With that having been said, is there a general ratio of the number of Tomcat/Jetty HTTP threads to allocate relative to the number of CPU cores you have on your machine? Is the default in Tomcat/Jetty acceptable? Thanks again Amit On Mon, Dec 7, 2009 at 10:00 AM,

Re: [SolrResourceLoader] Unable to load cached class-name

2009-12-07 Thread Chris Hostetter
: Ok, I'm not sure where my particular use of Solr fits into all this. : I'm writing a log4j appender that adds each log entry to a Solr index. : It's not really a Solr plugin. So if i'm understanding correctly, you want to run a "stock" solr server, with none of your own custom code in that ser

Re: comparing index-time boost and sort in the case of a date field

2009-12-07 Thread Chris Hostetter
: : I have a requirement where I need to display records with more recent values : for approval_dt to come first when a query is made. I thought of approaching : this in 2 different ways:- ... : 2. INDEX-TIME boosting. : I sorted the query from databse itself in asc order of approval_dt

# in query

2009-12-07 Thread Joel Nylund
Hi, How can I put a # sign in a query, do I need to escape it? For example I want to query books with title that contain # No work so far: http://localhost:8983/solr/select?q=textTitle:"#"; http://localhost:8983/solr/select?q=textTitle:# http://localhost:8983/solr/select?q=textTitle:"\#"; Gett

RE: SolrPlugin Guidance

2009-12-07 Thread Chris Hostetter
: e.g. For the following query that looks for a file in a directory: : q=+directory_name:"myDirectory" +file_name:"myFile" : : We'd need to decompose the query into the following two queries: : 1. q=+directory_name:"myDirectory"&fl=directory_id : 2. q=+file_name:"myFile" +directory_id:(results fr

Re: # in query

2009-12-07 Thread Paul Libbrecht
Sure you have to escape it! %23 otherwise the browser considers it as a separator between the URL for the server (on the left) and the fragment identifier (on the right) which is not sent the server. You might want to read about "URL-encoding", escaping with backslash is a shell-thing, no

Re: # in query

2009-12-07 Thread Joel Nylund
ok thanks, sorry my brain wasn't working, but even when I url encode it, I dont get any results, is there something special I have to do for solr? thanks Joel On Dec 7, 2009, at 3:20 PM, Paul Libbrecht wrote: Sure you have to escape it! %23 otherwise the browser considers it as a separat

Re: # in query

2009-12-07 Thread Erick Erickson
Well, the very first thing I would is examine the field definition in your schema file. I suspect that the tokenizers and/or filters you're using for indexing and/or querying is doing something to the # symbol. Most likely stripping it. If you're just searching for the single-letter term "#", I *th

Re: Question regarding scoring/boosting

2009-12-07 Thread Chris Hostetter
Unfortunately understanding how Lucene/Scoring works isn't much of a beginer level topic -- the short answer to your question is that adding a function to the "bf" param of dismax causees that function to be evaluated for every doc that matches your main query and the scores are "boosted" in

Re: Solr plugin or something else for custom work?

2009-12-07 Thread Chris Hostetter
What you are describing corrisponds pretty closely to some work currently in progress to make the DataImportHandler integrate with the ExtractingRequestHandler/Tika ... https://issues.apache.org/jira/browse/SOLR-1358 ...in the meantime, your options are either to extract all the metad

Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
Folks: I am seeing this exception in my logs that is causing my replication to fail. I start with a clean slate (empty data directory). I index the data on the postingsmaster using the dataimport handler and it succeeds. When the replication slave attempts to replicate it encounters this

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread TCK
are you missing the port number in the master's url ? -tck On Mon, Dec 7, 2009 at 4:44 PM, William Pierce wrote: > Folks: > > I am seeing this exception in my logs that is causing my replication to > fail.I start with a clean slate (empty data directory). I index the > data on the postin

Oddly slow replication

2009-12-07 Thread Simon Wistow
I have a Master server with two Slaves populated via Solr 1.4 native replication. Slave1 syncs at a respectable speed i.e around 100MB/s but Slave2 runs much, much slower - the peak I've seen is 56KB/s. Both are running off the same hardware with the same config - compression is set to 'intern

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
tck, thanks for your quick response. I am running on the default port (8080). If I copy that exact string given in the masterUrl and execute it in the browser I get a response from solr: - - 0 0 OK No command So the masterUrl is reachable/accessible so far as I am able to tell...

Re: Response writer configs

2009-12-07 Thread Chris Hostetter
: I guess we should remove this commented response writers from the : example solrconfig. It adds no value. The comment tried to make it clear that it was showing what writers were enabled by default. But i changed it to be more in line with what we have for search components. -Hoss

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
Just to make doubly sure, per tck's suggestion, I went in and explicitly added in the port in the masterurl so that it now reads: http://localhost:8080/postingsmaster/replication Still getting the same exception... I am running solr 1.4, on Ubuntu karmic, using tomcat 6 and Java 1.6. Thanks

how to set CORE when using Apache Solr extension?

2009-12-07 Thread regany
Hello, Can anyone tell me how you set which Solr CORE to use when using the Apache Solr extension? (Using Solr with multicores) http://www.php.net/manual/en/book.solr.php thanks, regan -- View this message in context: http://old.nabble.com/how-to-set-CORE-when-using-Apache-Solr-extension--tp26

Re: Windows 7 / Java 64bit / solr 1.4 - solr.solr.home problem

2009-12-07 Thread Chris Hostetter
: I tried it on Vista 32 & Java 1.6.0_17-b04 and it works without a problem. : Actually on all other computers in the office there is no problem - I am the : only one using Windows 7 : : I did try with full path and it didn't work as well. Here's the result: by the looks of it, your problem is h

Re: Stopping & Starting

2009-12-07 Thread regany
Lee Smith-6 wrote: > > So how can I stop and restart the service ? > > Hope you can help get me going again. > > Thank you > Lee > I found this shell script which works well for me... #!/bin/sh -e # Starts, stops, and restarts solr SOLR_DIR="/usr/local/solr/example" JAVA_OPTIONS="-Xmx10

Re: edismax using bigrams instead of phrases?

2009-12-07 Thread Chris Hostetter
: > I've started trying edismax, and have noticed that my relevancy ranking is : > messed up with edismax because, according to the debug output, it's using : > bigrams instead of phrases and inexplicably ignoring a couple of the pf I noticed that aw well while testing edismax on the train the ot

Re: edismax using bigrams instead of phrases?

2009-12-07 Thread Chris Hostetter
: I see that edismax already defines pf (bigrams) and pf3 (trigrams) -- how : would folks think about just calling them pf / pf1 (aliases for each : other?), pf2, and pf3? The pf would then behave exactly as it does in : dismax. changing edismax's current pasing logic to be applied to a "pf2" par

Re: latency in solr response is observed after index is updated

2009-12-07 Thread Chris Hostetter
: We are observing latency (some times huge latency upto 10-20 secs) in solr : response after index is updated . whats the reason of this latency and how : can it be minimized ? : Note: our index size is pretty large. Please read the following wiki pages... http://wiki.apache.org/solr/Solr

Re: Facet query with special characters

2009-12-07 Thread Chris Hostetter
: When performing a facet query where part of the value portion has a : special character (a minus sign in this case), the query returns zero : results unless I put a wildcard (*) at the end. check your analysis configuration for this fieldtype, in particular look at what debugQuery produces

Re: NullPointerException thrown during updates to index

2009-12-07 Thread Chris Hostetter
: Hi, : I'm running a distributed solr index (3 nodes) and have noticed frequent : exceptions thrown during updates. The exception (see below for full trace) what do you mean "during updates" ? ... QueryComponent isn't used at all when updating hte index, so there may be a missunderstanding here

why no results?

2009-12-07 Thread regany
hi all - newbie solr question - I've indexed some documents and can search / receive results using the following schema - BUT ONLY when searching on the "id" field. If I try searching on the title, subtitle, body or text field I receive NO results. Very confused. :confused: Can anyone see anything

Re: why no results?

2009-12-07 Thread Tom Hill
Hi - That's a common one to get bit by. The string On Mon, Dec 7, 2009 at 7:44 PM, regany wrote: > > hi all - newbie solr question - I've indexed some documents and can search > / > receive results using the following schema - BUT ONLY when searching on the > "id" field. If I try searching on t

Re: why no results?

2009-12-07 Thread regany
Tom Hill-7 wrote: > > That's a common one to get bit by. The string > You lost me Tom? I Think your message got cut off. I'm guessing something to do with the "string" type?? -- View this message in context: http://old.nabble.com/why-no-results--tp26688249p26688295.html Sent from the Solr -

Re: why no results?

2009-12-07 Thread Tom Hill
Sorry, just discovered a keyboard shortcut for "send". :-) That's a common one to get bit by. The fieldtype StrField indexes the entire field as one item. So you can only find it if your search term is everything in the field. That is, "fox" will not find "The Quick Brown Fox", because it's not th

Re: why no results?

2009-12-07 Thread regany
Tom Hill-7 wrote: > > Try solr.TextField instead. > Thanks Tom, I've replaced the section above with... deleted my index, restarted Solr and re-indexed my documents - but the search still returns nothing. Do I need to change the type in the sections as well? regan -- View

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
are you able to hit the http://localhost:8080/postingsmaster/replication using a browser from the slave box. if you are able to hit it what do you see? On Tue, Dec 8, 2009 at 3:42 AM, William Pierce wrote: > Just to make doubly sure,  per tck's suggestion,  I went in and explicitly > added in th

Re: how to set CORE when using Apache Solr extension?

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
the core is a part of the uri http://host:port///select say if the core name is core1 and solr app name is solr deployed at port 8983 then it would look like http://host:8983/solr/core1/select On Tue, Dec 8, 2009 at 3:44 AM, regany wrote: > > Hello, > > Can anyone tell me how you set which Solr

Re: Oddly slow replication

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
this has to be a network problem . We have never encountered such vastly different speeds in the same LAN. On Tue, Dec 8, 2009 at 3:22 AM, Simon Wistow wrote: > I have a Master server with two Slaves populated via Solr 1.4 native > replication. > > Slave1 syncs at a respectable speed i.e around 1

Replicating multiple cores

2009-12-07 Thread Jason Rutherglen
If I've got multiple cores on a server, I guess I need multiple rsyncd's running (if using the shell scripts)?

Re: Replicating multiple cores

2009-12-07 Thread Shalin Shekhar Mangar
On Tue, Dec 8, 2009 at 11:48 AM, Jason Rutherglen < jason.rutherg...@gmail.com> wrote: > If I've got multiple cores on a server, I guess I need multiple > rsyncd's running (if using the shell scripts)? > Yes. I'd highly recommend using the Java replication though. -- Regards, Shalin Shekhar Man