RE: fast update handlers

2007-05-10 Thread Chris Hostetter
: want to add docs every 2 seconds all while doing queries. If I do : commits every 2 seconds I basically lose any caching advantage and my : faceting performance goes down the tube. If however, I were to add : things to a smaller index and then roll it into the larger one every ~30 : minutes th

New user - indexing problems

2007-05-10 Thread Gary Browne
Hi I'll probably be posting a bunch of stupid questions in the near future, so bear with me. I'm finding the documentation a little confusing. For starters, I've got Solr up and running under Tomcat on port 8080, and I can pull up the admin page, no problems. I'm running on RHEL AS 4, with curl

Re: Solr Sorting, merging/weighting sort fields

2007-05-10 Thread Walter Underwood
The boost is a way to adjust the weight of that field, just like you adjust the weight of any other field. If the boost is dominating the score, reduce the weight and vice versa. wunder On 5/10/07 9:22 PM, "Chris Hostetter" <[EMAIL PROTECTED]> wrote: > > : Is this correct? bf is a boosting fun

Re: Solr Sorting, merging/weighting sort fields

2007-05-10 Thread Chris Hostetter
: I actually used the _val_ hack, so query foo:bar : _val_:"ord(popularity)^0.5", still playing with it though it's still : skewing results a bit too much. If your query string is... foo:bar _val_:ord(popularity)^0.5 ..then part of your problem may be that (unless you've changed the default o

Re: Solr Sorting, merging/weighting sort fields

2007-05-10 Thread Chris Hostetter
: Is this correct? bf is a boosting function, so a function is needed there, no? : If I'm not missing something, the ^0.5 is just a boost, and "popularity" : is just a (numeric) field. So boosting a numeric field wouldn't make : sense, but applying it to a function would. Am I missing something?

can i modifie date format

2007-05-10 Thread James liu
The MS SQL database has one date format, Solr has another, and the web page shows yet another. Why not let the user configure the date format, with Solr reading a date format rule, maybe like this: http://cn2.php.net/manual/en/function.date.php Is the Solr 1.1 date format now /MM/DD H:I:S? -- regards jl

Re: Question about delete

2007-05-10 Thread Chris Hostetter
: Closing an IndexReader only flushes the list of deleted docids to the : index... it doesn't actually delete them. Deletions only happen when : the deleted docs segment is involved in a merge, or when an optimize : is done (which is a merge of all segments). just to clarify slightly because "de

RE: cwd requirement to run Solr with Tomcat

2007-05-10 Thread Chris Hostetter
that section was never really intended to be *the* set of instructions for installing Solr on Tomcat, just the *simplest* set of things you could do to see it working. Many additional things could be done (besides deleting the unzipped dir). If we start listing more things, people may get confuse

Re: Solr concurrent commit not updated

2007-05-10 Thread James liu
You should know that id is a unique number. 2007/5/11, David Xiao <[EMAIL PROTECTED]>: Hello all, I have tested by use post.sh in example directory to add xml documents into solr. It works when I add one by one. But when I have a lot of .xml file to be posted (say about 500-1000 files) and I wrote a

Solr concurrent commit not updated

2007-05-10 Thread David Xiao
Hello all, I have tested by use post.sh in example directory to add xml documents into solr. It works when I add one by one. But when I have a lot of .xml file to be posted (say about 500-1000 files) and I wrote a shell script to call post.sh one by one. I found those xml files are not sear

Re: Solr Sorting, merging/weighting sort fields

2007-05-10 Thread Nick Jenkin
Hi Otis I used a boosting function, bf pointed me in the right direction :) I actually used the _val_ hack, so query foo:bar _val_:"ord(popularity)^0.5", still playing with it though it's still skewing results a bit too much. -Nick On 5/11/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Is this c

Re: Question about delete

2007-05-10 Thread James liu
get it. thks yonik. 2007/5/10, Yonik Seeley <[EMAIL PROTECTED]>: On 5/10/07, Ajanta Phatak <[EMAIL PROTECTED]> wrote: > I believe in lucene at least deleting documents only marks them for > deletion. The actual delete happens only after closing the IndexReader. > Not sure about Solr Closing an

Re: cwd requirement to run Solr with Tomcat

2007-05-10 Thread James liu
I don't know of a simple example. I rename dist/apache*.war to solr.war, use only the example/solr directory, and define it as described in http://wiki.apache.org/solr/SolrTomcat Multiple Solr Webapps 2007/5/11, Teruhiko Kurosaka <[EMAIL PROTECTED]>: BTW, The Simple Example Install section in http://w

Re: Index Concurrency

2007-05-10 Thread joestelmach
> Yes, coordination between the main index searcher, the index writer, > and the index reader needed to delete other documents. Can you point me to any documentation/code that describes this implementation? > That's weird... I've never seen that. > The lucene write lock is only obtained when th

Re: Does solr support index which made by lucene 1.4.3

2007-05-10 Thread James liu
This information when i use solr admin to query '中国' 0 313 on 0 中国 10 2.2 java.io.IOException: read past EOF at org.apache.lucene.store.BufferedIndexInput.refill(BufferedIndexInput.java:89) at org.apache.lucene.store.BufferedIndexInput.readByte(BufferedInd

Re: Index Concurrency

2007-05-10 Thread Otis Gospodnetic
Though, isn't there a recent patch to allow multiple indices under a single Solr instance in JIRA? Otis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simpy -- http://www.simpy.com/ - Tag - Search - Share - Original Message From: Yonik Seeley <[EMAIL PROTECTED]> To: s

Re: Solr Sorting, merging/weighting sort fields

2007-05-10 Thread Otis Gospodnetic
Is this correct? bf is a boosting function, so a function is needed there, no? e.g. ord(popularity)^0.5 recip(rord(price),1,1000,1000)^0.3 If I'm not missing something, the ^0.5 is just a boost, and "popularity" is just a (numeric) field. So boosting a numeric field wouldn't make
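For readers unfamiliar with the function syntax quoted in this message, here is a minimal sketch of how recip behaves, assuming the definition recip(x,m,a,b) = a/(m*x+b) from Solr's function query documentation. The reverse-ordinal inputs are made up for illustration, and the ^0.3 boost is treated as a plain multiplier, which ignores query normalization.

```python
def recip(x: float, m: float, a: float, b: float) -> float:
    """Reciprocal function a / (m*x + b), the shape used by recip(rord(price),1,1000,1000)."""
    return a / (m * x + b)


# Hypothetical reverse-ordinal values, chosen only to show the curve.
for rord_value in (0, 1000, 9000):
    # Apply the ^0.3 boost as a simple multiplier (an approximation).
    weighted = 0.3 * recip(rord_value, 1, 1000, 1000)
    print(rord_value, round(weighted, 3))
```

With m=1 and a=b=1000 the function starts at 1.0 and halves once the input reaches 1000, so the boost mainly controls how much of that curve reaches the final score.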

Re: Requests per second/minute monitor?

2007-05-10 Thread Ian Holsman
Walter Underwood wrote: Yes, that is possible, but we also monitor Apache, Tomcat, the JVM, and OS through JMX and other live monitoring interfaces. Why invent a real-time HTTP log analysis system when I can fetch /search/stats.jsp at any time? "there are lies, damned lies, and statistics" Th

RE: Facet only support english?

2007-05-10 Thread Teruhiko Kurosaka
If my memory is correct, UTF-8 has been the default encoding per XML specification from a very early stage. If the XML parser is not defaulting to UTF-8 in absence of the encoding attribute, that means the XML parser has a bug, and the code should be corrected. (I don't have an objection to add

RE: dates & times

2007-05-10 Thread Binkley, Peter
Regarding Hoss's points about the internal format, resolution of date-times, etc.: maybe a good starting point would be to implement the date-time algorithms of XML Schema (http://www.w3.org/TR/xmlschema-2/#isoformats), where these behaviors are spelled out in reasonably precise terms. There must b

RE: Does Solr XSL writer work with Arabic text?

2007-05-10 Thread Teruhiko Kurosaka
Yes, that is it! Thank you, Brian. I've filed SOLR-233. https://issues.apache.org/jira/browse/SOLR-233 -kuro > -Original Message- > From: Brian Whitman > Sent: Thursday, May 10, 2007 1:19 PM > To: solr-user@lucene.apache.org > Subject: Re: Does Solr XSL writer work with Arabic text? > >

Re: Costume response writer

2007-05-10 Thread Debra
hossman_lucene wrote: > > > can you clarify: > > 1) which version of Solr you are using (the "Solr Implementation Version" > from /admin/registry.jsp gives the best answer) > > ... > > > -Hoss > > > Just downloaded the latest nightly build and voilà, it's back on track (with the other

Re: dates & times

2007-05-10 Thread Ryan McKinley
(In general a "DateTranslatingTokenFilter" class would be a pretty cool addition to Lucene, it could take as constructor args two DateFormatters (one for parsing the incoming tokens, and one for formatting the outgoing If this happens, it would be nice (perhaps overkill) to have a "chronic" input fi

Re: dates & times

2007-05-10 Thread Chris Hostetter
: The right approach for more flexible date parsing is probably to add : more functionality to the date field and configure via optional : attributes. Adding configuration options to DateField seems like it might ultimately be the right choice for changing the *internal* format, but assuming we w

Re: Does Solr XSL writer work with Arabic text?

2007-05-10 Thread Brian Whitman
In example.xsl change the output type to And see if that helps. I had the same problem (different language.) If this works we should file a JIRA to fix it up in trunk. On May 10, 2007, at 4:13 PM, Teruhiko Kurosaka wrote: I'm trying to search an index of docs which have text fie

Does Solr XSL writer work with Arabic text?

2007-05-10 Thread Teruhiko Kurosaka
I'm trying to search an index of docs which have text fields in Arabic, using XSL writer (wt=xslt&tr=example.xsl). But the Arabic text gets all garbled. Is XSL writer known to work for Arabic text? Is anybody using it? -kuro

RE: cwd requirement to run Solr with Tomcat

2007-05-10 Thread Teruhiko Kurosaka
BTW, The Simple Example Install section in http://wiki.apache.org/solr/SolrTomcat leaves the unzipped directory apache-solr-nightly-incubating intact, but this is not needed after copying the solr.war and the example solr directory, is it? Can I edit the instruction to insert: rm -r apache-solr-ni

Re: dates & times

2007-05-10 Thread Yonik Seeley
On 5/10/07, Brian Whitman <[EMAIL PROTECTED]> wrote: On May 10, 2007, at 2:30 PM, Chris Hostetter wrote: > Questions like these are why I'm glad Solr currently keeps it > simple and > makes people deal in absolutes .. less room for confusion :) I get all that, thanks for the great explanation.

Re: dates & times

2007-05-10 Thread Brian Whitman
On May 10, 2007, at 2:30 PM, Chris Hostetter wrote: Questions like these are why I'm glad Solr currently keeps it simple and makes people deal in absolutes .. less room for confusion :) I get all that, thanks for the great explanation. I imagine most of my problems can be solved with a cus

RE: dates & times

2007-05-10 Thread Binkley, Peter
Minor clarification re the exslt license: that applies to the external exslt implementations, which you only need if your xsl engine doesn't support exslt natively. Since Xalan does, at least mostly, it's all already there in Solr. I agree that more flexible date-time parsing of input to Solr is a

Re: Costume response writer

2007-05-10 Thread Chris Hostetter
: INFO: adding queryResponseWriter : jdbc=com.lss.search.request.JDBCResponseWriter : 10/05/2007 21:11:44 org.apache.solr.core.SolrCore execute : INFO: null jdsn=4&start=0&q=white&wt=jdbc&qt=standard&rows=90 0 1442 that's very strange ... the only thing that jumps out at me is the "null" there w

Re: dates & times

2007-05-10 Thread Chris Hostetter
: It's more than string processing, anyway. I would want to convert the : Solr Time 2007-03-15T00:41:5:2Z to "March 15th, 2007" in a web app. : I'd also like to say 'Posted 3 days ago." In my vision of things, : that work is done on Solr's side. (The former case with a strftime : type formatter in

Re: Costume response writer

2007-05-10 Thread Debra
This is from the log: ... INFO: adding queryResponseWriter jdbc=com.lss.search.request.JDBCResponseWriter 10/05/2007 21:11:39 com.lss.search.request.JDBCResponseWriter init INFO: Init JDBC reponse writer //This is added from the ini of the class to see that it's actually finding the right one

Re: dates & times

2007-05-10 Thread Brian Whitman
You can get at some of this functionality in the built-in xslt 1.0 engine (Xalan) by using the e-xslt date-time extensions: see http://exslt.org/date/index.html, and for Xalan's implementation see http://xml.apache.org/xalan-j/extensionslib.html#exslt . The exslt stuff looks good, thanks! I'll h

RE: dates & times

2007-05-10 Thread Binkley, Peter
You can get at some of this functionality in the built-in xslt 1.0 engine (Xalan) by using the e-xslt date-time extensions: see http://exslt.org/date/index.html, and for Xalan's implementation see http://xml.apache.org/xalan-j/extensionslib.html#exslt . There are some examples here: http://www-128.

Re: dates & times

2007-05-10 Thread Brian Whitman
Those are interesting ideas and it probably would not be difficult to create a patch if you were interested, but I'm curious: What about XSL makes what seems to me an elementary string-processing task so difficult? Well, XSL 1.0 (which is the one that "comes for free" with Solr/java) doesn'

Re: dates & times

2007-05-10 Thread Mike Klaas
On 5/10/07, Brian Whitman <[EMAIL PROTECTED]> wrote: After writing my 3rd parser in my third scripting language in so many months to go from unix timestamps to "Solr Time" (8601) I have to ask: shouldn't the date/time field type be more resilient? I assume there's a good reason that it's 8601 int

Re: fast update handlers

2007-05-10 Thread Ryan McKinley
I don't know if this helps, but... Do *all* your queries need to include the fast updates? I have a setup where there are some cases that need the newest stuff but most cases can wait 5 mins (or so) In that case, I have two solr instances pointing to the same index files. One is used for

dates & times

2007-05-10 Thread Brian Whitman
After writing my 3rd parser in my third scripting language in so many months to go from unix timestamps to "Solr Time" (8601) I have to ask: shouldn't the date/time field type be more resilient? I assume there's a good reason that it's 8601 internally, but certainly it would be excellent fo
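The conversion this message describes can be sketched in a few lines. This assumes the target is the UTC form Solr's date field expects (for example 1995-12-31T23:59:59Z); the function name is hypothetical and not part of Solr.

```python
from datetime import datetime, timezone


def unix_to_solr_date(ts: int) -> str:
    """Format a Unix timestamp (seconds) as an ISO 8601 UTC string with a trailing Z."""
    return datetime.fromtimestamp(ts, tz=timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


# Example input: midnight UTC on the date of this thread.
print(unix_to_solr_date(1178755200))
```

Passing an explicit UTC timezone keeps the result independent of the machine's local time zone, which is the usual source of off-by-hours errors in hand-written parsers.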

Re: Requests per second/minute monitor?

2007-05-10 Thread Walter Underwood
Yes, that is possible, but we also monitor Apache, Tomcat, the JVM, and OS through JMX and other live monitoring interfaces. Why invent a real-time HTTP log analysis system when I can fetch /search/stats.jsp at any time? By "number of rows fetched", do you mean "number of documents matched"? The

Re: Costume response writer

2007-05-10 Thread Yonik Seeley
On 5/10/07, Debra <[EMAIL PROTECTED]> wrote: I have written a custom response writer and added the response writer to solrconfig.xml When I run a program I can see the custom response writer is initialized, but when I run a search with the custom writer's name as the wt parameter the search

Re: Question about delete

2007-05-10 Thread Yonik Seeley
On 5/10/07, Ajanta Phatak <[EMAIL PROTECTED]> wrote: I believe in lucene at least deleting documents only marks them for deletion. The actual delete happens only after closing the IndexReader. Not sure about Solr Closing an IndexReader only flushes the list of deleted docids to the index... it

Re: Does solr support index which made by lucene 1.4.3

2007-05-10 Thread James liu
If you mean that Can't find resource 'solrconfig.xml' in classpath or 'solr/conf/', cwd=C:\Tomcat 6.0 I'm sure solr/conf/solrconfig.xml existed, and I'm sure the schema was configured to match it. Some of the error information is because I deleted solr11,solr12..xml; I just use solr1, and I use solr1 admi

RE: fast update handlers

2007-05-10 Thread Will Johnson
The problem is I want the newly added documents to be made searchable every 1-2 seconds so I need the commits. I was hoping that the caches could be stored/tied to the IndexSearcher then a MultiSearcher could take advantage of the multiple sub indexes and their respective caches. I think the bes

RE: fast update handlers

2007-05-10 Thread Charlie Jackson
What about issuing separate commits to the index on a regularly scheduled basis? For example, you add documents to the index every 2 seconds, or however often, but these operations don't commit. Instead, you have a cron'd script or something that just issues a commit every 5 or 10 minutes or whatev
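A minimal sketch of the scheduled-commit idea in this message, assuming Solr's XML update handler is reachable at /solr/update and accepts a `<commit/>` message the way the example post.sh sends it. The host, port, and function names are assumptions for illustration.

```python
import urllib.request

# Assumed location of the update handler; adjust for the actual deployment.
SOLR_UPDATE_URL = "http://localhost:8080/solr/update"


def build_commit_request(url: str = SOLR_UPDATE_URL) -> urllib.request.Request:
    """Build (but do not send) an HTTP POST carrying a <commit/> message."""
    return urllib.request.Request(
        url,
        data=b"<commit/>",
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


def commit(url: str = SOLR_UPDATE_URL) -> int:
    """Send the commit and return the HTTP status code."""
    with urllib.request.urlopen(build_commit_request(url), timeout=30) as resp:
        return resp.status
```

A cron entry could call commit() every 5 or 10 minutes while the indexing process keeps sending add requests without committing, so cache invalidation happens on the commit schedule rather than on every add.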

Re: Question about delete

2007-05-10 Thread Ajanta Phatak
I believe in lucene at least deleting documents only marks them for deletion. The actual delete happens only after closing the IndexReader. Not sure about Solr Ajanta. James liu wrote: but index file size not changed and maxDoc not changed. > 2007/5/10, Nick Jenkin <[EMAIL PROTECTED]>:

Re: fast update handlers

2007-05-10 Thread Yonik Seeley
On 5/10/07, Will Johnson <[EMAIL PROTECTED]> wrote: I guess I was more concerned with doing the frequent commits and how that would affect the caches. Say I have 2M docs in my main index but I want to add docs every 2 seconds all while doing queries. if I do commits every 2 seconds I basically

RE: fast update handlers

2007-05-10 Thread Will Johnson
I guess I was more concerned with doing the frequent commits and how that would affect the caches. Say I have 2M docs in my main index but I want to add docs every 2 seconds all while doing queries. If I do commits every 2 seconds I basically lose any caching advantage and my faceting performanc

Re: fast update handlers

2007-05-10 Thread Yonik Seeley
On 5/10/07, Will Johnson <[EMAIL PROTECTED]> wrote: I'm trying to set up a system to have very low index latency (1-2 seconds) and one of the javadocs intrigued me: "DirectUpdateHandler2 implements an UpdateHandler where documents are added directly to the main Lucene index as opposed to adding t

fast update handlers

2007-05-10 Thread Will Johnson
I'm trying to set up a system to have very low index latency (1-2 seconds) and one of the javadocs intrigued me: "DirectUpdateHandler2 implements an UpdateHandler where documents are added directly to the main Lucene index as opposed to adding to a separate smaller index" The plain DirectUpd

Costume response writer

2007-05-10 Thread Debra
I have written a custom response writer and added the response writer to solrconfig.xml When I run a program I can see the custom response writer is initialized, but when I run a search with the custom writer's name as the wt parameter the search is executed but the response writer is not ca

Re: Does solr support index which made by lucene 1.4.3

2007-05-10 Thread Yonik Seeley
On 5/10/07, James liu <[EMAIL PROTECTED]> wrote: i try, it show me error information: Solr could support a Lucene 1.4.3 index if the schema was configured to match it. I see the following buried in your logs: java.lang.RuntimeException: Can't find resource 'solrconfig.xml' -Yonik

Re: Question about delete

2007-05-10 Thread James liu
but index file size not changed and maxDoc not changed. > 2007/5/10, Nick Jenkin <[EMAIL PROTECTED]>: Hi James, As I understand it numDocs is the number of documents in your index, maxDoc is the most documents you have ever had in your index. You currently have no documents in your index by

Re: unsubscribe

2007-05-10 Thread Thorsten Scherler
On Thu, 2007-05-10 at 10:05 +0100, Kainth, Sachin wrote: > unsubscribe Hi Sachin, you need to send to a different mailing address: [EMAIL PROTECTED] HTH salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training

Re: Question about delete

2007-05-10 Thread Nick Jenkin
Hi James, As I understand it numDocs is the number of documents in your index, maxDoc is the most documents you have ever had in your index. You currently have no documents in your index by the looks, thus your delete query must have deleted everything. That would be why you are getting no results.

unsubscribe

2007-05-10 Thread Kainth, Sachin
unsubscribe