[solrCloud] Distributed IDF - scoring in the cloud

2011-02-18 Thread Thorsten Scherler
etter to look into something that aims to scale the index into hbase so he does not lose the scoring. TIA for your feedback -- Thorsten Scherler codeBusters S.L. - web based systems http://www.codebusters.es/ smime.p7s Description: S/MIME cryptographic signature

Re: SolrCloud - Example C not working

2011-02-15 Thread Thorsten Scherler
Hmm, nobody has an idea, for everybody the example c is working fine. salu2 On Mon, 2011-02-14 at 14:08 +0100, Thorsten Scherler wrote: > Hi all, > > I followed http://wiki.apache.org/solr/SolrCloud and everything worked > fine till I tried "Example C:". > > I s

SolrCloud - Example C not working

2011-02-14 Thread Thorsten Scherler
blem seems that the zk instances can not connects to the different nodes and so do not get up at all. I am using revision 1070473 for the tests. Anybody has an idea? salu2 -- Thorsten Scherler codeBusters S.L. - web based systems http://www.codebusters.es/ smime.p7s Description: S/MIM

Re: big index vs. lots of small ones

2010-01-25 Thread Thorsten Scherler
am not worng the first is to get term > frequencies and needed info and the second one is the proper search request. > The patch also includes caching for terms in the first request. > Nice! Thank you very much, Mark. Como van las cosas en Barcelona? salu2 > > Thorsten S

big index vs. lots of small ones

2010-01-20 Thread Thorsten Scherler
re other people doing to solve this problem? - What is the best way with Solr to solve the problem of the "base" scoring? - What is the best way to have multiple indexes in solr? - Is it possible to get rid of the "base" scoring in solr? TIA for any informations. salu2 -- Thors

Re: Beginner questions: Jetty and solr with utf-8 + cached page + dedup

2008-03-26 Thread Thorsten Scherler
olr "... Typically it's not recommended to have your front end users/clients hitting Solr directly as part of an HTML form submit ..." In the above page there you find answers to many of your questions. HTH salu2 -- Thorsten Scherler thorst

Re: out of memory every time

2008-03-03 Thread Thorsten Scherler
S. > Shan W. > Deshpande P. > 31311_at > > > > > > the schema is (in summary): > > multiValued="false" omitNorms="true"/> > multiValued="true" omitNorms="true"/> > &g

Re: solr to work for my web application

2008-02-22 Thread Thorsten Scherler
On Fri, 2008-02-22 at 04:11 -0800, newBea wrote: > Hi Thorsten, > > Many thanks for ur replies so far...finally i set up correct environment for > Solr. Its working:clap: :) Congrats, glad you got it running. > > Solr Rocks! Indeed. :) salu2 > > Thorsten Scherle

Re: How do I secure solr server?

2008-02-21 Thread Thorsten Scherler
the server and also prevent unauthorized > access (through the web) to admin page. I would not expose http://yourServer:8983 at all. I would use an Apache httpd server as proxy and implement the ac there. salu2 > > Any help is extremely appreciated!! :) >

Re: solr to work for my web application

2008-02-19 Thread Thorsten Scherler
${solr.data.dir:C:\solr \dat} but I am not using windows so I am not sure whether you may need to escape the path. salu2 > but the indexes are not getting stored there...indexes for > search are getting stored in the default dir of solr...any suggestions > > Thanks in advance..

Re: solr to work for my web application

2008-02-13 Thread Thorsten Scherler
different server running. Alternatively you can run solr in the same tomcat as you application. If you follow SolrTomcat from the wiki it will be install as "solr" servlet. Your application will then communicate with this serlvet. salu2 > > any help? > > Thorsten Scherler

Re: solr to work for my web application

2008-02-13 Thread Thorsten Scherler
> Hmm, what is your update command and your unique key? We would need to see this modification to tell you what may be wrong. Did you try http://YOUR_HOST:8983/solr/admin/luke?wt=xslt&tr=luke.xsl What does this gives? salu2 > I am not able to find the solution for this one... any

Re: solr to work for my web application

2008-02-13 Thread Thorsten Scherler
arted: http://wiki.apache.org/solr http://wiki.apache.org/solr/mySolr http://wiki.apache.org/solr/SolrTomcat salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Get last updated/committed document

2007-11-26 Thread Thorsten Scherler
On Sat, 2007-11-24 at 00:17 +1100, climbingrose wrote: > Assuming that you have the timestamp field defined: > q=*:*&sort=timestamp desc > Thanks. salu2 > On Nov 23, 2007 10:43 PM, Thorsten Scherler > <[EMAIL PROTECTED]> wrote: > > Hi all, > > > > I

Get last updated/committed document

2007-11-23 Thread Thorsten Scherler
Hi all, I need to ask solr to return me the id of the last committed document. Is there a way to archive this via a standard lucene query or do I need a custom connector that gives me this information? TIA for any information salu2 -- Thorsten Scherler

Re: Help with Debian solr/jetty install?

2007-11-21 Thread Thorsten Scherler
0) > at > org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:633) > at org.mortbay.http.HttpContext.handle(HttpContext.java:1482) > at org.mortbay.http.HttpServer.service(HttpServer.java:909) > at org.mortbay.http.HttpConnection.service(HttpConnection.java:820) > at > org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:986) > at org.mortbay.http.HttpConnection.handle(HttpConnection.java:837) > at > org.mortbay.http.SocketListener.handleConnection(SocketListener.java:245) > at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357) > at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534) > > > > -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: escaping characters and security

2007-11-06 Thread Thorsten Scherler
using JDBC or some other binary protocol, Solr uses HTTP, and you *can* talk to it directly from a web browser, but that's really more of a debugging feature then anything else." HTH salu2 > > Thanks, > Micah -- Thorsten Scherler thorsten.at.apa

Re: Tagging in solr

2007-10-19 Thread Thorsten Scherler
a tag_keywords field, but updating a single tag_keywords field is not so straightforward without sending the entire document to Solr every time it is tagged. See SOLR-139's extensive comments and patches to see what you're getting into. salu2 -- Thorsten Scherler

Re: Search results problem

2007-10-17 Thread Thorsten Scherler
s I expected to find there. > > > > Best regards, > > > > Max > > > > -- > > Maximilian Hütter > > blue elephant systems GmbH > > Wollgrasweg 49 > > D-70599 Stuttgart > > > > Tel: (+49) 0711 - 45 10 17 57

Re: Converting German special characters / umlaute

2007-09-28 Thread Thorsten Scherler
ler -> Mueller Mueller -> Mueller is right to further shorten the word does not seems right since one is changing the meaning too much. Further: groß -> gross gross -> gross ß is pronounced 'sz' but only replaced by 'ss'. salu2 > - J.J. -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Problem with html code inside xml

2007-09-25 Thread Thorsten Scherler
t; > > < > > div > > class > > = > > "paragraphTitle" > > /> > > − > > < > > ... > > > > It is not exactly what I want. I want to keep the html tags, that all > > without formatting. > > > > So the br tags and a tags are well formed in xml and json result, but > > the div tags are not kept. > > - > > In the schema.xml I've got this for the html content > > > > > > > >> stored="true" multiValued="true"/> > > > > - > > > > Any help would be appreciate. > > > > Thanks in advance. > > > > S. Christin > > > > > > > > > > > > > > -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: How to get all the search results - python

2007-09-25 Thread Thorsten Scherler
he response to see how many results you have 3) do a loop (rows=50,start=50*x) and call solr till you have all results. Like Jérôme stated: On Mon, 2007-09-24 at 12:45 +0100, Jérôme Etévé wrote: > By design, it's not very efficient to ask for a large number of &

Re: How to get all the search results - python

2007-09-24 Thread Thorsten Scherler
position, offset etc for > pagination through the python client? http://wiki.apache.org/solr/CommonQueryParameters It should work as described in the above document (with the start parameter. e.g. data = c.search(q='query', fl='id score uniq

Re: How to get all the search results - python

2007-09-24 Thread Thorsten Scherler
em. Is it the limitation of rows/pagination? If so why not using a real high number (like rows=100)? salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Scripts not working on cron - always asking for password

2007-09-21 Thread Thorsten Scherler
t; Piete > >>>> > >>>> > >>>> > >>>> On 20/09/2007, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > >>>>> > >>>>> Hi, there, > >>>>> > >>>>> I used an absolute path

RE: Strange behavior when searching with accents

2007-09-21 Thread Thorsten Scherler
t: Thursday, September 20, 2007 8:11 AM > To: solr-user@lucene.apache.org > Subject: Re: Strange behavior when searching with accents > > On 9/20/07, Thorsten Scherler <[EMAIL PROTECTED]> > wrote: > > ...Betrand, does the French Snowball work fine?... > > I'v

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
al after -s. There is a difficulty with pure vowel endings, and the stemmer can't always get this right. So in English 'academic' is stemmed to 'academ' but 'academy' does not lose the final -y (or -i). This explains the residual vowels with -io, -ia endings etc." salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
; > > > Thank you both for the quick response. > > > > On 20/09/2007, Bertrand Delacretaz < [EMAIL PROTECTED] > wrote: > > > > > > On 9/20/07, Thierry Collogne < [EMAIL PROTECTED]> wrote: > > > > > > > ..when we search for "matthé" or for "matthe", we get two totally > > > > different results > > > > > > The analyzer admin tool should help you find out what's happening, see > > > http://wiki.apache.org/solr/FAQ#head-b25df8c8393bbcca28f1f344c432975002e29ca9 > > > > > > > > > -Bertrand > > > > > > > -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
On 20/09/2007, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: > > > > On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote: > > > > > ..when we search for "matthé" or for "matthe", we get two totally > > > different

Re: Strange behavior when searching with accents

2007-09-20 Thread Thorsten Scherler
That highly depends on your schema. Do you use ? I am using the following an it works like a charm HTH salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: RSS syndication Plugin

2007-09-06 Thread Thorsten Scherler
ch Ryan to point me into the right direction and Brian Whitman for his contribution. salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

RSS syndication Plugin

2007-09-06 Thread Thorsten Scherler
ody already done this? salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Indexing very large files.

2007-09-06 Thread Thorsten Scherler
The problem is that the "text" object is bigger then heaps, maybe invoking garbage collection before will help. salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Tagging using SOLR

2007-09-06 Thread Thorsten Scherler
... HTH salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Indexing very large files.

2007-09-06 Thread Thorsten Scherler
t from the > module org.xmlpull.mxp1.MXParser. Hmmm. I'm open to sugestions and ideas. Which version do you use of solr? http://svn.apache.org/viewvc/lucene/solr/trunk/src/java/org/apache/solr/handler/XmlUpdateRequestHandler.java?view=markup The trunk ver

Re: The mechanism of data replciation in Solr?

2007-09-05 Thread Thorsten Scherler
an anybody help me? Thank you in advance. http://wiki.apache.org/solr/CollectionDistribution HTH > > Best Wishes. -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

RE: storing the document URI in the index

2007-06-12 Thread Thorsten Scherler
over http, like from a webdav call) > > Anyway, thx for all answers, and again, sry for mails not indenting properly > at the moment, it irritates me as well :-) > > Regards Ard Hi Ard, you may want to have a look at http://wiki.apache.org/solr/SolrForres

Re: Packaging solr for Debian: using debian-supplied lucene-*.jar

2007-06-04 Thread Thorsten Scherler
On Sun, 2007-06-03 at 09:55 +0200, Jan-Pascal van Best wrote: > Hi all, > > I'm working on packaging Solr for Debian. Very nice. :) Since this is a developer topic I think this topic should be discussed on our dev list. salu2 -- Tho

Re: unsubscribe

2007-05-10 Thread Thorsten Scherler
On Thu, 2007-05-10 at 10:05 +0100, Kainth, Sachin wrote: > unsubscribe Hi Sachin, you need to send to a different mailing address: [EMAIL PROTECTED] HTH salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, train

[Standings] Solr logo poll

2007-04-10 Thread Thorsten Scherler
Hi all, I did a small count till now we have: a) 21 b) 13 salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: Solr logo poll

2007-04-07 Thread Thorsten Scherler
B Graffiti style. -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions

Re: SolrSearchGenerator for Cocoon (2.1)

2007-03-27 Thread Thorsten Scherler
d/plugins/org.apache.forrest.plugin.output.solr/build/org.apache.forrest.plugin.output.solr.jar $cocoon-2.1.x_webapp/WEB-INF/lib/ >From there you can use the cocoon components as usual in your project. salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java & XMLconsulting, training and solutions

RE: Using cocoon to update index

2007-03-26 Thread Thorsten Scherler
On Mon, 2007-03-26 at 09:30 -0400, Winona Salesky wrote: > Thanks Chris, I'll take another look at the forest plugin. Have a look as well at http://wiki.apache.org/solr/SolrForrest it points out the cocoon components. salu2 -- Thorsten

[Droids] Re: crawler feed?

2007-02-08 Thread Thorsten Scherler
ome to join the effort. Who knows maybe the first droid is a standalone nutch crawler with plugin extension points if some nutch committer joins the lab. Thanks rubdabadub for your feedback. salu2 > > Regards > > On 2/7/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: >

Re: crawler feed?

2007-02-07 Thread Thorsten Scherler
ity. salu2 > > -- > Sami Siren > > [1]http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html -- Thorsten Scherler thorsten.at.apache.org Open Source Java & XMLconsulting, training and solutions

Re: crawler feed?

2007-02-07 Thread Thorsten Scherler
If somebody is interested in such a standalone crawler project, I welcome any help, ideas, suggestion, feedback and/or questions. salu2 -- Thorsten Scherler thorsten.at.apache.org Open Source Java & XML consulting, training and solutions

Re: Analyzers and Tokenizers?

2007-02-06 Thread Thorsten Scherler
issingLast="true" omitNorms="true"> > > > more > > > Is there any complete list somewhere ..or how can I find more info about them? http://svn.apache.org/viewvc/lucene/solr/trunk/src/java/org/apache

Re: Posting from Ant

2007-02-01 Thread Thorsten Scherler
>>>>> From: "Binkley, Peter" <[EMAIL PROTECTED]> > >>>>>> Date: January 31, 2007 1:56:06 PM EST > >>>>>> To: > >>>>>> Subject: Posting from Ant > >>>>>> Reply-To: solr-user@lucene.apache.org > >>>>>> > >>>>>> Is there an Ant task out there somewhere that can POST > >>> bunches of > >>>>>> files > >>>>>> to Solr, doing what the post.sh script does but with filesets? > >>>>>> > >>>>>> I've found the http post task > >>>>>> (http://antelope.tigris.org/nonav/docs/manual/bk03ch17.html), > >>>>> but it > >>>>>> just posts name-value pairs, not files; and Slide's set of > >>> webdav > >>>>>> client > >>>>>> tasks > >>>>>> (http://gulus.usherbrooke.ca/pub/appl/apache/jakarta/slide/ > >>>>> binaries/ > >>>>>> jaka > >>>>>> rta-slide-ant-webdav-bin-2.1.zip) has PUT and GET but not > >>> POST. It > >>>>>> shouldn't be hard to adapt one of these, but something pre- > >>> existing > >>>>>> would be better. > >>>>>> > >>>>>> Peter > >>>>>> > >>>>>> Peter Binkley > >>>>>> Digital Initiatives Technology Librarian Information Technology > >>>>>> Services 4-30 Cameron Library University of Alberta Libraries > >>>>>> Edmonton, Alberta Canada T6G 2J8 > >>>>>> Phone: (780) 492-3743 > >>>>>> Fax: (780) 492-9243 > >>>>>> e-mail: [EMAIL PROTECTED] > >>>>> > >>>>> > >>> > >>> > -- Thorsten Scherler thorsten.at.apache.org Open Source Java & XMLconsulting, training and solutions

Re: Solr "autostart"

2007-01-28 Thread Thorsten Scherler
On Sun, 2007-01-28 at 10:34 -0500, Tim Archambault wrote: > Using Solr with Jetty on linux VPS server. When I ssh and run "start.jar" I > can go to a web browser and with success to the /solr/admin page. I acn > query with the whole "nine" no problems. > However when I close out my terminal sessi

Re: solr + cocoon problem

2007-01-17 Thread Thorsten Scherler
On Wed, 2007-01-17 at 10:25 -0500, [EMAIL PROTECTED] wrote: > Hi, > > I agree, this is not a legal URL. But the thing is that cocoon itself is > sending the unescaped URL. ...because you told it so. You use http://hostname/solr/select/?q={request-param:q}"; type="file" > The request para

Re: XML querying

2007-01-17 Thread Thorsten Scherler
On Wed, 2007-01-17 at 09:36 +, Luis Neves wrote: > Hi, > > Thorsten Scherler wrote: > > On Mon, 2007-01-15 at 13:42 +, Luis Neves wrote: > > > > > I think you should explain your use case a wee bit more. > > > >>>> What I do now t

Re: XML querying

2007-01-16 Thread Thorsten Scherler
On Mon, 2007-01-15 at 13:42 +, Luis Neves wrote: > Hi! > > Thorsten Scherler wrote: > > > On Mon, 2007-01-15 at 12:23 +, Luis Neves wrote: > >> Hello. > >> What I do now to index XML documents it's to use a Filter to strip the > >> ma

Re: Calling Solr requests from java code - examples?

2007-01-16 Thread Thorsten Scherler
On Tue, 2007-01-16 at 13:56 +0100, Bertrand Delacretaz wrote: > On 1/16/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > > > ...Have a look at > > https://issues.apache.org/jira/browse/SOLR-86... > > Right, I should have mentioned this one as well. I have linked

Re: solr + cocoon problem

2007-01-16 Thread Thorsten Scherler
On Tue, 2007-01-16 at 16:02 -0500, [EMAIL PROTECTED] wrote: > Hi, > > I am trying to implement a cocoon based application using solr for searching. > In particular, I would like to forward the request from my response page to > solr. I have tried several alternatives, but none of them worked for

Re: solr + cocoon problem

2007-01-16 Thread Thorsten Scherler
On Tue, 2007-01-16 at 16:19 -0500, Walter Lewis wrote: > [EMAIL PROTECTED] wrote: > > Any ideas on how to implement a cocoon layer above solr? I just finished a forrest plugin (in the whiteboard, our testing ground in forrest) that is doing what you asked for and some pagination. Forrest is cocoon

Re: Converting Solr response back to pojo's, experiences?

2007-01-16 Thread Thorsten Scherler
On Tue, 2007-01-16 at 14:58 +0100, [EMAIL PROTECTED] wrote: > Anyone having experience converting xml responses back to pojo's, > which technologies have you used? > > Anyone doing json <-> pojo's? Using pure xml myself but have a look at https://issues.apache.org/jira/browse/SOLR-20 and http

Re: Calling Solr requests from java code - examples?

2007-01-16 Thread Thorsten Scherler
On Tue, 2007-01-16 at 12:52 +0100, [EMAIL PROTECTED] wrote: > Thanks! > > and how would you do it calling it from another web application, let's > say from a servlet or so? I need to do some stuff in my web java code, > then call the Solr service and do some more stuff afterwards > Have a lo

Re: XML querying

2007-01-15 Thread Thorsten Scherler
On Mon, 2007-01-15 at 12:23 +, Luis Neves wrote: > Hello. > What I do now to index XML documents it's to use a Filter to strip the > markup, > this works but it's impossible to know where in the document is the match > located. > What would it take to make possible to specify a filter query

Re: How can I update a specific field of an existing document?

2007-01-11 Thread Thorsten Scherler
On Thu, 2007-01-11 at 17:48 +0100, Thorsten Scherler wrote: > On Thu, 2007-01-11 at 10:19 -0600, Iris Soto wrote: > > Hello everybody, > > I want update a specific field in a document, but i don't find how do it > > in the documentation of Solr. > > Is that posibl

Re: How can I update a specific field of an existing document?

2007-01-11 Thread Thorsten Scherler
On Thu, 2007-01-11 at 10:19 -0600, Iris Soto wrote: > Hello everybody, > I want update a specific field in a document, but i don't find how do it > in the documentation of Solr. > Is that posible?, I need to index only a field for a document, Do i have > to index all the document for this? > The

Re: Performance tuning

2007-01-11 Thread Thorsten Scherler
On Thu, 2007-01-11 at 14:57 +, Stephanie Belton wrote: > Hello, > > > > Solr is now up and running on our production environment and working great. > However it is taking up a lot of extra CPU and memory (CPU usage has doubled > and memory is swapping). Is there any documentation on perfo

Re: [ANN] Apache Forrest/Cocoon based solr client plugin

2007-01-10 Thread Thorsten Scherler
ocs/plugins_0_80/org.apache.forrest.plugin.output.solr/images/result.png I added a transformer that adds the paginator part to the solr select result. The paginator is the "Result pages" part of above screenshot. Hmm, that makes me think whether that (the paginator) would be better dire

Re: newbie question on determining fieldtype

2007-01-08 Thread Thorsten Scherler
On Mon, 2007-01-08 at 10:29 -0300, mike topper wrote: > Hi, > > I have a question that I couldn't find the exact answer to. > > I have some fields that I want to add to my schema but will never be > searched on. They are only used as additional information about a > document when retrieved.

Re: Seeking FAQs

2007-01-08 Thread Thorsten Scherler
On Sat, 2007-01-06 at 10:25 -0500, David Halsted wrote: > I wonder what would happen if we used a clustering engine like Carrot > to categorize either the e-mails in the archive or the results of > searches against them? Perhaps we'd find some candidates for the FAQ > that way. Not sure about too

[ANN] Apache Forrest/Cocoon based solr client plugin

2007-01-07 Thread Thorsten Scherler
Hi all, I am happy to announce that I just add a Apache Forrest based Apache Solr client plugin to the forrest whiteboard. It may be from interest for the ones using Apache Cocoon based Apache Forrest and Apache Lucene based Apache Solr. org.apache.forrest.plugin.output.solr generates Apache Solr

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Thorsten Scherler
vior. Thanks again Edward. salu2 > > > > > > > > select="substring($insideEm, string-length($preEm)+5)"/> > > > > > > > > > >

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Thorsten Scherler
7;m not really familiar with > > xslt so I'm unsure why that isn't so (perhaps it is automatically > > html-escaping the values after un-xml-escaping them?) > > > > Be careful of documents containing html fragments natively. > > > > cheers, > > -M

How to tell the highlighter not to escape?

2007-01-02 Thread Thorsten Scherler
Hi all, I am playing around with the highlighter and found that all highlight terms get escaped. I mean solr will return TERM and not TERM I am not sure where this escaping is happening but I would need the highlighting to NOT escape the hl.simple.pre and hl.simple.post tag since it

Re: Is there a BasicSummarizer for solr?

2007-01-02 Thread Thorsten Scherler
Erik, with this information and a small change in my schema changed stored="false" to stored="true" on my main content, I get exactly what I needed. Now I have to see the effect of storing the content in the index regarding size and response time. Thanks again. salu2 >

Is there a BasicSummarizer for solr?

2007-01-02 Thread Thorsten Scherler
Hi all, I need to implement a summary function with solr like there is in nutch. Basically it returns x words before and after the query term to show the content where the term is embedded (like as google does). In nutch this functionality is provided by http://svn.apache.org/viewvc/lucene/nutch

Re: Solr 1.1 released

2006-12-25 Thread Thorsten Scherler
On Fri, 2006-12-22 at 17:07 -0500, Yonik Seeley wrote: > Solr 1.1 is now available for download! Very nice. :) Thanks a lot to this community and especially to Yonik who packed the release. salu2

Re: Help with spellchecker integration

2006-12-22 Thread Thorsten Scherler
On Thu, 2006-12-21 at 21:27 -0800, Otis Gospodnetic wrote: > Hi, > I'm trying to integrate the Lucene-based spellchecker > (http://wiki.apache.org/jakarta-lucene/SpellChecker + contrib/spellchecker > under Lucene) with Solr (http://issues.apache.org/jira/browse/SOLR-81) in > order to provide a

Re: Realtime directory change...

2006-12-22 Thread Thorsten Scherler
On Thu, 2006-12-21 at 12:23 -0800, escher2k wrote: > Hi, > We currently use Lucene to do index user data every couple of hours - the > index is completely rebuilt, > the old index is archived and the new one copied over to the directory. > Example - > > /bin/cp ${LOG_FILE} ${CRON_ROOT}/index/hel

Re: solr index reusable with nutch?

2006-12-20 Thread Thorsten Scherler
On Thu, 2006-12-14 at 11:14 -0800, Chris Hostetter wrote: > : In my use case I need the reverse. Nutch searches the index created by > : my solr application. The application is just one component in the portal > : and the portal will provide a "global" search engine which should use > : the index f

Re: solr index reusable with nutch?

2006-12-15 Thread Thorsten Scherler
On Thu, 2006-12-14 at 11:14 -0800, Chris Hostetter wrote: > : In my use case I need the reverse. Nutch searches the index created by > : my solr application. The application is just one component in the portal > : and the portal will provide a "global" search engine which should use > : the index f

Re: solr index reusable with nutch?

2006-12-13 Thread Thorsten Scherler
out I will keep you informed. Thanks for the feedback. salu2 > Otis > > ----- Original Message > From: Thorsten Scherler <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Wednesday, December 13, 2006 8:26:51 AM > Subject: solr index reusable with nutc

solr index reusable with nutch?

2006-12-13 Thread Thorsten Scherler
Hi all, is it possible to directly use the solr index in nutch? My client is creating a portal search based on nutch. In this portal there is as well my project and ATM I prefer to go with solr instead of nutch since it its much better for my use case. Now the question is whether the portal sear

Re: search engine for regional bulletins

2006-11-28 Thread Thorsten Scherler
On Tue, 2006-11-28 at 11:30 -0500, Yonik Seeley wrote: > On 11/28/06, Thorsten Scherler > <[EMAIL PROTECTED]> wrote: > > That is good to hear, so if somebody (e.g. me) would check all files for > > cleanness then we could release, right? Perfect. > > Correct. All I

Re: search engine for regional bulletins

2006-11-28 Thread Thorsten Scherler
On Tue, 2006-11-28 at 10:00 +0100, Bertrand Delacretaz wrote: > Hi Thorsten, good to see you here! :) Hi Bertrand, thanks very much for this warm welcome and I am as well glad to meet you here. > > On 11/28/06, Thorsten Scherler > <[EMAIL PROTECTED]> wrote: > > > ..

search engine for regional bulletins

2006-11-28 Thread Thorsten Scherler
Hi all, I am developing a search engine for a governmental body. This search engine has to index pure xml documents which follow a custom xml schema. The xml documents contain information about laws and official announcements for Andalusia. I need to implement different filter for the search. The