facetting when using field collapsing

2010-11-07 Thread Lukas Kahwe Smith
Hi, I am pondering making use of field collapsing. I am currently indexing clauses (sections) inside UN documents: http://resolutionfinder.org/search/unifiedResults?q=africa&=&t[22]=medication&dc=&st=clause Now since right now my data set is still fairly small I am doing field collapsing in use

Re: Adding Carrot2

2010-11-07 Thread Lance Norskog
There are three xml sets. The solr/example set, the drupal solr, AND the set in contrib/clustering/src/test/resources/solr/conf/. These are what clustering is actually tested with. So, the first order of business is to check if clustering works with example/solr/conf. The diffs looked like the clus

Re: Tomcat special character problem

2010-11-07 Thread Dennis Gearon
In a post document, or a get document with URL encoded variables in the BODY of the document, it's possible to specify/use different encodings that are actually specified in the headers. For SURE in post, and I'm pretty sure in GET also.

Re: Tomcat special character problem

2010-11-07 Thread Em
I also thought that this might be the case a few hours ago. However, I have to verify that tomorrow. From a debugging point of view: How can I set the encoding of my browser's address bar? When I pressed enter the encoding switched from clear-text to an urlencoded version. The urlencoded version

RE: Adding Carrot2

2010-11-07 Thread Eric Martin
Yeah I know, you have to download the libraries and copy them to your /lib inside of Solr. In Solr 1.4 the plugin is available but the libraries are not. http://www.lucidimagination.com/blog/2009/09/28/solrs-new-clustering-capabilities/ I think there is something wrong with the schema and solrco

Re: Adding Carrot2

2010-11-07 Thread Lance Norskog
Carrot is already part of the Solr distributions. 1.4.1 and 3.x and the trunk. On 11/7/10, Eric Martin wrote: > Hi, > > > > Solr and nutch have been working fine. I now want to integrate Carrot2. I > followed this tutorial/quickstart: > http://www.lucidimagination.com/blog/2009/09/28/solrs-new-cl

Re: Tomcat special character problem

2010-11-07 Thread Michael Sokolov
Is it possible that your original search is being posted (HTTP POST), and the character encoding of the page with the form is not UTF-8? In that case, I believe a header gets sent with the request specifying a different character set (different from parameters in the URL, for which it's not p

RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
p 10 groups (assuming rows=10) in memory at any one time, hence it has no idea what the total number of groups is. -Yonik http://www.lucidimagination.com

RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
nightly build. -Yonik http://www.lucidimagination.com

Re: solr 4.0 - pagination

2010-11-07 Thread Yonik Seeley
On Sun, Nov 7, 2010 at 2:45 PM, Papp Richard wrote: > Hi Yonik, > >  I've just tried the latest stable version from nightly build: > apache-solr-4.0-2010-11-05_08-06-28.war > >  I have some concerns however: I have 3 documents; 2 in the first group, 1 > in the 2nd group. > >  1. I got for matches

RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard
AM, Papp Richard wrote: >  this is fantastic, but can you tell any time it will be ready ? It already is ;-) Grab the latest trunk or the latest nightly build. -Yonik http://www.lucidimagination.com

Adding Carrot2

2010-11-07 Thread Eric Martin
Hi, Solr and nutch have been working fine. I now want to integrate Carrot2. I followed this tutorial/quickstart: http://www.lucidimagination.com/blog/2009/09/28/solrs-new-clustering-capabilities/ I didn't see anything to adjust in my schema so I didn't do anything there. I did add the code

RE: Removing irrelevant URLS

2010-11-07 Thread Eric Martin
OK, thanks. I am using nutch and figuring out how to use urlfilters, unsuccessfully. Just thought there might be a way I could save some trouble this way. Thanks! -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Sunday, November 07, 2010 8:46 AM To: solr-user

Re: Removing irrelevant URLS

2010-11-07 Thread Erick Erickson
You can always do a delete-by-query, but that pre-supposes you can form a query that would remove only those documents with URLs you want removed... Assuming you do this, an optimize would then physically remove the documents from your index (delete by query just marks the docs as deleted). Solr h
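The two steps described above (a delete-by-query, then an optimize) could be sketched roughly as the XML update messages below. This is only an illustration: the field name `url` and the example query are assumptions, not taken from the thread, and the messages would still have to be POSTed to Solr's update handler (typically `/update`) by whatever client is in use.

```python
from xml.sax.saxutils import escape


def delete_by_query_message(query: str) -> str:
    """Build a Solr XML delete-by-query message, escaping XML special characters."""
    return f"<delete><query>{escape(query)}</query></delete>"


# Optimize message: merges segments so that documents marked as deleted are physically removed.
OPTIMIZE_MESSAGE = "<optimize/>"

# Hypothetical query; the field name "url" is an assumption about the schema.
messages = [
    delete_by_query_message('url:"http://example.com/unwanted"'),
    OPTIMIZE_MESSAGE,
]

for message in messages:
    print(message)
```

Escaping the query matters because characters such as `&` or `<` in a URL would otherwise produce malformed XML.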

RE: Corename after Swap in MultiCore

2010-11-07 Thread Ephraim Ofir
Do you mean solr.core.name has the wrong value after the swap? You swapped doc-temp so now it's doc and solr.core.name is still doc-temp? This completely contradicts my experience, what version of solr are you using? Why use postCommit? You're running the risk of performing a swap when you don't me

RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard

Re: solr 4.0 - pagination

2010-11-07 Thread Yonik Seeley
On Sun, Nov 7, 2010 at 10:55 AM, Papp Richard wrote: >  this is fantastic, but can you tell any time it will be ready ? It already is ;-) Grab the latest trunk or the latest nightly build. -Yonik http://www.lucidimagination.com

RE: solr 4.0 - pagination

2010-11-07 Thread Papp Richard

Re: Tomcat special character problem

2010-11-07 Thread Em
This helped a lot, since it solved the "göteburg"-problem. Thank you, Ken! Great help :-). Unfortunately there are some other encoding problems "fq=testcat%3Aacôme" worked, however the full url-encoded version "fq=testcat%3Aac%F4me" does not. The first version is the result of submitting the f
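For readers following along, the difference between the two URL forms can be reproduced with a small sketch. It shows that `%F4` is the ISO-8859-1 (Latin-1) percent-encoding of "ô", while the UTF-8 percent-encoding is `%C3%B4`. Whether this is the actual cause of the problem in this thread is an assumption; it would only apply if the container decodes the query string as UTF-8.

```python
from urllib.parse import quote, unquote

value = "acôme"

# Percent-encode the same text under two different character encodings.
latin1_form = quote(value, encoding="latin-1")
utf8_form = quote(value, encoding="utf-8")

print(latin1_form)  # ac%F4me
print(utf8_form)    # ac%C3%B4me

# Decoding the Latin-1 form as UTF-8 cannot recover "ô": the lone byte 0xF4
# is not a valid UTF-8 sequence, so it becomes the replacement character.
decoded = unquote(latin1_form, encoding="utf-8", errors="replace")
print(decoded == value)  # False
```

If the server side expects UTF-8, a client that sends `%F4` would need to send `%C3%B4` instead for the filter value to match.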

Re: Tomcat special character problem

2010-11-07 Thread Ken Stanley
On Sun, Nov 7, 2010 at 9:34 AM, Em wrote: > > Hi Ken, > > thank you for your quick answer! > > To make sure that there occurs no mistakes at my application's side, I send > my requests with the form that is available at solr/admin/form.jsp > > I changed almost nothing from the example-configurati

Re: Tomcat special character problem

2010-11-07 Thread Em
Hi Ken, thank you for your quick answer! To make sure that there occurs no mistakes at my application's side, I send my requests with the form that is available at solr/admin/form.jsp I changed almost nothing from the example-configurations within the example-package except some auto-commit par

Re: Tomcat special character problem

2010-11-07 Thread Ken Stanley
On Sun, Nov 7, 2010 at 9:11 AM, Em wrote: > > Hi List, > > I got an issue with my Solr-environment in Tomcat. > First: I am not very familiar with Tomcat, so it might be my fault and not > Solr's. > > It can not be a solr-side configuration problem, since everything worked > fine with my local Je

Tomcat special character problem

2010-11-07 Thread Em
Hi List, I got an issue with my Solr-environment in Tomcat. First: I am not very familiar with Tomcat, so it might be my fault and not Solr's. It can not be a solr-side configuration problem, since everything worked fine with my local Jetty-servlet container. However, when I deploy into Tomcat,

MoreLikeThis to extract relevant terms to the query from the index

2010-11-07 Thread farag ahmed
Hi All, I am using MoreLikeThis.java in lucene to expand the query with related terms. It works fine and I could retrieve the relevant documents to the query but I couldn’t know how to extract the related terms to the query for the index. my task is: For example query is "bank" related terms

MoreLikeThis to extract relevant terms to the query from the index

2010-11-07 Thread farag ahmed
Hi All, I am using MoreLikeThis.java in lucene to expand the query with related terms. It works fine and I could retrieve the relevant documents to the query but I couldn’t know how to extract the related terms to the query for the index. my task is: For example query is "bank" related