Re: Working Chef Cookbook for Solr

2010-12-10 Thread György Frivolt
t; that solr cloud does!) > > Don't know when I'll get the time, though, I'm afraid. > > Upayavira > > On Fri, 10 Dec 2010 10:45 +0100, "György Frivolt" > wrote: >> Hi, >> >>      I tried to setup Solr by chef and so far found only t

Working Chef Cookbook for Solr

2010-12-10 Thread György Frivolt
Hi, I tried to setup Solr by chef and so far found only the opscode one, but this one setup only the group and the user for solr, not the solr engine. Does anyone know about a maintained solr chef cookbook? Thanks for suggestion! Georg

Solr crashing while extracting from very simple text file

2010-03-22 Thread György Frivolt
Hi, I had problem with indexing documents some months ago as well. I found that there were XML control characters in the documents and these were not handled by Solr. Maybe it is the case for you as well. Regards, Georg On Sun, Mar 21, 2010 at 5:58 PM, Ross wrote: > Hi all > > I'm tr

Re: Indexing HTML document

2010-03-03 Thread György Frivolt
cumented here, which might be of some help - > > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory > > > > Control characters can be eliminated using code like this - > > > http://bitbucket.org/cogtree/python-solr/src/tip/pytho

Indexing HTML document

2010-03-02 Thread György Frivolt
Hi, How to index properly HTML documents? All the documents are HTML, some containing charaters encodid like ží ... Is there a character filter for filtering these codes? Is there a way to strip the HTML tags out? Does solr weight the terms in the document based on where they appear?.. words in hea

Re: unexpected result using OR in query

2010-02-27 Thread György Frivolt
On Thu, 2/25/10, György Frivolt wrote: > > I run into an unexpected behaviour for Solr > > with query parsing. I need > > to fetch article which contain several expressions. > > However, I noticed the > > following behaviour: > > > > - when I fetch

unexpected result using OR in query

2010-02-25 Thread György Frivolt
Hi, I run into an unexpected behaviour for Solr with query parsing. I need to fetch article which contain several expressions. However, I noticed the following behaviour: - when I fetch results for query A I get a number of results X - for query B I get a number of results Y - for query A B a

Re: SolrException caused by illegal character

2009-11-27 Thread György Frivolt
.org/wiki/ASCII > > Otis > -- > Sematext is hiring -- http://sematext.com/about/jobs.html?mls > Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR > > > > - Original Message >> From: György Frivolt >> To: solr-user >> Sent: Thu,

SolrException caused by illegal character

2009-11-26 Thread György Frivolt
Hi, I upgradeed to Solr 1.4 and tried to reindex the data. After few thousand of reindexed documents an exception is thrown, I did not meet this using 1.3 before. Do you have any idea what caused the problem? Thanks. SEVERE: org.apache.solr.common.SolrException: Illegal character ((CTRL-CHAR,

Re: Solr and LSA

2009-10-30 Thread György Frivolt
Paul Libbrecht wrote: > Not with solr but with Lucene, there is the project called semanticvectors. > It would be cute to make it a solr module. > > paul > > > Le 30-oct.-09 à 09:17, György Frivolt a écrit : > > > Hi, >> Does anyone of you have experiences with us

Solr and LSA

2009-10-30 Thread György Frivolt
Hi, Does anyone of you have experiences with using LSA, Latent Semantic Analysis with Solr? I would like to search for expressions, but also find records, which have context relevant for the given query. Would appreciate any clue where to start, what to consider. Bw, Georg

Re: Searching with or without diacritics

2009-09-18 Thread György Frivolt
Hi, Thanks for the suggestions, perhaps I am closer to the goal, but still don't get the result. I would like to find accented characters (mapped by the MappingCharFilterFactory) by writing unaccented queries. On this page: http://issues.ez.no/IssueView.php?Id=14742&activeItem=2 I've found that

Re: Searching with or without diacritics

2009-09-17 Thread György Frivolt
1:14 AM, Chris Hostetter wrote: > > Take a look at the MappingCharFilterFactory (in Solr 1.4) and/or the > ISOLatin1AccentFilterFactory. > > : Date: Thu, 27 Aug 2009 16:30:08 +0200 > : From: "[ISO-8859-1] György Frivolt" > : Reply-To: solr-user@lucene.apache.org

Searching with or without diacritics

2009-08-27 Thread György Frivolt
Hello, I started to use solr only recently using the ruby/rails sunspot-solr client. I use solr on a slovak/czech data set and realized one not wanted behaviour of the search. When the user searches an expression or word which contains dicritics, letters like š, č, ť, ä, ô,... usually the spe