Syllabification, readability metric

2014-09-30 Thread Luis Carlos Guerrero
Hi, Does Lucene support syllabification of words out of the box? If so is there support for brazilian portuguese? I'm trying to setup a readability score for short text descriptions and this would be really helpful. thanks, -- Luis Carlos Guerrero about.me/luis.guerrero

Re: solr/lucene 4.10 out of memory issues

2014-09-16 Thread Luis Carlos Guerrero
n the same field not that common? On Tue, Sep 16, 2014 at 11:06 AM, Luis Carlos Guerrero < lcguerreroc...@gmail.com> wrote: > Thanks for the response, I've been working on solving some of the most > evident issues and I also added your garbage collector parameters. First of > al

Re: solr/lucene 4.10 out of memory issues

2014-09-16 Thread Luis Carlos Guerrero
p the GC work better for > you (which is not to say there isn't a leak somewhere): > > -XX:MaxTenuringThreshold=8 -XX:CMSInitiatingOccupancyFraction=40 > > This should lead to a nice up-and-down GC profile over time. > > On Thu, Sep 11, 2014 at 10:52 AM, Luis Carlos Guerre

solr/lucene 4.10 out of memory issues

2014-09-11 Thread Luis Carlos Guerrero
easons. Was there some issue reported related to elevated memory consumption by the field cache? any help would be greatly appreciated. regards, -- Luis Carlos Guerrero about.me/luis.guerrero

Re: Clearing old nodes from zookeper without restarting solrcloud cluster

2013-07-18 Thread Luis Carlos Guerrero Covo
7;d suggest to start some machine with the old >> name, run solr, join the cluster for a while, unload a core to unregister >> it from the cluster and shutdown host at the end. This way you could have >> clear cluster state. >> >> >> >> On 16 July 2013 14:

Re: Clearing old nodes from zookeper without restarting solrcloud cluster

2013-07-16 Thread Luis Carlos Guerrero Covo
> Hello Luis, > > > > I don't think that is possible. If you delete clusterstate.json from > > zookeeper, you will need to restart the nodes.. I could be very wrong > > about this > > > > Saqib > > > > > > On Mon, Jul 15, 2013

Re: Clearing old nodes from zookeper without restarting solrcloud cluster

2013-07-15 Thread Luis Carlos Guerrero Covo
I know that you can clear zookeeper's data directoy using the CLI with the clear command, I just want to know if its possible to update the cluster's state without wiping everything out. Anyone have any ideas/suggestions? On Mon, Jul 15, 2013 at 11:21 AM, Luis Carlos Guerrero Covo <

Clearing old nodes from zookeper without restarting solrcloud cluster

2013-07-15 Thread Luis Carlos Guerrero Covo
Hi, Is there an easy way to clear zookeeper of all offline solr nodes without restarting the cluster? We are having some stability issues and we think it maybe due to the leader querying old offline nodes. thank you, Luis Guerrero

Re: Content based recommender using lucene/solr

2013-06-28 Thread Luis Carlos Guerrero Covo
gt; > > > >> > > I've been thinking of using solr/lucene since I already have all > docs > >> > > available and I want a quick version that can be deployed while we > work > >> > on > >> > > a more robust recommender. How a

Re: Content based recommender using lucene/solr

2013-06-28 Thread Luis Carlos Guerrero Covo
utes of the item for which I want to generate recommendations? I > know > > it has its issues like recomputing scores/normalization/weight > application > > at query time which could make this idea unfeasible/impractical. I'm at a > > very preliminary stage right now with this and would love some > suggestions > > from experienced users. > > > > thank you, > > > > Luis Guerrero > > -- Luis Carlos Guerrero Covo M.S. Computer Engineering (57) 3183542047

Content based recommender using lucene/solr

2013-06-28 Thread Luis Carlos Guerrero Covo
Hi, I'm using lucene and solr right now in a production environment with an index of about a million docs. I'm working on a recommender that basically would list the n most similar items to the user based on the current item he is viewing. I've been thinking of using solr/lucene since I already h

Re: data consistency in solrcloud cluster deployed in aws

2013-06-14 Thread Luis Carlos Guerrero Covo
in asuming this? What are further steps > > that I could take to verify what could be the cause of the index not > > replicating fast enough to all nodes? > > > > thanks in advance for any help provided, > > > > Luis Guerrero > -- Luis Carlos Guerrero Covo M.S. Computer Engineering (57) 3183542047

data consistency in solrcloud cluster deployed in aws

2013-06-14 Thread Luis Carlos Guerrero Covo
Hi, I currently have solrcloud setup with single shards and two nodes behind a load balancer in aws. I also have an additional node in the cluster which is outside the load balancer (not receiving any client requests) importing data into the cluster using data import handler. So that takes my clus

Re: Grouping search results by field returning all search results for a given query

2013-05-09 Thread Luis Carlos Guerrero Covo
ray > of documents to the value provided. > > Thus: > > group.limit=1&group.field=companyid&group.offset=1 > > …would return the second item in each companyid group matching your > current query. > > Jason > > On May 9, 2013, at 10:30 AM, Luis Carlos Guer

Grouping search results by field returning all search results for a given query

2013-05-09 Thread Luis Carlos Guerrero Covo
Hi, I'm using solr to maintain an index of items that belong to different companies. I want the search results to be returned in a way that is fair to all companies, thus I wish to group the results such that each company has 1 item in each group, and the groups of results should be returned sorte