Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-11 Thread epnRui
Hi Iorixxx! I have not optimized the index but the day after this post I saw I didn't have this problem anymore. I will follow your advice next time! Now I'm avoiding so much manipulation at indexation time and I'm doing more work in the java code in the client side. If I had time I would imple

Re: Implementing a customised tokenizer

2014-03-11 Thread epnRui
Hi Ahmet, I think the expungesDelete is done automatically through SolrJ. So I don't think it was that. THe problem solved by itself apparently. I wonder if it has to do with an automatic optimization of Solr indexes? Otherwise it was something similar to XY problem :P Thanks for the help! --

Facets, termvectors, relevancy and Multi word tokenizing

2014-02-27 Thread epnRui
Hi everyone! I'm having a problem and I have searched and Haven't found a solution yet and am rather confused at the moment. I have an application that stores human readable texts in my Solr index. It finds the most relevant terms in that human readable text, I think using termvectors and facets,

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-02-28 Thread epnRui
Hi Ahmet!! I went ahead and did something I thought it was not a clean solution and then when I read your post and I found we thought of the same solution, including the European_Parliament with the _ :) So I guess there would be no way to do this more cleanly, maybe only implementing my own Tok

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-03 Thread epnRui
Hi guys, I'm on my way to solve it properly. This is how my field looks like now: I still have one case where I'm facing issues because in fact I want to pres

Implementing a customised tokenizer

2014-03-05 Thread epnRui
I have managed to understand how to properly implement and change the words on a CharFilter and a Filter, but I fail to understand how the Tokenizer works... I also fail to find any tutorials on the thing.. Could you provide some example implementation of incrementToken and how to manipulate the t

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-05 Thread epnRui
Hi guys, So, I keep facing this problem which I can't solve. I thought it was due to HTML anchors containing the name of the hashtag, and thus repeating it, but it's not. So the use case is: 1 - I need to consider hashtags as tokens. 2 - The hashtag has to show up in the facets. Right now if I i

Re: Implementing a customised tokenizer

2014-03-07 Thread epnRui
Hi iorixxx! Thanks for replying. I managed to get around well enough not to need a tokenizer customized implementation. That would be a pain in ... Anyway, now I have another problem, which is related to the following: - I had previously used replace chars and replace patterns, charfilters and

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-07 Thread epnRui
Hi guys! I solved my problem on the client side but at least I solved it... Anyway, now I have another problem, which is related to the following: - I had previously used replace chars and replace patterns, charfilters and filters, at index time to replace "EP" by "European Parliament". At that

setting up master and slave in same machine with diff ip's and same port

2013-01-23 Thread epnRui
Hi everyone its my first post here so I hope im doing it in the right place. Im a software developer and Im setting up a DEV environment in Ubuntu with the same configuration as in PROD. (apparently this IT department doesnt know the difference between a developer and a sys admin) In PROD we

Re: setting up master and slave in same machine with diff ip's and same port

2013-01-31 Thread epnRui
Hi, I solved the issue by setting up two different virtual network adapters in ubuntu server. case closed ;) thanks for the help!! -- View this message in context: http://lucene.472066.n3.nabble.com/setting-up-master-and-slave-in-same-machine-with-diff-ip-s-and-same-port-tp4035795p4037713.h

Stopping solr

2013-01-31 Thread epnRui
Hi people, First of all this forum is a god sent!!! Second: I have a master / slave configuration, using replication. Currently in production I have only one server, there's no backup server (really...). The webapplication is a public webapplication, everyone can see it. - How often, in your