Re: behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Joe Zhang
In other words, what I wanted to achieve is case-senstive indexing on a small set of words. Can anybody help? On Sun, Dec 2, 2012 at 11:56 PM, Joe Zhang wrote: > To be more specific, this is the data type I was using: > > positionIncrementGap="100"> > >

Re: behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Joe Zhang
To be more specific, this is the data type I was using: On Sun, Dec 2, 2012 at 11:51 PM, Joe Zhang wrote: > yes, that is the correct

Re: behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Joe Zhang
yes, that is the correct behavior. But how do I achieve my goal, i.e, speical treatment on a list of uppercase/special words, normal treatment on everything else? On Sun, Dec 2, 2012 at 11:46 PM, Xi Shen wrote: > By the definition on > > https://lucene.apache.org/solr/api-3_6_1/org/apache/solr/a

Re: SolrCloud - Sorting Problem

2012-12-02 Thread deniz
I think I have figured out this... at least some kinda.. After putting logs here there in the code, especially in SolrCore, HttpShardHandler, SearchHandler classes, it seems like sorting is done after all of the shards finish "responding" and then before we see the results the result set is sorte

How Adding Dictionary??

2012-12-02 Thread Emiliana Suci
how to add a dictionary in lucene? please give an example. because I want to develop IndonesianStemmer by adding Indonesian dictionary. -- View this message in context: http://lucene.472066.n3.nabble.com/How-Adding-Dictionary-tp4023888.html Sent from the Solr - User mailing list archive at Nabb

Re: behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Xi Shen
By the definition on https://lucene.apache.org/solr/api-3_6_1/org/apache/solr/analysis/KeepWordFilter.html, I am pretty sure it is the correct behavior of this filter :) I guess you are trying to this filter to index some special words in Chinese? On Mon, Dec 3, 2012 at 1:54 PM, Joe Zhang wrote

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Joe Zhang
Sorry I didn't make it perfectly clear. The "id" field is URL. On Sun, Dec 2, 2012 at 11:33 PM, Joe Zhang wrote: > Thanks! > > > On Sun, Dec 2, 2012 at 11:20 PM, Xi Shen wrote: > >> If the value for "id" field is the same, the old entry will be update; if >> it is new, a new entry will be creat

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Joe Zhang
Thanks! On Sun, Dec 2, 2012 at 11:20 PM, Xi Shen wrote: > If the value for "id" field is the same, the old entry will be update; if > it is new, a new entry will be created & indexed. > > This is my experience. :) > > > On Mon, Dec 3, 2012 at 1:45 PM, Joe Zhang wrote: > > > Dear list, > > > > I

Re: duplicated URL sent from Nutch to solr index

2012-12-02 Thread Xi Shen
If the value for "id" field is the same, the old entry will be update; if it is new, a new entry will be created & indexed. This is my experience. :) On Mon, Dec 3, 2012 at 1:45 PM, Joe Zhang wrote: > Dear list, > > I just want to confirm an expected behavior of solr: > > Assuming we have " id

behavior of solr.KeepWordFilterFactory

2012-12-02 Thread Joe Zhang
I defined the following data type in my solr schema.xml when I use the type "testkeep" to index a test field, my true expecation was to make sure solr indexes the uppercase form of a small list of words in the file, AND TREAT EVERY OTHER WORD AS USUAL. The goal of securing the clo

Re: Luke and SOLR search giving different results

2012-12-02 Thread Jack Krupansky
Have you tried using the Solr Admin Analysis page, using the word and a few words of context for index analysis and the word alone for query analysis? And be sure to fully reindex if you change ANYTHING in the schema fields or field types. -- Jack Krupansky From: Erol Akarsu Sent: Sunday, Dec

Luke and SOLR search giving different results

2012-12-02 Thread Erol Akarsu
Hi, I am trying to apply SOLR for Turkish Language for my research. Instead of using language identification, I manually assigned Turkish language for a sample test document. I have configured SOLR schema.xml, activated the part below. I have added the attached document testTurkishDoc.xml that is

Re: SolrCloud - Sorting Problem

2012-12-02 Thread deniz
deniz wrote > after these, I guess i need to check how the request is distributed on > cloud... any ideas where I should start checking? as for replying my own question (hopefully correct) I have started digging org.apache.solr.handler.component.SearchHandler.handleRequestBody which loops (i could

Re: SolrCloud - Sorting Problem

2012-12-02 Thread deniz
Chris Hostetter-3 wrote > w/o more information about how/where you add this information, it's going > to be really hard to give you suggestions on how to fix your problem. The modifications I made is nearly the same with score field. Basically I have added a PositionAugmenter class, modified Retu

Re: A (seemingly) unavoidable bump in qtimes shortly after replication ends

2012-12-02 Thread Otis Gospodnetic
Hi, A bit hard to tell without seeing all system and Solr metrics, but the first thing I'd look at is disk IO. Yes, you may warm up the index, but you still have some new data on disk after replication, so if a user query hits data in newly copied blocks, it will *have to* be read from disk. See

Re: SolrCloud(5x) - Errors while recovering

2012-12-02 Thread deniz
Mark Miller-3 wrote > FYI, I've fixed this 5x issue a few days ago. > > - Mark Yep, after the patch, it is not occuring anymore, thank you - Zeki ama calismiyor... Calissa yapar... -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-5x-Errors-while-recovering-tp

Re: multiple indexes?

2012-12-02 Thread Joe Zhang
This is very helpful. Thanks a lot, Shaun and Dikchant! So in default single-core situation, the index would live in data/index, correct? On Fri, Nov 30, 2012 at 11:02 PM, Shawn Heisey wrote: > On 11/30/2012 10:11 PM, Joe Zhang wrote: > >> May I ask: how to set up multiple indexes, and specify

Re: DefaultSolrParams ?

2012-12-02 Thread Erik Hatcher
Bernd - There never has been, nor is there now, a method that looks for q and falls back to q.alt. The logic for that is purely in DisMaxQParser.java: if (userQuery == null || userQuery.trim().length() < 1) { // If no query is specified, we may have an alternate altUserQuery = g

Re: Atomic Updates, Payloads, Non-stored data

2012-12-02 Thread Erick Erickson
Yeah, there are a bunch, some of which have been around for years. It's not an easy problem given the structure of the index. Can't come up with the JIRAs right now, sorry. Best Erick On Sun, Dec 2, 2012 at 8:13 AM, yriveiro wrote: > Hi, > > Exists some issue open in the Solr Project about t

Re: how to do a range search not on ordered data (text type)

2012-12-02 Thread Upayavira
You can't. I guess you could extract those numbers from your text and index them into a separate numeric field. Upayavira On Sun, Dec 2, 2012, at 07:08 AM, jend wrote: > Hi, > Im building a solr install which has a blurb of data in a field > "description". > > In that field there are sentences s

Re: DefaultSolrParams ?

2012-12-02 Thread Bernd Fehling
Hi Hoss, my config has definately not changed and it worked with 3.6 and 3.6.1. Yes I have a custom plugin and if q was empty with 3.6 it picked automatically q.alt from solrconfig.xml. This all was done with params.get() With 4.x this is gone due to some changes in DefaultSolrParams(?). Which is

Re: Atomic Updates, Payloads, Non-stored data

2012-12-02 Thread yriveiro
Hi, Exists some issue open in the Solr Project about this issue? Thanks - Best regards -- View this message in context: http://lucene.472066.n3.nabble.com/Atomic-Updates-Payloads-Non-stored-data-tp4006678p4023789.html Sent from the Solr - User mailing list archive at Nabble.com.