Re: edismax available in solr 3.1?

2011-05-09 Thread cyang2010
Is it a formal feature that Solr 3.1 supports, or is it still an experimental feature? If it is experimental, I would still be hesitant to use it. -- View this message in context: http://lucene.472066.n3.nabble.com/edismax-available-in-solr-3-1-tp2910613p2920975.html Sent from the Solr - Use

Re: Why special character is handled differently by standard/lucene query parser?

2011-05-06 Thread cyang2010
I know about dismax. But with that, I can't perform prefix and fuzzy queries. Can edismax handle prefix and fuzzy queries? My application logic just passes the user-entered text to the Solr server to perform term, phrase, prefix and fuzzy queries. And I don't want to escape the special char

Why special character is handled differently by standard/lucene query parser?

2011-05-06 Thread cyang2010
Hi, When user-entered text contains special characters, can this be taken care of by the tokenizer/filter configured on the field? In application code, do I need to parse the user input string and add an escape in front of those special characters? If so, will those special characters differ for
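Editorial sketch for this question: the Lucene query syntax reserves a set of characters as operators, so user-entered text generally has to be escaped before the standard parser sees it. The Python below is a minimal illustration, assuming a character set close to the one SolrJ's ClientUtils.escapeQueryChars handles; the function name and the exact set are assumptions for illustration, not Solr's own code.

```python
# Characters treated as operators by the Lucene/Solr query syntax.
# Assumed set for illustration; check the parser documentation for your version.
SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&;')


def escape_query_chars(text):
    """Return text with every special character prefixed by a backslash."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in text)


if __name__ == "__main__":
    print(escape_query_chars("ACTION & ADVENTURE (1+1)"))
```

Escaping of this kind is separate from URL encoding: the escaped string still has to be URL-encoded when it is sent over HTTP.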

edismax available in solr 3.1?

2011-05-06 Thread cyang2010
Hi, is edismax available in Solr 3.1? I don't see any documentation about it. If it is, does it support prefix and fuzzy queries? Thanks, cy

Re: why query chinese character with bracket become phrase query by default?

2011-05-05 Thread cyang2010
Nice, it works like a charm. I am using Solr 1.4.1. Here is my configuration for the Chinese field: Now I get the expected hassle-free parsing on the Solr side: title_zh_CN:(我活) title_zh_CN:(我活) title_zh_CN:我 title_zh_CN:活 ti

why query chinese character with bracket become phrase query by default?

2011-05-04 Thread cyang2010
Hi, In the Solr admin full-interface query page, the following query in English becomes a term query according to debug: title_en_US: (blood red) title_en_US: (blood red) title_en_US: (blood red) title_en_US:blood title_en_US:red title_en_US:blood title_en_US:red However, using the same syntax wi

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
Sorry, I didn't mean to give a random reply. It is just that my Solr configuration/schema is different today, so I can't reproduce the error message. Anyway, I reran the test. Basically, by specifying such a searchComponent and requestHandler, you won't get any error. When you query this it is fine without an

How to know which value matched for a multi-value field

2011-05-03 Thread cyang2010
Hi, I have a use case where I need to know, for a particular multi-value field, which particular value matched when a query is run on that field. For example, for a movie document, the movie title is a single-value field and movie actors is a multi-value field. When a user searches "colin", i know it is "col

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
I tried it. It just does not work. The debug component only works when the query component is there, and it only shows debugging information for the query result, not for the term match result.

How to debug if termsComponent is used

2011-05-02 Thread cyang2010
Hi, I defined a searchHandler just for the sake of autosuggest, using TermsComponent. explicit terms debug This configuration might not even make sense, since it configures the "terms" and "debug" components together. Is "debug" component must be wired up
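Editorial sketch for this question: a TermsComponent request for autosuggest is typically driven by the terms.fl, terms.prefix and terms.limit parameters. The Python below only builds such a request URL; the handler path, host and field name are hypothetical, since the original configuration did not survive in the archive.

```python
from urllib.parse import urlencode


def terms_url(base, field, prefix, limit=10):
    """Build a TermsComponent request URL for prefix-based suggestions."""
    params = [
        ("terms", "true"),          # enable the terms component
        ("terms.fl", field),        # field whose indexed terms are listed
        ("terms.prefix", prefix),   # only return terms starting with this prefix
        ("terms.limit", str(limit)),
    ]
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical handler path and field name.
    print(terms_url("http://localhost:8983/solr/terms", "title_name", "col"))
```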

Re: Reader per query request

2011-04-26 Thread cyang2010
Thanks a lot. That makes sense. -- CY

Re: Too many open files exception related to solrj getServer too often?

2011-04-26 Thread cyang2010
Just bumping this topic and looking for answers.

Reader per query request

2011-04-26 Thread cyang2010
Hi, I was wondering if Solr opens a new Lucene IndexReader for every query request. From a performance point of view, is there any problem with opening a lot of IndexReaders concurrently, or should the application have some logic to reuse the same IndexReader? Thanks, cy

Too many open files exception related to solrj getServer too often?

2011-04-11 Thread cyang2010
Hi, I get this SolrJ error in the development environment: org.apache.solr.client.solrj.SolrServerException: java.net.SocketException: Too many open files At the time there was no reindexing or any write to the index. There were only different queries generated using SolrJ to hit the Solr server:

Re: Fuzzy query using dismax query parser

2011-03-24 Thread cyang2010
OK, I will have to wait for the Solr 3 release then.

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread cyang2010
iorixxx, thanks for your reply. Another slightly off-topic question: I looked over all the subclasses of QParserPlugin. It seems like most of them provide complementary parsing to the default lucene/solr parser, except prefixParser. What is the intended usage of that one? The default luc

Fuzzy query using dismax query parser

2011-03-24 Thread cyang2010
Hi, I wonder how to conduct a fuzzy query using the dismax query parser. I am able to do a prefix query with local params and prefixQueryParser. But how do I handle a fuzzy query? I like the behavior of dismax except that it does not support prefix and fuzzy queries. Thanks. cy

Re: how to run boost query for non-dismax query parser

2011-03-24 Thread cyang2010
Hi iorixxx, Thanks for your reply. Yeah, an additional query with the boost value will work. However, I just wonder where you got the information that BoostQParserPlugin only handles function queries. I looked up the javadoc, and still can't find that. This is the javadoc. Create a boosted quer

how to run boost query for non-dismax query parser

2011-03-23 Thread cyang2010
Hi, I need to code some boosting logic for when some field equals some value. I was able to get it to work using the dismax query parser. However, since the Solr query will need to handle prefix or fuzzy queries, the dismax query parser is not really my choice. Therefore, i want to use standa

Why boost query not working?

2011-03-23 Thread cyang2010
Hi, This Solr query fails. It should: 1. get every title regardless of what the title_name is 2. within the result, boost the ones whose genre id = 56 (bq=genres:56^100). http://localhost:8983/solr/titles/select?indent=on&version=2.2&start=0&rows=10&fl=*%2Cscore&wt=standard&defType=dismax&qf=title_name_en_US&
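Editorial sketch for this question: with the dismax parser the main q parameter is not parsed as Lucene syntax, so a match-all query is usually supplied through q.alt=*:* while bq carries the boost. Whether that was the cause of the failure here cannot be confirmed from the truncated message; the Python below is only an assumed reconstruction that reuses the parameter values visible in the URL above.

```python
from urllib.parse import urlencode

# Assumed parameter set: q.alt is an addition for illustration,
# the other values are copied from the request in the message.
params = [
    ("defType", "dismax"),
    ("q.alt", "*:*"),            # match-all fallback used when q is absent
    ("qf", "title_name_en_US"),
    ("bq", "genres:56^100"),     # boost documents whose genre id is 56
    ("fl", "*,score"),
]
query_string = urlencode(params)

if __name__ == "__main__":
    print("http://localhost:8983/solr/titles/select?" + query_string)
```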

Re: stopFilterFactor and SnowballPorterFilterFactory not work for Spanish

2011-03-15 Thread cyang2010
I just tried with some real spanish text: "Alquileres" --> = org.apache.solr.analysis.LowerCaseFilterFactory {} term position 1 term text alquileres term type word source start,end4,14 payload

Re: stopFilterFactor and SnowballPorterFilterFactory not work for Spanish

2011-03-15 Thread cyang2010
Sorry Robert. I just used some text translated by someone. Maybe that translation is not right. Could you please give me a Spanish term with which I can show that the Spanish stemming factory is working? Thanks, cyang

Re: stopFilterFactor and SnowballPorterFilterFactory not work for Spanish

2011-03-15 Thread cyang2010
Robert, Thanks for your advice. I modified my stopword text file. Now the stopwordFilter starts to work. But the stemming filter (SnowballPorterFilterFactory, Spanish) is still not working. Does anyone have any idea about that? Thanks, cyang

stopFilterFactor and SnowballPorterFilterFactory not work for Spanish

2011-03-15 Thread cyang2010
I am using Solr 1.4.1. I am trying to index a Spanish field using the following tokenizer/filters: Using field analysis in the Solr admin I can tell that StopFilterFactory and SnowballPorterFilterFactory with Spanish are not working right: 1. after stop

Is WordDelimiterFilterFactory applicable to non-english language?

2011-03-14 Thread cyang2010
Does it make sense to apply WordDelimiterFilterFactory to non-English languages, such as Spanish? What about Asian languages? The following are the typical use cases for WordDelimiterFilterFactory. Are 1, 2, 3, and 4 applicable to all Western languages (including Spanish)? For Asian languages, is

How to intercept the http request made by solrj

2011-03-08 Thread cyang2010
Hi, Does anyone know how to intercept the HTTP request made by SolrJ? I only see the URL printed out when the request is invalid. But as part of the development/debugging process, I want to verify what HTTP request it sends to the Solr server. Thanks. CY

Re: logical relation among filter queries

2011-03-08 Thread cyang2010
Erick, Thanks for the reply. Is there any way that I can instruct Solr to combine separate filter queries as a UNION, without creating the 3rd filter query cache as I described above? If not, shall I give up using filter query for such scenario (where i query the same field with multiple value usin

Re: logical relation among filter queries

2011-03-08 Thread cyang2010
Right, I can combine that into one fq query. The only thing is that I want to reduce the cache size. I remember this is what I read on the wiki. fq=rating:R (filter query cache A) fq=rating:PG-13 (filter query cache B) fq=rating:(R OR PG-13) -- (It won't be able to leverage the fil

logical relation among filter queries

2011-03-07 Thread cyang2010
I wonder what the logical relation among filter queries is. I can't find much documentation on filter queries. For example, I want to find all titles that are either PG-13 or R through a filter query. The following query won't give me any result back. So I suppose by default it is intersection amo
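Editorial sketch for this question: separate fq parameters are intersected, so a document must satisfy every one of them, while a single fq with OR inside behaves as a union. The Python below builds both variants of the query string; the rating field comes from the thread and the match-all q is an assumption for illustration.

```python
from urllib.parse import urlencode

# Two separate fq parameters: a document must match both (intersection),
# so no single-rated title can match R and PG-13 at the same time.
intersect = urlencode([("q", "*:*"), ("fq", "rating:R"), ("fq", "rating:PG-13")])

# One fq with OR inside the parentheses: either value matches (union).
union = urlencode([("q", "*:*"), ("fq", "rating:(R OR PG-13)")])

if __name__ == "__main__":
    print(intersect)
    print(union)
```

The trade-off raised in the replies still applies: the combined filter is cached as its own entry rather than reusing the two single-value entries.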

Re: Help on Multi-language support

2011-03-04 Thread cyang2010
This is the solr schema:

Help on Multi-language support

2011-03-04 Thread cyang2010
Hi, I wonder how Solr can satisfy our multi-language requirement. For example, for movie/TV series titles, we require that, based on the user's preferred language, the user is able to get back title names (and actors, directors) in the selected language. For example, getTitlesByGenreId. On the other hand,

Re: indexing mysql dateTime/timestamp into solr date field

2011-03-02 Thread cyang2010
It turns out you don't need to use DateFormatTransformer at all. The reason the MySQL timestamp column failed to be inserted into Solr is that in schema.xml I mistakenly set "index=false, stored=false". Of course that keeps it out of the index entirely. No wonder schema browser always show no

how to debug dataimporthandler

2011-03-01 Thread cyang2010
I wonder how to run DataImportHandler in debug mode. Currently I can't get data correctly into the index through DataImportHandler, especially a timestamp column into a Solr date field. I want to debug the process. According to this wiki page: Commands The handler exposes all its API as http requests .

Re: indexing mysql dateTime/timestamp into solr date field

2011-03-01 Thread cyang2010
Bill, I did try it the way you suggested above. Unfortunately it does not work either. It is pretty much the same as my last reply, except the dateTimeFormat="-MM-dd'T'hh:mm:ss" Thanks, cyang

Re: indexing mysql dateTime/timestamp into solr date field

2011-03-01 Thread cyang2010
Yes, I am pretty sure every row has a modified field. I did my testing before posting the question. I tried adding DateFormatTransformer, but it still did not help. I assume it is o

indexing mysql dateTime/timestamp into solr date field

2011-03-01 Thread cyang2010
Hi, I can't seem to index into a Solr date field from a query result using DataImportHandler. Does anyone know how to resolve the problem? When i check the solr do
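Editorial sketch for this thread: Solr date fields expect ISO 8601 values in UTC with a trailing Z, for example 1995-12-31T23:59:59Z. The thread's actual resolution was a schema setting, so the Python below is only an illustration of the target format, assuming the MySQL value arrives as a "YYYY-MM-DD HH:MM:SS" string that is already in UTC; the function name is hypothetical.

```python
from datetime import datetime


def to_solr_date(mysql_ts):
    """Convert a MySQL DATETIME/TIMESTAMP string to Solr's date format.

    Assumes the input is already expressed in UTC.
    """
    parsed = datetime.strptime(mysql_ts, "%Y-%m-%d %H:%M:%S")
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")


if __name__ == "__main__":
    print(to_solr_date("2011-03-01 14:05:09"))
```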

Re: numberic or string type for non-sortable field?

2011-03-01 Thread cyang2010
Can I know why? I thought Solr is tuned for string if no sorting or faceting by range query is needed.

Re: numberic or string type for non-sortable field?

2011-03-01 Thread cyang2010
Sorry, I didn't make my question clear. I will only facet on the field value, not on a range query (it is just some IDs for a multi-value field). And I won't sort on the field either. In that case, is string more efficient for the requirement?

numberic or string type for non-sortable field?

2011-03-01 Thread cyang2010
I wonder if I should use Solr int or string for a field with the following requirements: multi-value, facet needed, sort not needed. The field value is an ID. Therefore, I can store it as either a numeric field or just a string. Should I choose string for efficiency? Thanks.

Re: How to handle special character in filter query

2011-02-28 Thread cyang2010
Thank you both. In order to get it (e.g. filter query --> http://...&fq=genres:ACTION & ADVENTURE) to work in the browser, I have to explicitly URL-encode the special characters, regardless of which of the following solutions I use: 1. Using double quotes on the filter query field 2. Using escape on special char

How to handle special character in filter query

2011-02-25 Thread cyang2010
How do I handle special characters when constructing a filter query? For example, I want to do something like: http://.&fq=genre:ACTION & ADVENTURE How do I handle the space and & in the filter query part? Thanks.
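Editorial sketch for this question: the space and the & have to be URL-encoded so that the & is not read as a parameter separator, and quoting the value keeps it as one phrase for the query parser. The Python below shows one way to do this with the standard library; the genre field is taken from the question and the match-all q is an assumption.

```python
from urllib.parse import urlencode

# Double quotes keep the value together as one phrase for the query parser;
# urlencode then percent-encodes the quotes, the colon and the ampersand.
fq = 'genre:"ACTION & ADVENTURE"'
query_string = urlencode({"q": "*:*", "fq": fq})

if __name__ == "__main__":
    print(query_string)
```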

Re: Help on query time boosting effect using standardQueryParser

2011-02-25 Thread cyang2010
Once I change the query to: +name:memory features:battery^100 +name:memory features:battery^100 +name:memory features:battery^100 +name:memori features:batteri^100.0 +name:memori features:batteri^100.0 Then it gets rid of the results that do not match name:memory. However
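Editorial sketch of the query shape used above: the + prefix makes a clause mandatory, while the unprefixed clause is optional and only contributes to the score, scaled by the ^ boost. The helper name below is hypothetical and the sketch just assembles the string shown in the message.

```python
def required_with_boost(required, optional, boost):
    """Build a standard-parser query: one mandatory clause plus a boosted optional one."""
    return "+{} {}^{}".format(required, optional, boost)


query = required_with_boost("name:memory", "features:battery", 100)

if __name__ == "__main__":
    print(query)
```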

Help on query time boosting effect using standardQueryParser

2011-02-25 Thread cyang2010
For the Solr example (exampleDIH), how do I achieve the following with the standard query parser? Search all docs whose name field contains "memory" (primary query logic). Within that result set, boost the docs that match features:battery (boosting logic). Note that I have to use standard queryparse

Re: solr render biased search result

2011-02-09 Thread cyang2010
That makes sense. It is a little bit indirect. You have to translate the user preference/profile into a search field value and then have the search boost the docs with that preference value.

solr render biased search result

2011-02-09 Thread cyang2010
Hi, I was asked whether Solr can render biased search results. For example, for this search (query all movie titles in the Comedy genre), for a user who indicates a preference for 1950s movies, does Solr rank the 1950s movies with a higher score (top of the list)? Or if the user is a kid, then the re

Re: nested faceting ?

2011-02-07 Thread cyang2010
I think what you are trying to achieve is called taxonomy faceting. There is a solution for that. Check the slides on taxonomy faceting: http://www.lucidimagination.com/solutions/webcasts/faceting However, I don't know if you are able to render the hierarchy all at once. The solution i point

phrase, individual term, prefix, fuzzy and stemming search

2011-01-31 Thread cyang2010
My current project has the requirement to support search when the user inputs any number of terms across a few index fields (movie title, actor, director). In order to maximize results, I plan to support all the searches listed in the subject: phrase, individual term, prefix, fuzzy and stemming. Of
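Editorial sketch of the query types listed above, in standard Lucene syntax: a quoted string is a phrase query, a trailing * is a prefix query and a trailing ~ is a fuzzy query. Stemming is not query syntax; it depends on the field's analyzer. The function and field names below are hypothetical, and the input is assumed to be non-empty and already escaped.

```python
def build_clauses(field, text):
    """Return one query string per search type for the given user text."""
    terms = text.split()
    return {
        # Whole input as an exact phrase.
        "phrase": '{}:"{}"'.format(field, " ".join(terms)),
        # Each word as an individual term clause.
        "terms": " ".join("{}:{}".format(field, t) for t in terms),
        # Prefix match on the last (possibly unfinished) word.
        "prefix": "{}:{}*".format(field, terms[-1]),
        # Fuzzy match on every word.
        "fuzzy": " ".join("{}:{}~".format(field, t) for t in terms),
    }


if __name__ == "__main__":
    for kind, clause in build_clauses("title_name", "tom cat").items():
        print(kind, clause)
```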

Re: How to group result when search on multiple fields

2011-01-28 Thread cyang2010
There were some typos in my example: I meant the first 2 movies by Angelina Jolie. This is the correct example: result 1: <-- title match score: 1.0 title_name: tom's story actor: Angelina Jolie result 2: <-- title match score: 0.95 title_name: tom green's store actor: Angelina Jolie result

Re: How to group result when search on multiple fields

2011-01-28 Thread cyang2010
On second thought, I don't think field collapsing can simply solve my problem. As I mentioned, the user only types in a search phrase and clicks on search. Under the hood, the application logic is going to compose a search query against multiple fields (title_name, actors, directors, ...) by th

Re: Is relevance score related to position of the term?

2011-01-27 Thread cyang2010
Just a little clarification: when I say position of the term, I mean the position of the term within the field. For example, in "Jamie Lee", Lee is in the second position of the name field. In "Lee Jamie", Lee is in the first position of the name field.

Re: Is relevance score related to position of the term?

2011-01-27 Thread cyang2010
Hi Em, Thanks for the reply. Basically you are saying there is no built-in solution that uses the position of the term to impact the relevancy score. In my scenario, I will get those two documents with the same score. The order depends on the sequence of indexing. Thanks, Cyang

Is relevance score related to position of the term?

2011-01-27 Thread cyang2010
Let me describe the question using an example: If I search "Lee" on the name field as an exact term match, the returned results can be: Lee Jamie Jamie Lee Will Solr grant a higher score to "Lee Jamie" vs "Jamie Lee" based on the position of the term in the name field of each document? From what i know, the sc

Re: How to group result when search on multiple fields

2011-01-26 Thread cyang2010
From a quick look, field collapsing seems to be what I want. I am still not sure what the ClusteringComponent is. I will look into it more. Is "Field Collapsing" a new feature for Solr 4.0 (not released yet)? If so, I will have to wait for it. Thanks for pointing it out!

Re: How to group result when search on multiple fields

2011-01-26 Thread cyang2010
Since it is a search applying to all fields, and the only results that require grouping are people (actors/directors), I am guessing this: 1. The search still queries a single index. 2. There are two underlying searches. One for matching movie/tv name, genres name. The other one for top two matc

How to group result when search on multiple fields

2011-01-26 Thread cyang2010
Let me give an example to illustrate my question: On the Netflix site, the search box allows you to search by movies, TV shows, actors, directors, and genres. If "Tomcat" is searched, it gives results such as: movie titles with "Tomcat" or whatever, and somewhere in between, it also shows two actors, "Tom

DIH clean=false

2011-01-25 Thread cyang2010
I am not sure if I really understand what clean=false means. In my understanding, a full-import with the default clean=true will wipe out all documents in the existing index, then fully import data from a table into the index. Is that right? Then for clean=false, my understanding is that
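Editorial sketch for this question: clean is passed as a request parameter alongside command=full-import. With clean=false the existing index is not deleted before the import, and imported documents that share a uniqueKey with existing ones replace them. The Python below only builds the request URL; the host and handler path are hypothetical.

```python
from urllib.parse import urlencode


def dih_url(base, clean):
    """Build a DataImportHandler full-import URL with an explicit clean flag."""
    params = {"command": "full-import", "clean": "true" if clean else "false"}
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical handler location.
    print(dih_url("http://localhost:8983/solr/dataimport", clean=False))
```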

Need Middleware between search client and solr?

2010-11-19 Thread cyang2010
Hi, I am new to Lucene/Solr. I have a very general question, and hope to hear your recommendations. Do you need a middleware/module between your search client and the Solr server? The response message is very Solr-specific. Do you need to translate it to an application object model and return b