Explicitly tell Solr the analyzed value when indexing a document

2011-11-16 Thread Tim Terlegård
Hi, I have a couple of string fields. For some of them I want from my application to be able to index a lowercased string but store the original value. Is there some way to do this? Or would I have to come up with a new field type and implement an analyzer? /Tim

Re: Add copyTo Field without re-indexing?

2011-11-16 Thread Michael Kuhlmann
Am 17.11.2011 08:46, schrieb Kashif Khan: Please advise how we can reindex SOLR with having fields stored="false". we can not reindex data from the beginning just want to read and write indexes from the SOLRJ only. Please advise a solution. I know we can do it using lucene classes using indexread

Re: Add copyTo Field without re-indexing?

2011-11-16 Thread Kashif Khan
Please advise how we can reindex SOLR with having fields stored="false". we can not reindex data from the beginning just want to read and write indexes from the SOLRJ only. Please advise a solution. I know we can do it using lucene classes using indexreader and indexwriter but want to index all fie

Re: Aggregated indexing of updating RSS feeds

2011-11-16 Thread Chris Hostetter
: ..but the request I'm making is.. : /solr/myfeed?command=full-import&rows=5000&clean=false : : ..note the clean=false. I see it, but i also see this in the logs you provided... : INFO: [] webapp=/solr path=/myfeed params={command=full-import} status=0 : QTime=8 ...which means someone somewhe

Re: to prevent number-of-matching-terms in contributing score

2011-11-16 Thread Chris Hostetter
: 1. "omitTermFreqAndPositions" is very straightforward but if I avoid : positions I'll refuse to serve phrase queries. I had searched for this in but do you really need phrase queries on your "cat" field? i thought the point was to have simple matching on those terms? : 2. Function query se

Re: to prevent number-of-matching-terms in contributing score

2011-11-16 Thread Chris Hostetter
: 1. "omitTermFreqAndPositions" is very straightforward but if I avoid : positions I'll refuse to serve phrase queries. I had searched for this in but do you really need phrase queries on your "cat" field? i thought the point was to have simple matching on those terms? : 2. Function query se

Re: Solr Score Normalization

2011-11-16 Thread Chris Hostetter
: Perhaps you can solve your usecase by playing with the new eDismax : "boost" parameter, which multiplies the functions with the other score : instead of adding. and FWIW: the "boost" param of the edismax parser is really just syntactic sugar for using the "BoostQParsre wrapped arround an edi

Re: maxFieldLength clarifications

2011-11-16 Thread Chris Hostetter
:1. is the maxFieldLength parameter deprecated? :2. what is maxFieldLength counting? I understood it's counting tokens :per document (not per field) :3. what if I simply remove the maxFieldLength setting from the :solrconfig? 1. it has been deprecated and will not be used in S

Re: size of data replicated

2011-11-16 Thread Chris Hostetter
: query response time. To get a clear picture, I would like to know how : to get the size of data being replicated for each commit. Through the : admin UI, you may read a x of y G data is being replicated; however, : "y" is the total index size, instead of data being copied over. I : couldn't find

Re: Similar documents and advantages / disadvantages of MLT / Deduplication

2011-11-16 Thread Chris Hostetter
: I index 1000 docs, 5 of them are 95% the same (for example: copy pasted : blog articles from different sources, with slight changes (author name, : etc..)). : But they have differences. : *Now i like to see 1 doc in my result set and the other 4 should be marked : as similar.* Do you actaully w

Re: Search in multivalued string field does not work

2011-11-16 Thread Erick Erickson
Attach &debugQuery=true to the URL and look at the results, that'll show you what the query parsed as on the actual server. Where did shards come from? I'd advise turning all the shard stuff off until you answer this question and querying the server directly, shards may be confusing the issue. Let

Re: Dismax and phrases

2011-11-16 Thread Chris Hostetter
: I am starting to wonder whether the module giving finnish language support : (lingsoft) might be the cause? It's extremeley possible -- the details relaly matter when debugging things like this. Since i don't have any access to these custom plugins, i don't know what they might be doing, or

Look what i found here...

2011-11-16 Thread Harsha Vardhan Muthyala
Hi friend!I think I found the answer to everyones problems Look at this articlehttp://ulysse.co.za/profile/89LeeAlien/";>http://ulysse.co.za/profile/89LeeAlien/see you later.

Re: Phrase between quotes with dismax edismax

2011-11-16 Thread Erick Erickson
Ah, ok I was mis-reading some things. So, let's ignore the category bits for now. Questions: 1> Can you refine down the problem. That is, demonstrate this with a single field and leave out the category stuff. Something like q=title:"chef de projet" getting no results and q=title:"c

strange behavior of scores and term proximity use

2011-11-16 Thread Ariel Zerbib
Hi, For this term proximity query: ab_main_title_l0:"to be or not to be"~1000 http://localhost:/solr/select?q=ab_main_title_l0%3A%22og54ct8n+to+be+or+not+to+be+5w8ojsx2%22~1000&sort=score+desc&start=0&rows=3&fl=ab_main_title_l0%2Cscore%2Cid&debugQuery=true The third first results are the fol

RE: Easy way to tell if there are pending documents

2011-11-16 Thread Latter, Antoine
Excellent. It looks like I can drill down into exactly what I want without having to load up the rest of the statistics. -Original Message- From: Justin Caratzas [mailto:justin.carat...@gmail.com] Sent: Wednesday, November 16, 2011 10:41 AM To: solr-user@lucene.apache.org Subject: Re: Ea

Re: Easy way to tell if there are pending documents

2011-11-16 Thread Justin Caratzas
You can enable the stats handler (https://issues.apache.org/jira/browse/SOLR-1750), and get inspect the json pragmatically. -- Justin "Latter, Antoine" writes: > Thank you, that does help - but I am more looking for a way to get at this > programmatically. > > -Original Message- > Fro

Re: Problems installing Solr PHP extension

2011-11-16 Thread Travis Low
Ah, ausgezeichnet, thank you Kuli! We'll just use that. On Wed, Nov 16, 2011 at 11:35 AM, Michael Kuhlmann wrote: > Am 16.11.2011 17:11, schrieb Travis Low: > > >> If I can't solve this problem then we'll basically have to write our own >> PHP Solr client, which would royally suck. >> > > Oh, i

Re: Problems installing Solr PHP extension

2011-11-16 Thread Michael Kuhlmann
Am 16.11.2011 17:11, schrieb Travis Low: If I can't solve this problem then we'll basically have to write our own PHP Solr client, which would royally suck. Oh, if you really can't get the library work, no problem - there are several PHP clients out there that don't need a PECL installation.

Re: Problems installing Solr PHP extension

2011-11-16 Thread Travis Low
Thanks so much for responding. I tried your suggestion and the pecl build *seems* to go okay, but after restarting Apache, I get this again in the error_log: > PHP Warning: PHP Startup: Unable to load dynamic library > '/usr/lib64/php/modules/solr.so' - /usr/lib64/php/modules/solr.so: > undefined

Re: How to mix solr query info into the apache httpd logging (reverseproxy)?

2011-11-16 Thread alex_mass
Thanks for the answer mixing it up with params will certainly be the easiest solution. Alex -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-mix-solr-query-info-into-the-apache-httpd-logging-reverseproxy-tp3498539p3513097.html Sent from the Solr - User mailing list arch

Re: Help! - ContentStreamUpdateRequest

2011-11-16 Thread Tod
Erick, Autocommit is commented out in solrconfig.xml. I have avoided them until after the indexing process is complete. As an experiment I tried committing every n records processed to see if varying n would make a difference, it really didn't change much. My original use case had the clie

Problems with AutoSuggest feature(Terms Components)

2011-11-16 Thread mechravi25
Hi, When i search for a data i noticed two things 1.) I noticed that *terms.regex=.*&* in the logs which does a blank search on terms because of the query time is more. Is there anyway to overcome this. My actual query should go like the first one bolded but instead of that it happens like in the

Re: Search in multivalued string field does not work

2011-11-16 Thread mechravi25
Hi, Thanks for the suggestions. The index is the same in both the servers. We index using JDBC drivers. We have not modified the request handler in solrconfig on either machine and also after the latest schema update, we have re-indexed the data. *We even checked the analysis page and there is

Re: Solr Score Normalization

2011-11-16 Thread Jan Høydahl
Perhaps you can solve your usecase by playing with the new eDismax "boost" parameter, which multiplies the functions with the other score instead of adding. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 5. nov. 2011, at 01:26,

Join and faceting by children's attributes

2011-11-16 Thread Tobias
Hello, I currently have a demand for faceting on the children of a join query. My index is set up in a way that there are parent and child documents. The child documents do have the facet information in a (precisely: some) multivalue field(s). The parent documents themselves do not have any of it

Re: Problems installing Solr PHP extension

2011-11-16 Thread Adolfo Castro Menna
Pecl installation is kinda buggy. I installed it ignoring pecl dependencies because I already had them. Try: pecl install -n solr (-n ignores dependencies) And when it prompts for curl and libxml, point the path to where you have installed them, probably in /usr/lib/ Cheers, Adolfo. On Tue, Nov

Re: Different maxAnalyzedChars value in solrconfig.xml

2011-11-16 Thread Koji Sekiguchi
(11/11/16 13:12), Shyam Bhaskaran wrote: Hi, Wanted to know whether we can set different maxAnalyzedChars values in the solrconfig.xml based on different fields. Can someone point if this is possible at all, my requirement needs me to set different values for maxAnalyzedChars parameter based

Re: OutOfMemoryError when using query with sort

2011-11-16 Thread Benson Ba
Hi Hamid, i also encounterd the same OOM issue on windows 2003 (32-bits) server... but only 3 millions articles stored in solr. i would like to know your configurations to drive so many records. Many thanks. Best Regards Benson -- View this message in context: http://lucene.472066.n3.nabble

Rich document indexing

2011-11-16 Thread kumar8anuj
I am using solr 3.4 and configured my DataImportHandler to get some data from MySql as well as index some rich document from the disk. This is the part of db-data-config file where i am indexing Rich text documents. http://localhost/resumes-new/resumes${resu

Re: Can we have lucene regular and fastVectorHiglighter together in solr

2011-11-16 Thread Koji Sekiguchi
(11/11/16 18:58), Shyam Bhaskaran wrote: Hi, Can we use Lucene regular highlighter along with fastVectorHighlighter together in solrconfig.xml (solr) ? -Shyam Yes, you can. See section in solr/example/solr/conf/solrconfig.xml for example. koji -- Check out "Query Log Visualizer" for Apac

Re: Aggregated indexing of updating RSS feeds

2011-11-16 Thread sbarriba
All, Can anyone advise how to stop the "deleteAll" event during a full import? As discussed above using clean=false with Solr 3.4 still seems to trigger a delete of all previous imported data. I want to aggregate the results of multiple imports. Thanks in advance. S -- View this message in cont

Re: Phrase between quotes with dismax edismax

2011-11-16 Thread Jean-Claude Dauphin
Thanks Erick for yr quick answer. I am using Solr 3.1 1) I have set the mm parameter to 0 and removed the categories from the search. Thus the query is only for "chef de projet" and nothing else. But the problem remains, i.e searching for "chef de projet" gives no results while searching for "che