Re: Update Index : Updating Specific Fields

2010-03-04 Thread Andrzej Bialecki
On 2010-03-04 07:41, Walter Underwood wrote: No. --wunder Or perhaps "not yet" ... http://portal.acm.org/ft_gateway.cfm?id=1458171 On Mar 3, 2010, at 10:40 PM, Kranti™ K K Parisa wrote: Hi, Is there any way to update the index for only the specific fields? Eg: Index has ONE document con

Re: 2 Cores, 1 Table, 2 DataImporter --> Import at the same time ?

2010-03-04 Thread stocki
I found it, just as you said: I didn't set a separate dataDir for my two cores ... ^^ thx =) Lance Norskog-2 wrote: > > No, a "core" is a lucene index. Two DataImportHandler sessions to the > same core will run on the same index. > > You should use lockType of simple or native. 'single' should

Re: Warning : no lockType configured for...

2010-03-04 Thread Mani EZZAT
I tried using the default solrconfig and schema (from the example in the 1.3 release) and I still get the same warnings. When I look at the log, the solrconfig seems correctly loaded, but something is strange: newSearcher warming query from solrconfig.xml}]} 2010-03-04 10:35:32,545 DEBUG [Config] s

english (american) spell dictionary

2010-03-04 Thread michaelnazaruk
Hi all! Can you please tell me where I can get a spell dictionary for Solr?

Error 400 - By search with exclamation mark ... ?! PatternReplaceFilterFactory ?

2010-03-04 Thread stocki
Hello again ;) I get this error message when searching for this: hallo ! HTTP request: select/?q=hallo+!&version=2.2&start=0&rows=10&indent=on SCHWERWIEGEND: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse 'tom !': Encountered "" at line 1

Re: Randomize MoreLikeThis

2010-03-04 Thread André Maldonado
Even if I do what you said, the MoreLikeThis result set will only change when the index changes. Or am I wrong? Thanks "Then those who were in the boat came and worshiped him, saying: Truly you are the Son of God." (Matthew 14:33) On Wed, Mar 3, 2010 at 18:03, Otis Gospodnetic wrote

How to find the location of the highlighted snippet?

2010-03-04 Thread Mark Roberts
Hi, I want to display my results in google-style "...snippet...snippet...", except I need to be able to determine if a snippet is at the beginning or the end of the content to tell whether or not to add leading/trailing "..."s At the moment, I'm using string comparison with the content field, b

Re: Update Index : Updating Specific Fields

2010-03-04 Thread Kranti™ K K Parisa
Maybe this is one very important feature to be considered for the next releases of Solr. Sometimes these kinds of cases come up. Best Regards, Kranti K K Parisa On Thu, Mar 4, 2010 at 3:01 PM, Andrzej Bialecki wrote: > On 2010-03-04 07:41, Walter Underwood wrote: > >> No. --wunder >> > > Or perha

Re: Can I used .XML files instead of .OSM files

2010-03-04 Thread mamathahl
I forgot to mention that I have been working on geospatial examples downloaded from http://www.ibm.com/developerworks/java/library/j-spatial/. I have replaced the OSM files (data) which initially existed with my data (i.e. an XML file with an OSM extension). My XML file has many data records. The 1st

field not found for search

2010-03-04 Thread Suram
Hi, I newly indexed some XML files, but their content is not found by search or autosuggestion. My XML index file is http://old.nabble.com/file/p27780413/Nike.xml Nike.xml and my schema is http://old.nabble.com/file/p27780413/schema.xml schema.xml How can I achieve this?

Re: field not found for search

2010-03-04 Thread Siddhant Goel
Did you send a commit after indexing those files? On Thu, Mar 4, 2010 at 6:30 PM, Suram wrote: > > Hi, > >I newly Indexed some xml files, it was not found for search and > autosuggestion > > My xml Index file http://old.nabble.com/file/p27780413/Nike.xmlNike.xml > > and my scheme is htt

Re: weighted search and index

2010-03-04 Thread Erick Erickson
OK, lights are finally dawning. I think what you want is payloads, see: http://www.lucidimagination.com/blog/2009/08/05/getting-started-with-payloads/ for your index-time term boosting. Query time boosting is as you ind

Re: field not found for search

2010-03-04 Thread Erick Erickson
The first thing I would do is examine your index; try Solr admin and/or Luke. The second thing I'd do is try some queries with "&debugQuery=on" to see if it's as you expect. It's hard to help without knowing how you're searching. Erick On Thu, Mar 4, 2010 at 8:00 AM, Suram wrote: > > Hi,

Re: Error 400 - By search with exclamation mark ... ?! PatternReplaceFilterFactory ?

2010-03-04 Thread Koji Sekiguchi
stocki wrote: Hllo again ;) i get these Error message when is searching for this : hallo ! hhtp request: select/?q=hallo+!&version=2.2&start=0&rows=10&indent=on SCHWERWIEGEND: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse 'tom !': Encounter
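The parse error quoted above is consistent with `!` being read by the Lucene query parser as its NOT operator with nothing following it. One possible client-side workaround, sketched here under the assumption that the `!` should be treated as a literal character, is to backslash-escape query-syntax characters before sending the request. The helper name `escape_lucene` is hypothetical; the character set follows the Lucene query parser syntax documentation, and single `&` and `|` are escaped as well, which is harmless over-escaping.

```python
# Characters the classic Lucene query parser treats as syntax.
# (&& and || are the real operators; escaping single & and | is harmless.)
LUCENE_SPECIAL = set('+-&|!(){}[]^"~*?:\\')


def escape_lucene(text):
    """Backslash-escape every query-syntax character in user input."""
    return "".join("\\" + ch if ch in LUCENE_SPECIAL else ch for ch in text)


if __name__ == "__main__":
    # The user input from the thread; the result still has to be
    # URL-encoded before it is placed in the q= parameter.
    print(escape_lucene("hallo !"))
```

Escaping only makes the query parse; whether the `!` then matches anything depends on how the field's analyzer tokenizes it.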

Re: If you could have one feature in Solr...

2010-03-04 Thread Jorg Heymans
The ability to read solr configuration files from the classpath instead of solr.solr.home directory. Jorg 2010/3/1 Noble Paul നോബിള്‍ नोब्ळ् > On Wed, Feb 24, 2010 at 7:18 PM, Patrick Sauts > wrote: > > Synchronisation between the slaves to switch the new index at the same > time > > after rep

RE: weighted search and index

2010-03-04 Thread Jianbin Dai
Thanks! Will try it. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, March 04, 2010 5:59 AM To: solr-user@lucene.apache.org Subject: Re: weighted search and index OK, lights are finally dawning. I think what you want is payloads, see: http://www.l

Relevance Customization

2010-03-04 Thread André Maldonado
Hi all. I want to know if it's possible to customize the Solr relevance, something like this: 1 - I create a static score for each document and index it. 2 - I change the relevance to Score(Solr) + Score(Static) where the solr score is equal to 30% of the total score. Mixing the two scores into onl
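The arithmetic being asked for can be sketched as a weighted sum. This is only an illustration of the formula, not a Solr API: the function name `blended_score` is hypothetical, and it assumes both inputs have already been scaled to the same range (for example 0 to 1), which raw Lucene scores are not.

```python
def blended_score(solr_score, static_score, solr_weight=0.3):
    """Weighted sum in which the Solr score contributes solr_weight
    of the total and the static score contributes the remainder.
    Assumes both scores are on the same scale (an assumption)."""
    if not 0.0 <= solr_weight <= 1.0:
        raise ValueError("solr_weight must be between 0 and 1")
    return solr_weight * solr_score + (1.0 - solr_weight) * static_score


if __name__ == "__main__":
    # A document with a strong text match but a weak static score.
    print(blended_score(solr_score=0.9, static_score=0.2))
```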

Re: weighted search and index

2010-03-04 Thread Chris Hostetter
: Subject: weighted search and index : In-reply-to: <4b8f061b.3080...@gmail.com> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if

Re: Why synchronized access to FieldValueCache in getUninvertedField.java

2010-03-04 Thread Chris Hostetter
: I have noticed that in the class UninvertedField.java there is a synchronized : access to the FieldValueCache. : I would like to know why this access is synchronized. Could this end up in a : loss of performance when there are concurrent search requests? This only synchronizes when a value hasn

Re: Too many .cfs files

2010-03-04 Thread Lance Norskog
It's not that it uses 10 files, it's that when 10 files of size X exist it merges all of them into a file of size Y. If you run an optimize everything will be merged back into one large .cfs file. On Wed, Mar 3, 2010 at 11:09 PM, mklprasad wrote: > > HI All, > I set up my 'mergerfactor ' as 10.

Re: Relevance Customization

2010-03-04 Thread Ahmet Arslan
> I want to know if its possible to customize the solr > relevance, somehing > like this: > > 1 - I create a static score for each document and index > it. > 2 - I change the relevance to Score(Solr) + Score(Static) > where the solr > score is equal to 30% of the total score. Mixing the two > scor

Re: dynamic synonyms

2010-03-04 Thread Chris Hostetter
: Wondering if this is possible. I have multiple clients using solr. Each : client could have their own synonym list. During an update process if the : record being inserted belongs to a client that happens to have it's own : synonym file is there a way to use that during indexing / querying? It :

Re: replication issue

2010-03-04 Thread Matthieu Labour
Hi I just want to post a follow up on the replication issue I encountered I have a master on which many document updates (delete and add) are happening There is one slave replicating from the master. There is only search request hitting the slave. I can see the size of the downloaded data inc

HttpDataSource consume REST API with Authentication required

2010-03-04 Thread javaxmlsoapdev
I have to use http://wiki.apache.org/solr/DataImportHandler#Usage_with_XML.2BAC8-HTTP_Datasource HttpDataSource to ask Solr to consume my REST service and index data returned from that service. My application/service has authentication/authorization. When Solr invokes this service it MUST have vali

Index an entire Phrase and not it's constituent parts?

2010-03-04 Thread Christopher Ball
How can I index an entire phrase and not its constituent parts? I want to index collations as a single term in the index, and not as the multiple terms that comprise the phrase. For example, I want to index: "as much as" but not the independent parts: "as", "much", "as". Any guidance appr

Count Sum of Term Occurrences?

2010-03-04 Thread Christopher Ball
How can I count the total number of a specific term's occurrences? How can you get the total number of occurrences of a term across all documents (e.g. the sum of the number of occurrences of a specific term in each doc)? For example, I have 3 documents, document #1 has "The green bird is flyin

Re: HttpDataSource consume REST API with Authentication required

2010-03-04 Thread Simon Rosenthal
http://issues.apache.org/jira/browse/SOLR-1490 has a patch which will do what you want -Simon On Thu, Mar 4, 2010 at 2:21 PM, javaxmlsoapdev wrote: > > I have to use > > http://wiki.apache.org/solr/DataImportHandler#Usage_with_XML.2BAC8-HTTP_Datasource > HttpDataSource to ask Solr consume my

Re: weighted search and index

2010-03-04 Thread Erick Erickson
Huh? On Thu, Mar 4, 2010 at 1:13 PM, Chris Hostetter wrote: > > : Subject: weighted search and index > : In-reply-to: <4b8f061b.3080...@gmail.com> > > http://people.apache.org/~hossman/#threadhijack > Thread Hijacking on Mailing Lists > > When starting a new discussion on a mailing list, please d

Re: Index an entire Phrase and not it's constituent parts?

2010-03-04 Thread Erick Erickson
Try KeywordTokenizerFactory. This page is very useful: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters HTH Erick On Thu, Mar 4, 2010 at 2:31 PM, Christopher Ball < christopher.b...@metaheuristica.com> wrote: > How can I Ind

highlight multi-valued field returns weird cut-off highlighted terms

2010-03-04 Thread uwdanny
Basically, the problem is that query keyword "pizza" occurs in 4 out of 10 values in multi-valued field "TEST_KEYWORDS"; in the first two matching/highlighting, only "pizz" is highlighted with an extra space beforehand ( pizz), and "a" is left behind; and in the other two highlighting, "pizza" is

RE: logging

2010-03-04 Thread Chris Hostetter
: Hi, thanks. I looked at these sites, and also the info about "java : logging": : http://java.sun.com/j2se/1.5.0/docs/guide/logging/overview.html : : But I couldn't really follow the info about configuration for logging. typically you create a logging.properties file and put it in your classp

Re: Any way to recover a corrupt index from a "live" IndexReader?

2010-03-04 Thread Chris Hostetter
: Subject: Any way to recover a corrupt index from a "live" IndexReader? There are ways in code to read from an IndexReader to populate a new index (ie: using the MergeIndex class) but I don't know of any way to do that from a running JVM. I suppose in theory there is probably a way to write a

Re: Sorting by a function that depends on the current result set

2010-03-04 Thread Chris Hostetter
: When sorting by (an integer) price field I need prices under 1 : standard deviation from the mean of the current result set to be : pushed to the end of the list. I'm not 100% certain, but i *think* that with the new sort by function work granted added to the trunk a little while back that thi

Re: Boost a document score via query using MoreLikeThisHandler

2010-03-04 Thread Chris Hostetter
: Ok, I think I need to do this with BoostQParserPlugin and nested : queries, but I can't quite figure it out. You can't actually nest parsers like this ... but you can use the "$foo" syntax to separate them out into alternate params... /solr/select/?yourChoice={!query}name:solr&q={!boost+b=$yo

Re: Issue on stopword list

2010-03-04 Thread Chris Hostetter
: I tried commongrams also but won't worked . here search "this is it" .i : would like to get exact information not for this is,is or it. : : my document like : : 101 You haven't given us enough info about what *else* you want to work ... ie: you've only talked about stop words, and not wantin

Re: Getting total term count

2010-03-04 Thread Chris Hostetter
: I want a want a way to get total term count per document. I am using This isn't a stat that Lucene (or Solr) tracks per document ... you could probably add this as a custom plugin, but it would be somewhat tricky to implement (you'd need to take advantage of the TeeTokenFilter and SinkTokeni

Re: highlight multi-valued field returns weird cut-off highlighted terms

2010-03-04 Thread Koji Sekiguchi
uwdanny wrote: Basically, the problem is that query keyword "pizza" occurs in 4 out of 10 values in multi-valued field "TEST_KEYWORDS"; in the first two matching/highlighting, only "pizz" is highlighted with an extra space beforehand ( pizz), and "a" is left behind; and in the other two highlight

Re: query execution date/time

2010-03-04 Thread Chris Hostetter
: I have written a "SearchComponent" which I use to write information : about queries to a log. Is it possible in a SearchComponent to get the : date (and time) the query was executed? Since the trivially obvious answer is to put "Date now = new Date()" in your SearchComponent, I feel like the

Re: Count Sum of Term Occurrences?

2010-03-04 Thread Ahmet Arslan
> How can I count the total number of a > specific terms occurrences? > > > > How can you get the total number of occurrences of a term > across all > documents (e.g. Sum of the number of occurrences of a > specific term in each > doc)? > > > > For example, I have 3 documents, document #

Re: Escaping options for tika/solr cell extract-only output

2010-03-04 Thread Chris Hostetter
: You can return it with any of the other writers, like JSON or PHP. The key being that the output from Tika is content -- that content just so happens to be a string containing xml -- which is then formatted by a response writer. (although given that ExtractingRequestHandler has an extract onl

Re: highlight multi-valued field returns weird cut-off highlighted terms

2010-03-04 Thread uwdanny
in this "error" case, the origin query "q=pizza"

Re: If you could have one feature in Solr...

2010-03-04 Thread Chris Hostetter
: The ability to read solr configuration files from the classpath instead of : solr.solr.home directory. Solr has always supported this. When SolrResourceLoader.openResourceLoader is asked to open a resource it first checks if it's an absolute path -- if it's not then it checks relative the "

Re: Can I used .XML files instead of .OSM files

2010-03-04 Thread Lance Norskog
Is the 'body' field a text type? If it is a string, searching for words will not work. Does search for 'id:1' work? On Thu, Mar 4, 2010 at 3:44 AM, mamathahl wrote: > > I forgot to mention that I have been working on geo-saptial examples > downloaded from http://www.ibm.com/developerworks/java/l

Re: If you could have one feature in Solr...

2010-03-04 Thread Mark Miller
On 03/04/2010 05:56 PM, Chris Hostetter wrote: : The ability to read solr configuration files from the classpath instead of : solr.solr.home directory. Solr has always supported this. When SolrResourceLoader.openResourceLoader is asked to open a resource it first checks if it's an absolute path

Re: some scores to 0 using omitNorns=false

2010-03-04 Thread Chris Hostetter
: >> We did some tests with omitNorms=false. We have seen that in the last : >> result's page we have some scores set to 0.0. This scores setted to 0 are : >> problematic to our sorters. : >> : >> It could be some kind of bug? It could be, but it isn't necessarily. "0.0" is a perfectly legal s

facet on null value

2010-03-04 Thread Andy
There's a field "A" I want to facet on. Some documents have no value for field "A". So they wouldn't show up in the list of facet value options. I want to find a way to let users to find those documents. One way is to make Null an option the users can choose, something like: value1 (4558) valu

Re: Faceted search in 2 indexes

2010-03-04 Thread Chris Hostetter
: I have 2 indexes with some similar fields and some distinct fields. I need : to make a faceted search that returns the union of the same search in these : 2 indexes. : : How can I make it? Assuming you don't need facets or sorting on the fields that are distinct between the two indexes (which

Re: Get list of deleted documents on commit

2010-03-04 Thread Chris Hostetter
: What I'd like to do is send a notification to another system when a document : have been deleted from the index. : : I first thought of doing it from the processDelete method of an : UpdateRequestProcessor, but because of the possibility of a rollback, it is : not a good idea. why not? ... y

Re: Solr with Tika - Text ordering garbled.

2010-03-04 Thread Chris Hostetter
: Our problem is that in the results returned from a search the words in the : 'Text' field are not returned in the same order as those in the original OCR : content in the PDF. This means that the snippet does not accurately reflect : the original document content. You're probably going to want

Re: Returning function result in results

2010-03-04 Thread Chris Hostetter
: Is there way to return function value in search results besides using : score? Not at the moment, but there are various patches looking into how to make something like this feasible... https://issues.apache.org/jira/browse/SOLR-1298 https://issues.apache.org/jira/browse/SOLR-1566 https://

Re: Solr query parsing

2010-03-04 Thread Chris Hostetter
: Why would fq=sdate:+20100110 parse via a Solr server but not via : QueryParsing.parseQuery? Its choking on the + symbol in the sdate : value. Is it something as simple as the fact that with a Solr server the "+" is a URL-escaped " "? Hard to say without knowing what exactly it is you are doing
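The URL-escaping point can be checked outside Solr. The sketch below uses Python's standard `urllib.parse` rather than anything from Solr: it shows that a literal `+` in a query string decodes to a space under form decoding, and that a real plus sign has to travel as `%2B`. The variable names are illustrative only.

```python
from urllib.parse import quote, unquote_plus

raw = "sdate:+20100110"

# Form decoding, as applied to HTTP query strings, turns '+' into a space,
# so the query parser behind an HTTP request never sees the plus sign.
decoded = unquote_plus(raw)

# To send a literal '+', percent-encode the value before putting it in the URL.
encoded = quote(raw, safe="")

print(repr(decoded))
print(encoded)
```

Code that hands the raw string directly to a query parser skips the decoding step, so it sees the `+` itself, which would explain the different behaviour suggested in the reply.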

Re: facet on null value

2010-03-04 Thread Lance Norskog
This query will find them: *:* AND -A:[* TO *] The '*:* AND' is to get around a weird quirk of Lucene. "Minus field range star TO star" is the trick. On Thu, Mar 4, 2010 at 3:06 PM, Andy wrote: > There's a field "A" I want to facet on. > > Some documents have no value for field "A". So they woul

Re: facet on null value

2010-03-04 Thread Andy
Are you saying if I append that query to the facet query then the response from Solr will include facet count for null? Or that it is a separate query to get the count of null? --- On Thu, 3/4/10, Lance Norskog wrote: From: Lance Norskog Subject: Re: facet on null value To: solr-user@lucene.a

Re: facet on null value

2010-03-04 Thread Koji Sekiguchi
Andy wrote: There's a field "A" I want to facet on. Some documents have no value for field "A". So they wouldn't show up in the list of facet value options. I want to find a way to let users to find those documents. One way is to make Null an option the users can choose, something like: valu

Re: facet on null value

2010-03-04 Thread Andy
Yes. Thank you. --- On Thu, 3/4/10, Koji Sekiguchi wrote: From: Koji Sekiguchi Subject: Re: facet on null value To: solr-user@lucene.apache.org Date: Thursday, March 4, 2010, 7:21 PM Andy wrote: > There's a field "A" I want to facet on. > > Some documents have no value for field "A". So they w

merge indexes command

2010-03-04 Thread Mark Fletcher
Hi, Can someone pls suggest how to use this command as a part of linux script: * http://localhost:8983/solr/admin/cores?action=mergeindexes&core=core0&indexDir=/opt/solr/core1/data/index&indexDir=/opt/solr/core2/data/index * Will just adding *curl* at the beginning help.. I tried this but it gi

Re: facet on null value

2010-03-04 Thread Lance Norskog
Ah! I did not know this one. On Thu, Mar 4, 2010 at 5:01 PM, Andy wrote: > Yes. Thank you. > > --- On Thu, 3/4/10, Koji Sekiguchi wrote: > > From: Koji Sekiguchi > Subject: Re: facet on null value > To: solr-user@lucene.apache.org > Date: Thursday, March 4, 2010, 7:21 PM > > Andy wrote: >> The

Re: merge indexes command

2010-03-04 Thread Lance Norskog
Add quotes around the URL string: curl 'http://localhost:8983/solr/admin/cores?action=mergeindexes&core=core0&indexDir=/opt/solr/core1/data/index&indexDir=/opt/solr/core2/data/index' On Thu, Mar 4, 2010 at 5:24 PM, Mark Fletcher wrote: > Hi, > > Can someone pls suggest how to use this command a
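The quoting matters because an unquoted `&` tells the shell to run everything before it in the background, so curl would only receive the URL up to `action=mergeindexes`. As a rough sketch of how a script might assemble the same command safely, the following builds the URL from the parameters in the message above and lets `shlex.quote` add the shell quoting. The helper name `merge_command` is hypothetical; the host, core name, and paths are the ones quoted in the thread.

```python
import shlex
from urllib.parse import urlencode


def merge_command(core, index_dirs, base="http://localhost:8983/solr/admin/cores"):
    """Build a shell-safe curl command for the mergeindexes request."""
    params = [("action", "mergeindexes"), ("core", core)]
    # indexDir is repeated once per source index, as in the original URL.
    params += [("indexDir", d) for d in index_dirs]
    url = base + "?" + urlencode(params, safe="/")
    return "curl " + shlex.quote(url)


if __name__ == "__main__":
    print(merge_command("core0", ["/opt/solr/core1/data/index",
                                  "/opt/solr/core2/data/index"]))
```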

Re: facet on null value

2010-03-04 Thread Chris Hostetter
: > I want to find a way to let users to find those documents. One way is to : > make Null an option the users can choose, something like: : Isn't it facet.missing=on? : http://wiki.apache.org/solr/SimpleFacetParameters#facet.missing that will get you the count, but if you then want to let

Clustering Search taking 4sec for 100 results

2010-03-04 Thread Allahbaksh Asadullah
Hi, I am using Solr for clustering. I have set the number of rows to 100 and I am using the clustering handler. The problem is that the search time for a clustering search is roughly 4 seconds. I have set -Xmx1024m. What is the best way to reduce the time? Regards, allahbaksh

Re: Documents disappearing

2010-03-04 Thread Chris Hostetter
: We index using 4 processes that read from a queue of documents. Each process : send one document at a time to the /update handler. Hmmm.. then you should have a message from the LogUpdateProcessorFactory for every individual "add" command that was received ... did you crunch those to see if

Re: Clustering Search taking 4sec for 100 results

2010-03-04 Thread Erick Erickson
Search time is only partially dependent on the number of results returned. Far more important is the number of docs in the index, the complexity of the query, any sorting you do, etc. So your question isn't really very answerable, you need to provide many more details. Things like your index size,

Re: facet on null value

2010-03-04 Thread Andy
What would the response look like with this query? Can you give an example? --- On Thu, 3/4/10, Chris Hostetter wrote: From: Chris Hostetter Subject: Re: facet on null value To: solr-user@lucene.apache.org Date: Thursday, March 4, 2010, 8:40 PM : > I want to find a way to let users to find t

Re: facet on null value

2010-03-04 Thread Lance Norskog
Set up the out-of-the-box example Solr. Index the documents in example/exampledocs. Run this query: http://localhost:8983/solr/select/?q=*:*&fq=-features:[* TO *]&version=2.2&start=0&rows=10&indent=on&facet=true&facet.field=features&facet.missing=on Now, change facet.missing=on to =off. There is
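For anyone scripting this experiment, the request above can be assembled from its parts so that the spaces and brackets in the filter query are encoded correctly. This is a sketch only: the helper name `missing_value_url` is hypothetical, while the parameter names and the `features` field are taken from the URL in the message.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def missing_value_url(field, facet_missing="on",
                      base="http://localhost:8983/solr/select/"):
    """Build a request that filters to documents with no value in `field`
    and asks for the facet.missing count on the same field."""
    params = [
        ("q", "*:*"),
        ("fq", "-%s:[* TO *]" % field),  # documents lacking any value
        ("start", "0"),
        ("rows", "10"),
        ("indent", "on"),
        ("facet", "true"),
        ("facet.field", field),
        ("facet.missing", facet_missing),
    ]
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    url = missing_value_url("features")
    print(url)
    # Decode it again to confirm the filter query survives the round trip.
    print(parse_qs(urlsplit(url).query)["fq"])
```

Calling it once with `facet_missing="on"` and once with `"off"` gives the two requests being compared.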

Re: facet on null value

2010-03-04 Thread Lance Norskog
I have added facet.limit=5 to the above to make this easier. Here is the part of the response: - - - 0 0 0 0 0 2 (What is the 2?) On Thu, Mar 4, 2010 at 7:30 PM, Lance Norskog wrote: > Set up the out-of-the-box example Solr. Index the documents in > example/examp

RE: SOLR Index or database

2010-03-04 Thread Dallan Quass
FWIW, I just implemented a system that stores the index in SOLR but the records in a partitioned set of MySQL databases. The only stored field in SOLR is an ID field, which is the key to a table in the MySQL database. I had to modify SOLR a tiny bit and write a "database" search component so that

Re: facet on null value

2010-03-04 Thread Andy
My understanding is that 2 means there are 2 documents missing a facet value. But how does adding fq=-fieldName:[* TO *] enable users to click on that value to filter? There was no value, only the count (2) was returned. --- On Thu, 3/4/10, Lance Norskog wrote: From: Lance Norskog Subject:

dynamic categorization & transactional data

2010-03-04 Thread caman
Hello all, Please see below. Any help is much appreciated. 1) Extracting data out of a text field to assign a category for certain configured words. E.g. if the text is "Google does it again with Android" and 'Google' and 'Android' are the configured words, I want to be able to assign the article