Fwd: Add a plugin class to solr

2010-07-28 Thread Sanal K Stephen
Hi all, I want to add a plugin class to Solr which can filter the results based on certain criteria. I have an array which has the Solr document unique key as the index, and the value will be one or zero. If it is zero, I want to filter it from the result set. This filtering should happe

Re: SolrJ Response + JSON

2010-07-28 Thread Ranveer
Rajani is right you can get response by passing wt=json. But I think if you want to use solrj then you will require to parse binary format data in json format or you can use third party json parser. regards Ranveer http://www.onlymyhealth.com On Thursday 29 July 2010 09:55 AM, rajini maski wro

Re: logic required for newbie

2010-07-28 Thread Jonty Rhods
Again thanks for reply.. Actually I am getting result. But I am getting all column of the rows. I want to remove unnecessary column. In case of q=piza hut.. then I want to get only piza hut. Same if search query change to "ford motor" then want only ford motor. more example if query is "piza hut f

Re: SolrJ Response + JSON

2010-07-28 Thread rajini maski
Yeah right... This query will do it http://localhost:8090/solr/select/?q=*:*&version=2.2&start=0&rows=10&indent=on&wt=json This will do your work... This is more like using the XSL transformation supported by Solr..:) Regards, Rajani Maski On Wed, Jul 28, 2010 at 6:24 PM, Mark Allan wrote:
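The request URL in the message above can also be assembled programmatically. This is a minimal sketch, assuming the host, port and path from that URL (localhost:8090, /solr/select/); the helper name build_select_url is made up for illustration and is not part of Solr or SolrJ.

```python
from urllib.parse import urlencode

def build_select_url(base, query, start=0, rows=10, wt="json"):
    """Build a Solr select URL; wt selects the response writer (json here)."""
    params = [
        ("q", query),
        ("version", "2.2"),
        ("start", start),
        ("rows", rows),
        ("indent", "on"),
        ("wt", wt),
    ]
    # Keep '*' and ':' readable instead of percent-encoding them.
    return base + "?" + urlencode(params, safe="*:")

url = build_select_url("http://localhost:8090/solr/select/", "*:*")
```

Changing the wt argument is the only difference between asking for JSON and asking for another response format.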

Re: logic required for newbie

2010-07-28 Thread rajini maski
First of all I hope that in schema you have mentioned for fields indexed=true and stored=true... Next if you have done so... and now just search as q=landmark:piza... you will get one result set only.. Note : There is one constraint about applying analyzers and tokenizers... IF you apply white spa

Is solr able to merge index on different nodes

2010-07-28 Thread Chengyang
Once I want to create a large index, can I split the index across different nodes and then merge all the indexes onto one node? Any further suggestions for this case?

Help with schema design

2010-07-28 Thread Pramod Goyal
Hi, I have a use case where i get a document and a list of events that has happened on the document. For example First document: Some text content Events: (Event Type | Event By | Event Time) Update | Pramod | 06062010 2:30:00; Update | Raj | 06062010 2:30:00

Re: Scoring Search for autocomplete

2010-07-28 Thread Chris Hostetter
You weren't really clear on how you are generating your autocomplete results -- ie: via TermsComponent on your "main" index? or via a search on a custom index where each document is a "word" to suggest? Assuming the latter, then the approach you describe below sounds good to me, but it doesn'

Re: WordDelimiterFilter and phrase queries?

2010-07-28 Thread Chris Hostetter
: pos token offset
: 1 3 0-1
: 2 diphenyl 2-10
: 3 propanoic 11-20
: 3 diphenylpropanoic 2-20
: Say someone enters the query string 3-diphenylpropanoic
:
: The query parser I'm using transforms this into a phrase query and the
: indexed form is missed because based the positions of the terms '3'
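The position table quoted above can be used to see why a phrase query might miss. This is a toy sketch, not Lucene's actual phrase matching: it assumes a phrase matches only when each query term sits at the position immediately after the previous one (no slop). The dictionary copies the positions from the quoted table; phrase_matches is a hypothetical helper.

```python
# Token -> set of positions, copied from the quoted analysis output.
INDEXED_POSITIONS = {
    "3": {1},
    "diphenyl": {2},
    "propanoic": {3},
    "diphenylpropanoic": {3},
}

def phrase_matches(index, terms):
    """Return True if the terms occur at consecutive positions (toy model)."""
    if not terms:
        return False
    for start in index.get(terms[0], ()):
        # Term i of the phrase must be indexed at position start + i.
        if all((start + i) in index.get(term, ()) for i, term in enumerate(terms)):
            return True
    return False

split_form = phrase_matches(INDEXED_POSITIONS, ["3", "diphenyl", "propanoic"])
joined_form = phrase_matches(INDEXED_POSITIONS, ["3", "diphenylpropanoic"])
```

Under this model the split form matches, while "3" followed directly by "diphenylpropanoic" does not, because the joined token is indexed at position 3 rather than position 2.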

Re: Solr using 1500 threads - is that normal?

2010-07-28 Thread Erick Erickson
Your commits are very suspect. How often are you making changes to your index? Do you have autocommit on? Do you commit when updating each document? Committing too often and consequently firing off warmup queries is the first place I'd look. But I agree with dc tech, 1,500 is way more than I wou

Re: Solr using 1500 threads - is that normal?

2010-07-28 Thread dc tech
1,500 threads seems extreme by any standards so there is something happening in your install. Even with appservers for web apps, typically 100 would be a fair # of threads. On 7/28/10, Christos Constantinou wrote: > Hi, > > Solr seems to be crashing after a JVM exception that new threads cannot

Re: simple question from a newbie

2010-07-28 Thread Erick Erickson
What is the query you submit (don't forget &debugQuery=on)? In particular, what field are you sorting on? But yes, if you're searching on a tokenized field, you'll get matches on all tokens in that field. Which are probably single words. And no matter how you sort, you're still getting documents w

Re: Show elevated Result Differently

2010-07-28 Thread Erick Erickson
Please expand on what this means, it's quite vague. You might review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Wed, Jul 28, 2010 at 8:43 AM, Vishal.Arora wrote: > > I want to show elevated Result Different from others is there any way to do > this > -- > View this message in

Know which terms are in a document

2010-07-28 Thread Max Lynch
I would like to search against my index, and then *know* which of a set of given terms were found in each document. For example, let's say I want to show articles with the word "pizza" or "cake" in them, but would like to be able to say which of those two was found. I might use this to handle

Re: Using Solr to perform range queries in Dspace

2010-07-28 Thread Chris Hostetter
: I'm trying to use dspace to search across a range of index created and stored : using Dsindexer.java class. I have seen where Solr can be use to perform I've never heard of Dsindexer.java but since this is the first result Google returns... http://scm.dspace.org/trac/dspace/browser/trunk/dspa

Re: SolrCore has a large number of SolrIndexSearchers retained in "infoRegistry"

2010-07-28 Thread skommuri
Hi, It didn't seem like it improved the situation. The same exception stack traces are found. I have explicitly defined the index readers to be reopened by specifying in the solrconfig.xml The exception occurs when the remote cores are being searched. I am attaching the exceptions in a text fi

Problem with field collapsing

2010-07-28 Thread Moazzam Khan
Hi All, Whenever I use field collapse, the "numFound" attribute contains exactly as many rows as I put in rows parameter instead of returning total number of documents that matched the query. Is there a way to rectify this? Thanks, Moazzam

RE: How to 'filter' facet results

2010-07-28 Thread Nagelberg, Kallin
ManBearPig is still a threat. -Kallin Nagelberg -Original Message- From: Jonathan Rochkind [mailto:rochk...@jhu.edu] Sent: Tuesday, July 27, 2010 7:44 PM To: solr-user@lucene.apache.org Subject: RE: How to 'filter' facet results > Is there a way to tell Solr to only return a specific se

Re: Total number of terms in an index?

2010-07-28 Thread Jonathan Rochkind
At first I was thinking the TermsComponent might give you this, but oddly it seems not to. http://wiki.apache.org/solr/TermsComponent

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Tommaso, I used your patch and tried it with the 1.4.1 solr.war from a fresh 1.4.1 distribution, and it still gave me that NoSuchMethodError. However, when I tried it with the newly-patched-and-compiled apache-solr-1.4.2-dev.war file it works. I think I tried that before and it didn't work.

Re: Total number of terms in an index?

2010-07-28 Thread Jason Rutherglen
Tom, The total number of terms... Ah well, not a big deal, however yes the flex branch does expose this so we can show this in Solr at some point, hopefully outside of Solr's Luke impl. On Tue, Jul 27, 2010 at 9:27 AM, Burton-West, Tom wrote: > Hi Jason, > > Are you looking for the total number

RE: display solr result in JSP

2010-07-28 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your reply. I don't have much experience at JSP. I found tag library, and am trying to use " ". Unfortunately I didn't get it work. Would you please give me more information? I really appreciate your help! Thanks, Xiaohui -Original Message- From: Ranveer [mailto:r

How do "NOT" queries work?

2010-07-28 Thread Kaan Meralan
I wonder how "NOT" queries work. Is it a pass over the result set, filtering out the "NOT" property, or something like that? Also, is there anybody who has done some performance checks on "NOT" queries? I want to know whether there is a significant performance degradation or not when you have "NOT"

Re: slave index is bigger than master index

2010-07-28 Thread Muneeb Ali
>> In solrconfig.xml, these two lines control that. Maybe they need to be increased. >> 5000 >> 1 Where do I add those in solrconfig? These lines don't seem to be present in the example solrconfig file... -- View this message in context: http://lucene.472066.n3.nabble.com/slave-i

Re: slave index is bigger than master index

2010-07-28 Thread Muneeb Ali
Well I do have disk limitations too, and thats why I think slave nodes died, when replicating data from master node. (as it was just adding on top of existing index files). :: What do you mean here? Optimizing is too CPU expensive? What I meant by avoid playing around with slave nodes is that d

Solr 1.4.1 field collapse

2010-07-28 Thread Moazzam Khan
Hi guys, I read somewhere that Solr 1.4.1 has field collapse support by default (without patching it) but I haven't been able to confirm it. Is this true? - Moazzam

RE: simple question from a newbie

2010-07-28 Thread Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR)
I think I got it to work. If I do a wildcard search using the dc3.title field it seems to work fine (dc3.title:c*). The dc.title:c* returns every title that has a word in it that starts with 'c', which isn't exactly what I wanted. I'm guessing it's because of the type="caseInsensitiveSort". W

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Hi Chantal, thank you for the feedback. I did not see the wood for the trees! The SolrDocument's javadoc says the following: http://lucene.apache.org/solr/api/org/apache/solr/common/SolrDocument.html |*getFieldValue <../../../../org/apache/solr/common/SolrDocument.html#getFieldValue%28java.la

Re: Is there a cache for a query?

2010-07-28 Thread Moazzam Khan
As far as I know all searches get cached at least for some time. I am not sure about field collapse results being cached. - Moazzam http://moazzam-khan.com On Mon, Jul 26, 2010 at 9:48 PM, Li Li wrote: > I want a cache to cache all result of a query(all steps including > collapse, highlight and

Re: Spellchecking and frequency

2010-07-28 Thread Jonathan Rochkind
I therefore wrote an implementation of SolrSpellChecker that wraps jazzy, the java aspell library. I also extended the SpellCheckComponent to take the matrix of suggested words and query the corpus to find the first combination of suggestions which returned a match. This works well for my use ca
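The second step described above, trying combinations of suggested words until one returns a match from the corpus, can be sketched roughly as follows. This is not the author's code: first_matching_combination and the has_match callback (standing in for a real query against the index) are assumptions for illustration only.

```python
from itertools import product

def first_matching_combination(suggestions_per_word, has_match):
    """Try suggestion combinations in order; return the first one that matches.

    suggestions_per_word: one list of candidate words per query word.
    has_match: callable taking a tuple of words, True if the corpus matches.
    """
    for combo in product(*suggestions_per_word):
        if has_match(combo):
            return list(combo)
    return None

# Stand-in for the corpus: the only word combination that "exists".
known_phrases = {("solr", "search")}
matrix = [["solar", "solr"], ["serch", "search"]]
best = first_matching_combination(matrix, lambda words: words in known_phrases)
```

In the worst case the number of combinations grows as the product of the list lengths, so a real implementation would probably cap how many suggestions per word it considers.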

RE: Indexing Problem: Where's my data?

2010-07-28 Thread Michael Griffiths
Thanks - but my schema.xml is not recognizing field names specified in the data-config.xml. For example - and I just tested this now - if I have in my data-config.xml: And then in my schema.xml: Then no documents are processed (e.g. I get rows queried, but 0 in the data handler UI). But if

Re: simple question from a newbie

2010-07-28 Thread Ranveer
I think you are using a wildcard search, or should use one. But first of all please provide the schema and configuration file for more details. regards Ranveer On Wednesday 28 July 2010 07:51 PM, Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR) wrote: Hi, I'm new to Solr and have a rather

Re: display solr result in JSP

2010-07-28 Thread Ranveer
Hi, it is very simple to display values in JSP. If you are using SolrJ, simply store the values in a bean from your Java class and display them. You can do the same thing in a servlet too: get the Solr server response and return it in a bean, or display it directly (in the servlet). Hope you will be able to do it. regards Ranvee

Re: logic required for newbie

2010-07-28 Thread Jonty Rhods
Hi, thanks for the reply. Actually the requirement is different (sorry if I was unable to clarify in the first mail). Basically, the following are the field names in the schema as well: > 1. id > 2. name > 3. user_id > 4. location > 5. country > 6. landmark1 > 7. landmark2 > 8. landmark3 > 9. landmark4 > 10. landmark5

Re: SolrJ Response + JSON

2010-07-28 Thread Chantal Ackermann
Hi Mitch On Wed, 2010-07-28 at 16:38 +0200, MitchK wrote: > Thank you, Chantal. > > I have looked at this one: http://www.json.org/java/index.html > > This seems to be an easy-to-understand-implementation. > > However, I am wondering how to determine whether a SolrDocument's field > is multiVa

display solr result in JSP

2010-07-28 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
I am new to Solr. I just got the example XML files indexed and searchable by following the Solr tutorial. I wonder how I can display the search results in a JSP. I really appreciate any suggestions you can give. Thanks so much, Xiaohui

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Thank you, Chantal. I have looked at this one: http://www.json.org/java/index.html This seems to be an easy-to-understand-implementation. However, I am wondering how to determine whether a SolrDocument's field is multiValued or not. The JSONResponseWriter of Solr looks at the schema-configurat

simple question from a newbie

2010-07-28 Thread Nguyen, Vincent (CDC/OSELS/NCPHI) (CTR)
Hi, I'm new to Solr and have a rather dumb question. I want to do a query that returns all the Titles that start with a certain letter. For example I have these titles: Results of in-mine research in support Cancer Reports State injury indicators report Cancer Reports Indexed dermal

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Tommaso Teofili
This was my same feeling :-) and so I went for the trunk to have things working quickly, but I also have to consider which one is the best version since I am going to deploy it in the near future in an enterprise environment and choosing the best version is an important step. I am quite new to Solr

Re: logic required for newbie

2010-07-28 Thread rajini maski
You can index each of these fields separately... field1-> Id field2-> name field3->user_id field4->country. field7-> landmark While querying you can specify "q=Landmark9" This will return you results.. And if you want only particular fields in the output.. use the "fl" parameter in the query...
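A minimal sketch of restricting the returned fields with the fl parameter. The base URL http://localhost:8983/solr/select is only a placeholder, the field names are taken from the thread, and build_query_url is a hypothetical helper rather than anything Solr ships.

```python
from urllib.parse import urlencode

def build_query_url(base, query, fields):
    """Build a select URL that returns only the listed stored fields."""
    params = {"q": query, "fl": ",".join(fields)}
    # Leave ':' and ',' unescaped so the URL stays readable.
    return base + "?" + urlencode(params, safe=":,")

url = build_query_url(
    "http://localhost:8983/solr/select",  # placeholder host and port
    "landmark1:piza",
    ["id", "name"],
)
```

Only stored fields can be returned this way, which is why the earlier reply in this thread asks for stored=true in the schema.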

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Thanks, I'll try that then. I kind of figured that'd be the answer, but after fighting with Solr & ExtractingRequestHandler for 2 days I also just wanted to be done with it once it started working with 4.0...=) However, stability would be better in the long run. Best, Dave -Original Messa

RE: Solr 3.1 and ExtractingRequestHandler resulting in blank content

2010-07-28 Thread David Thibault
If you don't store the content then you can't do highlighting, right? Also, don't you just have to switch the text field to say stored="true" in your schema to store the text? I don't understand why you're differentiating the behavior of ExtractingRequestHandler from the behavior of Solr in ge

Re: SolrJ Response + JSON

2010-07-28 Thread Chantal Ackermann
You could use org.apache.solr.handler.JsonLoader. That one uses org.apache.noggit.JSONParser internally. I've used the JacksonParser with Spring. http://json.org/ lists parsers for different programming languages. Cheers, Chantal On Wed, 2010-07-28 at 15:08 +0200, MitchK wrote: > Hello , > > S

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Alessandro Benedetti
In my opinion, the 1.4.1 version with the Patch is more Stable. Until 4.0 will be released 2010/7/28 David Thibault > Yesterday I did get this working with version 4.0 from trunk. I haven't > fully tested it yet, but the content doesn't come through blank anymore, so > that's good. Would

RE: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread David Thibault
Yesterday I did get this working with version 4.0 from trunk. I haven't fully tested it yet, but the content doesn't come through blank anymore, so that's good. Would it be more stable to stick with 1.4.1 and your patch to get to Tika 0.8, or to stick with the 4.0 trunk version? Best, Dave -

Re: SolrJ Response + JSON

2010-07-28 Thread MitchK
Thank you Markus, Mark. Seems to be a problem with Nabble, not with the mailing list. Sorry. I can create a JSON-response, when I query Solr directly. But I mean, that I query Solr through a SolrJ-client (CommonsHttpSolrServer). That means my queries look a little bit like that: http://wiki.apa

Re: SolrJ Response + JSON

2010-07-28 Thread Markus Jelsma
Hi, I got a response to your e-mail in my box 30 minutes ago. Anyway, enable the JSONResponseWriter, if you haven't already, and query with wt=json. Can't get much easier. Cheers, On Wednesday 28 July 2010 15:08:26 MitchK wrote: > Hello , > > Second try to send a mail to the mailing list...

Re: SolrJ Response + JSON

2010-07-28 Thread Mark Allan
On 28 Jul 2010, at 2:08 pm, MitchK wrote: Second try to send a mail to the mailing list... Your first attempt got through as well. Here's my original response. I think you should just be able to add &wt=json to the end of your query (or change whatever the existing wt parameter is in your

SolrJ Response + JSON

2010-07-28 Thread MitchK
Hello , Second try to send a mail to the mailing list... I need to translate SolrJ's response into JSON-response. I can not query Solr directly, because I need to do some math with the responsed data, before I show the results to the client. Any experiences how to translate SolrJ's response i

Re: SolrJ Response + JSON

2010-07-28 Thread Mark Allan
I think you should just be able to add &wt=json to the end of your query (or change whatever the existing wt parameter is in your URL). Mark On 28 Jul 2010, at 12:54 pm, MitchK wrote: Hello community, I need to transform SolrJ - responses into JSON, after some computing on those results

Show elevated Result Differently

2010-07-28 Thread Vishal.Arora
I want to show elevated results differently from the others. Is there any way to do this? -- View this message in context: http://lucene.472066.n3.nabble.com/Show-elevated-Result-Differently-tp1002081p1002081.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: clustering component

2010-07-28 Thread Stanislaw Osinski
> The patch should also work with trunk, but I haven't verified it yet. > I've just added a patch against solr trunk to https://issues.apache.org/jira/browse/SOLR-1804. S.

Highlighted match snippets highlight non-matched words (such as 0.1 and 0.2)

2010-07-28 Thread Jon Cram
Hi, I'm observing some strange highlighted words in field value snippets returned from Solr when matched term highlighting (http://wiki.apache.org/solr/HighlightingParameters) is enabled. In some cases, highlighted field value snippets contain highlighted words that are not matches: -

Get unique values

2010-07-28 Thread Rafal Bluszcz Zawadzki
Hi, In my schema I have (inter alia) the fields CollectionID and CollectionName. These two values always match together, which means that for every value of CollectionID there is a matching value of CollectionName. I am interested in a query which allows me to get unique values of CollectionID with mat

SolrJ Response + JSON

2010-07-28 Thread MitchK
Hello community, I need to transform SolrJ - responses into JSON, after some computing on those results by another application has finished. I can not do those computations on the Solr - side. So, I really have to translate SolrJ's output into JSON. Any experiences how to do so without writing

Re: Strange search

2010-07-28 Thread stockii
try to delete "solr.SnowballPorterFilterFactory" from your analyzerchain. i had similar problems by using german SnowballPorterFilterFactory -- View this message in context: http://lucene.472066.n3.nabble.com/Strange-search-tp998961p1001990.html Sent from the Solr - User mailing list archive at

Solr using 1500 threads - is that normal?

2010-07-28 Thread Christos Constantinou
Hi, Solr seems to be crashing after a JVM exception that new threads cannot be created. I am writing in hope of advice from someone that has experienced this before. The exception that is causing the problem is: Exception in thread "btpool0-5" java.lang.OutOfMemoryError: unable to create new n

Re: DIH : SQL query (sub-entity) is executed although variable is not set (null or empty list)

2010-07-28 Thread Chantal Ackermann
Hi Lance! On Wed, 2010-07-28 at 02:31 +0200, Lance Norskog wrote: > Should this go into the trunk, or does it only solve problems unique > to your use case? The solution is generic but is an extension of XPathEntityProcessor because I didn't want to touch the solr.war. This way I can deploy the e

Re: Indexing Problem: Where's my data?

2010-07-28 Thread Chantal Ackermann
make sure to set stored="true" on every field you expect to be returned in your results for later display. Chantal

Re: Spellchecking and frequency

2010-07-28 Thread dan sutton
Hi Mark, Thanks for that info looks very interesting, would be great to see your code. Out of interest did you use the dictionary and the phonetic file? Did you see better results with both? In regards to the secondary part to check the corpus for matching suggestions, would another way to do thi

solr log file rotation

2010-07-28 Thread Christos Constantinou
Hi all, I am running a Solr 1.4 instance on FreeBSD that generates large log files in very short periods. I used /etc/newsyslog to configure log file rotation, however once the log file is rotated then Solr doesn't write logs to the new file. I'm wondering if there is a way to let Solr know tha

Re: Integration Problem

2010-07-28 Thread Jörg Wißmeier
Nobody out there who can help me with this problem? I need to edit the result of the javabin writer (adding the results from the webservice). I hope it is possible to do that. thanks in advance. On Mon 26.07.2010 10:25, Jörg Wißmeier wrote: >Hi everybody, > >since a while i'm working with solr

Re: SpatialSearch: sorting by distance

2010-07-28 Thread Pavel Minchenkov
Does anybody know if this feature works correctly? Or I'm doing something wrong? 2010/7/27 Pavel Minchenkov > Hi, > > I'm trying to sort by distance like this: > > sort=dist(2,lat,lon,55.755786,37.617633) asc > > In general results are sorted, but some documents are not in right order. > I'm usi
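For context, Solr's dist(2, ...) function computes a Euclidean (2-norm) distance over the raw field values. Whether that accounts for the ordering reported in this thread is not established here; the sketch below only shows, under that assumption, that sorting by Euclidean distance in degrees can rank two points differently from a great-circle distance. The two sample points are invented for the illustration and the function names are hypothetical.

```python
import math

def euclidean_degrees(lat1, lon1, lat2, lon2):
    """2-norm distance on raw latitude/longitude values, in degrees."""
    return math.hypot(lat1 - lat2, lon1 - lon2)

def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in kilometres on a spherical Earth."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))

center = (55.755786, 37.617633)               # the point used in the sort parameter
north = (center[0] + 0.6, center[1])          # invented sample: 0.6 degrees north
east = (center[0], center[1] + 0.9)           # invented sample: 0.9 degrees east

north_first_by_degrees = euclidean_degrees(*north, *center) < euclidean_degrees(*east, *center)
east_first_by_km = haversine_km(*east, *center) < haversine_km(*north, *center)
```

At this latitude a degree of longitude covers far fewer kilometres than a degree of latitude, so the eastern point is nearer on the ground even though it is farther in raw degrees.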

Re: Extracting PDF text/comment/callout/typewriter boxes with Solr CELL/Tika/PDFBox

2010-07-28 Thread Tommaso Teofili
I attached a patch for Solr 1.4.1 release on https://issues.apache.org/jira/browse/SOLR-1902 that made things work for me. This strange behaviour for me was due to the fact that I copied the patched jars and war inside the dist directory but forgot to update the war inside the example/webapps direc

Re: Any tips/guidelines to turning the Solr/luence performance in a master/slave/sharding environment

2010-07-28 Thread Tommaso Teofili
Hi, I think the starting point should be: http://wiki.apache.org/solr/SolrPerformanceFactors For example you could start playing with the mergeFactor parameter. My 2 cents, Tommaso 2010/7/27 Chengyang > How to reduce the index files size, decrease the sync time between each > nodes. decrease th

Re: question about relevance

2010-07-28 Thread Bharat Jain
Well you are correct Erik that this is a database-ish thing try to achieve in solr and unfortunately the sin :) had been committed by somebody else :) and now we are running into relevancy issues. Let me try to state the problem more casually. 1. There are user records of type A, B, C etc. (userI