Re: 2 solr dataImport requests on a single core at the same time

2010-07-22 Thread kishan
Hi, thank you very much, that solved my problem. Having multiple request handlers will not degrade performance unless we are sending parallel requests, am I right? Thanks, Prasad -- View this message in context: http://lucene.472066.n3.nabble.com/2-solr-dataImport-requests-on-a-single-core-

RE: Tree Faceting in Solr 1.4

2010-07-22 Thread Jonathan Rochkind
Solr does not, yet, at least not simply, as far as I know, but there are ideas and some JIRA's with maybe some patches: http://wiki.apache.org/solr/HierarchicalFaceting From: rajini maski [rajinima...@gmail.com] Sent: Friday, July 23, 2010 12:34 AM To: s

Re: Tree Faceting in Solr 1.4

2010-07-22 Thread rajini maski
I am also looking for the same feature in Solr and am keen to know whether it supports tree faceting, or whether we are forced to index in a tree-faceting format like 1/2/3/4 1/2/3 1/2 1. In the case of multilevel faceting, what I found is that it gives only a 2-level tree facet. If i give
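The path-style format mentioned above (1/2/3/4 1/2/3 1/2 1) can be produced at index time by emitting every ancestor prefix of a category path. The following is a minimal sketch in plain Java, assuming a `/` separator; the class and method names are hypothetical and not part of Solr.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not a Solr API): expands "1/2/3/4" into its ancestor paths. */
final class FacetPathPrefixes {

    /** Returns the full path first, then each shorter ancestor: 1/2/3/4, 1/2/3, 1/2, 1. */
    static List<String> prefixes(String path) {
        List<String> result = new ArrayList<>();
        String current = path;
        while (!current.isEmpty()) {
            result.add(current);
            int cut = current.lastIndexOf('/');
            // No separator left means this was the top-level node, so stop.
            current = (cut < 0) ? "" : current.substring(0, cut);
        }
        return result;
    }

    public static void main(String[] args) {
        // Each value would go into a multi-valued facet field of the document.
        System.out.println(prefixes("1/2/3/4"));
    }
}
```

Faceting on such a field returns counts for every level, and the client can rebuild the tree from the path strings.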

Re: Novice seeking help to change filters to search without diacritics

2010-07-22 Thread HSingh
Hoss, thank you for your helpful response! : i think what's confusing you is that you are using the : MappingCharFilterFactory with that file in your "text" field type to : convert any ISOLatin1Accent characters to their "base" characters The problem is that a large range of characters are not g

Re: filter query on timestamp slowing query???

2010-07-22 Thread Chris Hostetter
: You are correct, first of all i haven't moved yet to the TrieDateField, but i : am still waiting to find out a bit more information about it, and there's : not a lot of info, other than in the xml file. In general TrieFields are a way of trading disk space for range query speed. they are expla

Re: DIH stalling, how to debug?

2010-07-22 Thread Tommy Chheng
Ok, it was a runaway SQL query which isn't using an index. @tommychheng Programmer and UC Irvine Graduate Student Find a great grad school based on research interests: http://gradschoolnow.com On 7/22/10 4:26 PM, Tommy Chheng wrote: Hi, When I run my DIH script, it says it's "busy" but the "

DIH stalling, how to debug?

2010-07-22 Thread Tommy Chheng
Hi, When I run my DIH script, it says it's "busy" but the "Total Requests made to DataSource" and "Total Rows Fetched" remain unchanged at 4 and 6. It hasn't reported a failure. How can I debug what is blocking the DIH? -- @tommychheng Programmer and UC Irvine Graduate Student Find a great

Re: Duplicates

2010-07-22 Thread Erick Erickson
If the field is a single token, just define the uniqueKey on it in your schema. Otherwise, this may be of interest: http://wiki.apache.org/solr/Deduplication Haven't used it myself though... best Erick On Thu, Jul 22, 2010 at 6:14 PM, Pavel Minchenkov wrote: > Hi, > > Is it possible to remove

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
This seems to work from SolrJ now: ModifiableSolrParams params = new ModifiableSolrParams(); params.set("qt", "/clustering"); params.set("q", query); params.set("carrot.title", "filename_s"); params.set("rows","100"); params.se

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
Yeah, my results count is 151 and only 21 documents appear in 6 clusters. This is true whether I use URL or SolrJ. When I use carrot workbench and point to my Solr using local clustering, the workbench has numerous clusters and all documents are placed On Thu, 2010-07-22 at 18:06 +0200, Stan

Re: boosting particular field values

2010-07-22 Thread Chris Hostetter
I believe this came up on IRC, and the end result was that the bq was working fine; Justin just wasn't noticing because he added it to his solrconfig.xml (and not to the query URL) and his browser was still caching the page -- so he didn't see his boost affect anything (but i may be confusing j

Re: Finding distinct unique IDs in documents returned by fq -- Urgent Help Req

2010-07-22 Thread Chris Hostetter
: > being returned (consider the case where we are sorting in term order - once : > we have collected counts for ${facet.limit} constraints, we can stop : > iterating over terms -- but to compute the total number of constraints (ie: : > terms) we would have to keep going and test every one of them a

Duplicates

2010-07-22 Thread Pavel Minchenkov
Hi, Is it possible to remove duplicates in search results by a given field? Thanks. -- Pavel Minchenkov

Re: calling other core from request handler

2010-07-22 Thread Chris Hostetter
: It looks I can : call coreA.getCoreDescriptor().getCoreContainer().getCore("coreB"); and then get : the Searcher and release it when I am done. : : Is there a better way? not really ... not unless you want to do it via HTTP to "localhost" : And it also appears that during the inform or init

WordDelimiterFilter and phrase queries?

2010-07-22 Thread Drew Farris
Hi All, A question about the WordDelimiterFilter and position increments / phrase queries: I have a string like: 3-diphenyl-propanoic When indexed, it is broken up into the following tokens:
pos  token              offset
1    3                  0-1
2    diphenyl           2-10
3    propanoic          11-20
3    diphenylpropanoic  2-20
The WordDelimit

Re: Finding distinct unique IDs in documents returned by fq -- Urgent Help Req

2010-07-22 Thread Jonathan Rochkind
Chris Hostetter wrote: computing the number: in some algorithms it's relatively cheap (on a single server) but in others it's more expensive than computing the facet counts being returned (consider the case where we are sorting in term order - once we have collected counts for ${facet.limit} c

Re: Providing token variants at index time

2010-07-22 Thread Jonathan Rochkind
Paul Dlug wrote: On Thu, Jul 22, 2010 at 4:01 PM, Jonathan Rochkind wrote: The synonym approach won't work as I need to provide them in a file. The variants may be more dynamic and not known in advance, the process creating the documents to index does have that logic and could easily put th

Re: Novice seeking help to change filters to search without diacritics

2010-07-22 Thread Chris Hostetter
: I am new to Solr and seeking your help to change filter from : ISOLatin1AccentFilterFactory to ASCIIFoldingFilterFactory files. I am not According to the files you posted, you aren't using the ISOLatin1AccentFilterFactory -- so problem solved w/o making any changes. : sure what change is to

commit is taking very very long time

2010-07-22 Thread bbarani
Hi, I am not sure why some commits take a very long time. I have a batch indexing job that commits just once after it completes indexing. I tried to index just 36 rows, but the total time taken was about 12 minutes. The indexing itself was quick, only about 30 seconds, but it took the remai

Re: about warm up

2010-07-22 Thread Chris Hostetter
: I want to load full text into an external cache, So I added some code : in newSearcher where I found the warm up takes place. I add my code ... : public void newSearcher(SolrIndexSearcher newSearcher, : SolrIndexSearcher currentSearcher) { : warmTextCache(newSearcher,warmTextCache,

Re: How to get the list of all available fields in a (sharded) index

2010-07-22 Thread Chris Hostetter
: I cannot find any info on how to get the list of current fields in an index : (possibly sharded). With dynamic fields, I cannot simply parse the schema to there isn't one -- the LukeRequestHandler can tell you what fields *actually* exist in your index, but you'd have to query it on each shard

Re: stats on a field with no values

2010-07-22 Thread Chris Hostetter
: : When I use the stats component on a field that has no values in the result set : (ie, stats.missing == rowCount), I'd expect that 'min'and 'max' would be : blank. : : Instead, they seem to be the smallest and largest float values or something, : min = 1.7976931348623157E308, max = 4.9E-324 .
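The two values reported above are exactly Java's `Double.MAX_VALUE` and `Double.MIN_VALUE`. One plausible reading, offered here as an assumption rather than a confirmed description of the stats component, is that a running minimum and maximum were seeded with those sentinels and never updated because no values were seen. A minimal sketch of that pattern follows; the class name is hypothetical.

```java
/** Hypothetical accumulator (not Solr code) showing sentinel-seeded min/max. */
final class SentinelMinMax {
    double min = Double.MAX_VALUE; // any real value is <= this
    double max = Double.MIN_VALUE; // smallest positive double, not the most negative one
    long count = 0;

    void accumulate(double value) {
        min = Math.min(min, value);
        max = Math.max(max, value);
        count++;
    }

    public static void main(String[] args) {
        SentinelMinMax stats = new SentinelMinMax();
        // No call to accumulate(): every document is missing the field.
        System.out.println("min = " + stats.min + ", max = " + stats.max);
        // A caller can check count == 0 and report blanks instead of the sentinels.
    }
}
```

Run as written, this prints `min = 1.7976931348623157E308, max = 4.9E-324`, matching the numbers in the report.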

RE: Finding distinct unique IDs in documents returned by fq -- Urgent Help Req

2010-07-22 Thread Chris Hostetter
: > I would like get the total count of the facet.field response values : : I'm pretty sure there's no way to get Solr to do that -- other than not : setting a facet.limit, getting every value back in the response, and : counting them yourself (not feasible for very large counts). I've : look

Re: Providing token variants at index time

2010-07-22 Thread Paul Dlug
On Thu, Jul 22, 2010 at 4:01 PM, Jonathan Rochkind wrote: > I think the Synonym filter should actually do exactly what you want, no? > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory > > Hmm, maybe not exactly what you want as you describe it. It comes close,

Re: a bug of solr distributed search

2010-07-22 Thread Chris Hostetter
: As the comments suggest, it's not a bug, but just the best we can do : for now since our priority queues don't support removal of arbitrary FYI: I updated the DistributedSearch wiki to be more clear about this -- it previously didn't make it explicitly clear that docIds were supposed to be uni

Re: Providing token variants at index time

2010-07-22 Thread Jonathan Rochkind
I think the Synonym filter should actually do exactly what you want, no? http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory Hmm, maybe not exactly what you want as you describe it. It comes close, maybe good enough. Do you REALLY need to support "I Business M

calling other core from request handler

2010-07-22 Thread Kevin Osborn
I have a multi-core environment and a custom request handler. However, I have one place where I would like to have my request handler on coreA query to coreB. This is not distributed search. This is just an independent query to get some additional data. I am also guaranteed that each server wi

Providing token variants at index time

2010-07-22 Thread Paul Dlug
Is there a tokenizer that supports providing variants of the tokens at index time? I'm looking for something that could take a syntax like: International|I Business|B Machines|M Which would take each pipe delimited token and preserve its position so that phrase queries work properly. The above wo
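The pipe-delimited syntax described above can be modelled as a stream of terms in which each variant shares the position of its primary token. Below is a rough sketch in plain Java, independent of Lucene; the class and method names are hypothetical, and a real implementation would more likely be a Lucene token filter that gives variants a position increment of 0.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical parser (not a Solr tokenizer) for "International|I Business|B Machines|M". */
final class VariantTokenSketch {

    /** Returns entries of the form term@position; variants reuse the primary token's position. */
    static List<String> parse(String input) {
        List<String> terms = new ArrayList<>();
        int position = 0;
        for (String group : input.trim().split("\\s+")) {
            if (group.isEmpty()) {
                continue; // blank input yields no terms
            }
            position++; // each whitespace-separated group advances the position by one
            for (String variant : group.split("\\|")) {
                if (!variant.isEmpty()) {
                    terms.add(variant + "@" + position);
                }
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        System.out.println(parse("International|I Business|B Machines|M"));
    }
}
```

Because "International" and "I" both sit at position 1, a phrase query can match either form followed by whatever sits at position 2.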

Re: Solr on iPad?

2010-07-22 Thread mbklein
Hi Stephan, On a lark, I hacked up solr running under a small-footprint servlet engine on my jailbroken iPad. You can see the console here: http://imgur.com/tHRh3 It's not a particularly practical solution, though, since Apple would never approve a Java-based app for the App Store. Or a backgrou

Re: a bug of solr distributed search

2010-07-22 Thread Yonik Seeley
As the comments suggest, it's not a bug, but just the best we can do for now since our priority queues don't support removal of arbitrary elements. I guess we could rebuild the current priority queue if we detect a duplicate, but that will have an obvious performance impact. Any other suggestions?

Re: Solr on iPad?

2010-07-22 Thread Andreas Jung
Stephan Schwab wrote: > Dear Solr community, > > does anyone know whether it may be possible or has already been done to > bring Solr to the Apple iPad so that applications may use a local search > engine? huh? Solr requires Java. iPad does not supp

Solr on iPad?

2010-07-22 Thread Stephan Schwab
Dear Solr community, does anyone know whether it may be possible or has already been done to bring Solr to the Apple iPad so that applications may use a local search engine? Greetings, Stephan -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-on-iPad-tp987655p987655.htm

Re: Tree Faceting in Solr 1.4

2010-07-22 Thread Eric Grobler
Thank you for the link. I was not aware of the multifaceting syntax - this will enable me to run 1 less query on the main page! However this is not a tree faceting feature. Thanks Eric On Thu, Jul 22, 2010 at 4:51 PM, SR wrote: > Perhaps the following article can help: > http://www.craftyf

Delta import processing duration

2010-07-22 Thread Qwerky
I'm using Solr to index data from our data warehouse. The data is imported through text files. I've written a custom FileImportDataImportHandler that extends DataSource and it works fine - I've tested it with 280,000 records and it manages to build the index in about 3 minutes. My problem is that

Re: Clustering results limit?

2010-07-22 Thread Stanislaw Osinski
Hi, In my SolrJ, I used ModifiableSolrParams and I set ("rows",50) but it > still returns less than 10 for each cluster. > Oh, the number of documents per cluster very much depends on the characteristics of your documents, it often happens that the algorithms create larger numbers of smaller clus

Re: Tree Faceting in Solr 1.4

2010-07-22 Thread SR
Perhaps the following article can help: http://www.craftyfella.com/2010/01/faceting-and-multifaceting-syntax-in.html -S On Jul 22, 2010, at 5:39 PM, Eric Grobler wrote: > Hi Solr Community > > If I have: > COUNTRY CITY > Germany Berlin > Germany Hamburg > Spain Madrid > > Can I do faceting

Tree Faceting in Solr 1.4

2010-07-22 Thread Eric Grobler
Hi Solr Community
If I have:
COUNTRY  CITY
Germany  Berlin
Germany  Hamburg
Spain    Madrid
Can I do faceting like:
Germany
  Berlin
  Hamburg
Spain
  Madrid
I tried to apply SOLR-792 to the current trunk but it does not seem to be compatible. Maybe there is a similar feature existing in the latest

Re: solrconfig.xml and xinclude

2010-07-22 Thread Tommaso Teofili
Just an update to say that the only way I figured out to include my 2 tags was via the element() scheme : * http://www.w3.org/2001/XInclude";> * obviously this is not desirable and clean at all even if it can make the trick if the number of fields is very small. Any ot

Re: faceted search with job title

2010-07-22 Thread Ken Krugler
Hi Savannah, A few comments below, scattered in-line... -- Ken On Jul 21, 2010, at 3:08pm, Savannah Beckett wrote: And I will have to recompile the dom or sax code each time I add a job board for crawling. Regex pattern is only a string which can be stored in a text file or db, and retrie

Getting FileNotFoundException with repl command=backup?

2010-07-22 Thread Peter Sturge
Informational Hi, This information is for anyone who might be running into problems when performing explicit periodic backups of Solr indexes. I encountered this problem, and hopefully this might be useful to others. A related Jira issue is: SOLR-1475. The issue is: When you execute a 'command=b

Using Solr to perform range queries in Dspace

2010-07-22 Thread Mckeane
I'm trying to use DSpace to search across a range of index created and stored using the Dsindexer.java class. I have seen that Solr can be used to perform numerical range queries using either the TrieIntField, TrieDoubleField, TrieLongField, etc. classes defined in Solr's API or SortableIntField.java, So

Re: Dismax query response field number

2010-07-22 Thread scrapy
I'm using Solr 1.4.1 -Original Message- From: Justin Lolofie To: solr-user@lucene.apache.org Sent: Thu, Jul 22, 2010 2:57 pm Subject: Re: Dismax query response field number scrapy what version of solr are you using? I'd like to do "fq=city:Paris" but it doesn't seem to work

Re: Dismax query response field number

2010-07-22 Thread Justin Lolofie
scrapy what version of solr are you using? I'd like to do "fq=city:Paris" but it doesn't seem to work for me (solr 1.4) and the docs seem to suggest it's a feature that is coming but not there yet? Or maybe I misunderstood? On Thu, Jul 22, 2010 at 6:00 AM, wrote: > >  Thanks, > > That was the pr

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
I set the rows=50 on my clustering URL in a browser and it returns more. In my SolrJ, I used ModifiableSolrParams and I set ("rows",50) but it still returns less than 10 for each cluster. Is there a way to set rows wanted with ModifiableSolrParams? thanks and sorry for the double post. Darren

Re: Clustering results limit?

2010-07-22 Thread Darren Govoni
Staszek, Thank you. The cluster response has a maximum of 10 documents in each cluster. I didn't set this limit and the query by itself returns 500+ documents. There should be many more than 10 in each cluster. Does it default to 10 maybe? Or is there a way to say, cluster every result in the que

Re: Dismax query response field number

2010-07-22 Thread scrapy
Thanks, That was the problem! select?q=moto&qt=dismax& fq =city:Paris -Original Message- From: Chantal Ackermann To: solr-user@lucene.apache.org Sent: Thu, Jul 22, 2010 12:47 pm Subject: Re: Dismax query response field number is this a typo in your query or in your

Re: Dismax query response field number

2010-07-22 Thread Chantal Ackermann
is this a typo in your query or in your e-mail? you have the "q" parameter twice. use "fq" for query inputs that mention a field explicitly when using dismax. So it should be: select?q=moto&qt=dismax& fq =city:Paris (the whitespace is only for visualization) chantal On Thu, 2010-07-22 at 11:
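The corrected request above keeps the user's text in `q` and moves the field restriction into `fq`. A small sketch in plain Java (Java 10 or later) that builds such a URL is given here; the helper class is hypothetical and is not part of SolrJ, and the relative `select` path simply mirrors the example in the message.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not SolrJ): builds a dismax request with a separate filter query. */
final class DismaxUrlSketch {

    static String selectUrl(String userQuery, String filterQuery) {
        return "select?q=" + encode(userQuery)
                + "&qt=dismax"
                + "&fq=" + encode(filterQuery); // field restriction goes in fq, not in a second q
    }

    private static String encode(String value) {
        return URLEncoder.encode(value, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(selectUrl("moto", "city:Paris"));
    }
}
```

The colon is percent-encoded as `%3A`, so the printed URL is `select?q=moto&qt=dismax&fq=city%3AParis`.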

Re: solrconfig.xml and xinclude

2010-07-22 Thread Tommaso Teofili
Hi, I am trying to do a similar thing within the schema.xml (using Solr 1.4.1), having a (super)schema that is common to 2 instances and specific fields I would like to include (with XInclude). Something like this: * ... ... * and it works with the sp

Re: Dismax query response field number

2010-07-22 Thread Peter Karich
maybe its too simple, but did you try the rows=20 or sth. greater as Lance suggested? => select?rows=20&qt=dismax Regards, Peter. > Yes i've data... maybe my query is wrong? > > select?q=moto&qt=dismax&q=city:Paris > > Field city is not showing? > > > > > > > > > -Original Message

Re: Securing Solr 1.4 in a glassfish container AS NEW THREAD

2010-07-22 Thread Bilgin Ibryam
Are you using the same instance of CommonsHttpSolrServer for all the requests? On Wed, Jul 21, 2010 at 4:50 PM, Sharp, Jonathan wrote: > > Some further information -- > > I tried indexing a batch of PDFs with the client and Solr CELL, setting > the credentials in the httpclient. For some reason

Re: Dismax query response field number

2010-07-22 Thread scrapy
Yes i've data... maybe my query is wrong? select?q=moto&qt=dismax&q=city:Paris Field city is not showing? -Original Message- From: Grijesh.singh To: solr-user@lucene.apache.org Sent: Thu, Jul 22, 2010 10:07 am Subject: Re: Dismax query response field number Do u have data

Re: 2 solr dataImport requests on a single core at the same time

2010-07-22 Thread Alexey Serba
DataImportHandler does not support parallel execution of several requests. You should either send your requests sequentially or register several DIH handlers in solrconfig and use them in parallel. On Thu, Jul 22, 2010 at 11:20 AM, kishan wrote: > > please help me > -- > View this message in con
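The first option above, sending the requests sequentially, can be enforced on the client with a single-threaded queue. This is a minimal sketch under that assumption; the class is hypothetical, and each `Runnable` is a placeholder for the actual HTTP call to the import handler.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

/** Hypothetical client-side queue (not a Solr API) that runs import requests one at a time. */
final class SequentialImportQueue {
    private final ExecutorService worker = Executors.newSingleThreadExecutor();
    private final List<String> completed = Collections.synchronizedList(new ArrayList<>());

    /** Queues one import; the Runnable stands in for the HTTP call to the handler. */
    void submit(String name, Runnable importCall) {
        worker.execute(() -> {
            importCall.run();
            completed.add(name);
        });
    }

    /** Waits for queued imports to finish and returns their names in completion order. */
    List<String> drain() {
        worker.shutdown();
        try {
            worker.awaitTermination(1, TimeUnit.MINUTES);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt for the caller
        }
        return new ArrayList<>(completed);
    }

    public static void main(String[] args) {
        SequentialImportQueue queue = new SequentialImportQueue();
        queue.submit("import-A", () -> { /* placeholder for the first request */ });
        queue.submit("import-B", () -> { /* placeholder for the second request */ });
        System.out.println(queue.drain());
    }
}
```

A single worker thread guarantees that the second import does not start until the first call has returned; whether the server-side import has finished by then depends on how the placeholder call waits for completion.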

Re: Clustering results limit?

2010-07-22 Thread Stanislaw Osinski
Hi, I am attempting to cluster a query. It kinda works, but where my > (regular) query returns 500 results the cluster only shows 1-10 hits for > each cluster (5 clusters). Never more than 10 docs and I know its not > right. What could be happening here? It should be showing dozens of > documents

Re: Dismax query response field number

2010-07-22 Thread Grijesh.singh
Do you have data in that field as well? Solr only returns fields that have data. -- View this message in context: http://lucene.472066.n3.nabble.com/Dismax-query-response-field-number-tp985567p986417.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Dismax query response field number

2010-07-22 Thread scrapy
No, i'm talking about fields. In my schema i've got about 15 fields with: stored="true" Like this: But when i run a query it returns only 10 fields; the last 4 or 5 are not in the response?? -Original Message- From: Lance Norskog To: solr-user@lucene.apache.org Sent:

Re: 2 solr dataImport requests on a single core at the same time

2010-07-22 Thread kishan
please help me -- View this message in context: http://lucene.472066.n3.nabble.com/2-solr-dataImport-requests-on-a-single-core-at-the-same-time-tp978649p986351.html Sent from the Solr - User mailing list archive at Nabble.com.