SolrCloud and Optimize

2012-09-11 Thread Nikhil Chhaochharia
Hi, I am using a recent nightly of Solr 4 and have setup a simple SolrCloud cluster of 2 shards without any replicas.  If I send the 'optimize' command, then it is executed on the shards one-by-one instead of in parallel. Is this by design?How can I run optimize in parallel on all the shards?

AW: Getting more proper results

2012-09-11 Thread Ramo Karahasan
Hi, i've set it to AND, restarted tomcat, but in my search i get the same results. So it seems that this don't have an effect. Any ideas? Ramo -Ursprüngliche Nachricht- Von: Ahmet Arslan [mailto:iori...@yahoo.com] Gesendet: Mittwoch, 12. September 2012 00:34 An: solr-user@lucene.apache

Re: Semantic document format... standards?

2012-09-11 Thread Paul Libbrecht
Otis, if you have a bit of time to research, I think your document may look a lot like the documents processed by: http://langtech.jrc.it/ which is a flagship "multilingual technology" implementation and includes a fair amount of entity disambiguation as far as I could hear in Ralph's ta

Re: Solr Sorting & Caching

2012-09-11 Thread Amey Patil
>Are you using a TrieDateField for the dates? Yes >Consider creating and re-using a filter for the keywords and let the >query consist of the date range only. In this case, do I have to configure any cache or solr's default configurations are enough? >Guessing here: You request all the results fr

Re: In-memory indexing

2012-09-11 Thread Amit Nithian
I have wondered about this too but instead why not just set your cache sizes large enough to house most/all of your documents and pre-warm the caches accordingly? My bet is that a large enough document cache may suffice but that's just a guess. - Amit On Mon, Sep 10, 2012 at 10:56 AM, Kiran Jayak

Re: Solr - Lucene Debuging help

2012-09-11 Thread Amit Nithian
The wiki should probably be updated.. maybe I'll take a stab at it. I'll also try and update my article referenced there too. When you checkout the project from SVN, do "ant eclipse" Look at this bug (https://issues.apache.org/jira/browse/SOLR-3817) and either run the ruby program or download the

Re: solr.StrField with stored="true" useless or bad?

2012-09-11 Thread Amit Nithian
This is great thanks for this post! I was curious about the same thing and was wondering why "fl" couldn't return the "indexed" representation of a field if that field were only indexed but not stored. My thoughts were return something than nothing but I didn't pay attention to the fact that gettin

Solr 4.0-BETA facet pivot returns no result

2012-09-11 Thread andy
I use the Solr 4.0-BETA version, my request url is http://localhost:8983/solr/collection1/select?q=*%3A*&rows=0&wt=xml&facet.pivot=cat,popularity,inStock&facet.pivot=popularity,cat&facet=true&facet.field=cat&facet.pivot.mincount=0 but I do not get any facet pivot info in the result true *:* cat

RE: Facet Sort by Index, missing indexes

2012-09-11 Thread Dewi Wahyuni
Hi Chris, Thanks for that tip, I checked and it is correctly because of the constraint limit. Thanks again Dewi -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Wednesday, 12 September 2012 1:57 AM To: solr-user@lucene.apache.org Subject: Re: Facet Sor

RE: Term searches with colon(:)

2012-09-11 Thread Nemani, Raj
Sorry for not being clear. Yes, I am trying to do a wildcard search for terms that contain a colon in the text (ie: "foo:bar") in the filed list mentioned in the default requesthandler that I posted earlier. Description is one of those fields. Mpg is another field. I have not included the entir

SolrCloud fail over

2012-09-11 Thread andy
I know fail over is available in solr4.0 right now, if one server crashes,other servers also support query,I set up a solr cloud like this http://lucene.472066.n3.nabble.com/file/n4007117/Selection_028.png I use http://localhost:8983/solr/collection1/select?q=*%3A*&wt=xml for query at first, if t

Partial search

2012-09-11 Thread Mani
I have three documents with the following search field (text_en type) values. When I search for "Energy Field", I am getting the document in this order presented. However if you look at the match, I would expect the Doc3 should come first and Doc1 should be the last. Doc1 : Automic Energy and

Re: Solr search not working after copying a new field to an existing Indexed Field

2012-09-11 Thread Mani
Eric, "When you add a doc with the same unique key as an old doc, the data associated with the first version of the doc is entirely thrown away and its as though you'd never indexed it at all", I did exactly the same. The old doc and new doc there is not a change except the Name has changed. Whe

Re: solr.StrField with stored="true" useless or bad?

2012-09-11 Thread Ahmet Arslan
> The purpose of stored="true" is to store the raw string data > besides the analyzed/transformed data for displaying > purposes. This is fine for an analyzed solr.TextField, but > for an StrField both values are the same. So is there any > reason to apply stored="true" on a StrField as well? If y

Running Luke On Solr Index (getting lock error)

2012-09-11 Thread BadalChhatbar
Hi All, I am trying to run luke on my solr search index ( I have contributed 3-4 xml files only). when I try to open indexes in luke, I am getting write.lock error on index and its not showing me index. I did check "Force Unlock" option, but it didn't help either, I also tried opening index unde

Re: solr.StrField with stored="true" useless or bad?

2012-09-11 Thread Yonik Seeley
On Tue, Sep 11, 2012 at 7:03 PM, wrote: > The purpose of stored="true" is to store the raw string data besides the > analyzed/transformed data for displaying purposes. This is fine for an > analyzed solr.TextField, but for an StrField both values are the same. So is > there any reason to apply

solr.StrField with stored="true" useless or bad?

2012-09-11 Thread sysrq
Hi, I have a StrField to store an URL. The field definition looks like this: Type "string" is defined as usual: Then I realized that a StrField doesn't execute any analyzers and stored data verbatim. The data is just a single token. The purpose of stored="true" is to store the raw string dat

Re: Semantic document format... standards?

2012-09-11 Thread Jack Krupansky
My standard question for such a situation: How are you expecting your users to query this data? Are they expecting simple English/natural language text, or are they expecting structured "identifiers" that can be keys into other data sources. For example, are your "entities" simple text literal

Re: multiple filter queries and boolean operators in SolrJ

2012-09-11 Thread rajarshi.g...@gmail.com
Perfect! Many thanks Sent from my HTC One™ X, an AT&T 4G LTE smartphone - Reply message - From: "Ahmet Arslan" To: , Subject: multiple filter queries and boolean operators in SolrJ Date: Tue, Sep 11, 2012 6:36 PM --- On Wed, 9/12/12, Rajarshi Guha wrote: > From: Rajarshi Guha >

Re: multiple filter queries and boolean operators in SolrJ

2012-09-11 Thread Chris Hostetter
fq's are always intersections, if you want to union multiple queries you have to specify them in a single fq -- that's not a SolrJ/URL thing, that's just a low level detail of how solr caches & intersects filters. from SolrJ you just have to do a single addFilterQuery() call containing your uni

Re: multiple filter queries and boolean operators in SolrJ

2012-09-11 Thread Ahmet Arslan
--- On Wed, 9/12/12, Rajarshi Guha wrote: > From: Rajarshi Guha > Subject: multiple filter queries and boolean operators in SolrJ > To: solr-user@lucene.apache.org > Date: Wednesday, September 12, 2012, 12:58 AM > Hi, I am accessing our Solr > installation via SolrJ. Currently, we are > suppor

Re: Getting more proper results

2012-09-11 Thread Ahmet Arslan
> I'm using it that way, because I want to have an > autocompletion list. Now > I'm wondering if I can influence some of the results I'm > getting. I have a > lot of categories in my database. If I for example search > for "iphone 3" I > would expect to get all iphone 3 from the category > electron

Re: What's the rules about contributing to Solr WIKI?

2012-09-11 Thread Chris Hostetter
: Subject: What's the rules about contributing to Solr WIKI? https://wiki.apache.org/solr/#How_to_edit_this_Wiki >> This Wiki is a collaborative site, anyone can contribute and share: >> >> Create an account by clicking the "Login" link at the top of any >> page, and picking a username and pass

What's the rules about contributing to Solr WIKI?

2012-09-11 Thread Alexandre Rafalovitch
I just figured out how to run custom solr core with basic jetty under Windows service with Apache procrun. Not quite a production setup and most probably not perfect, but it might save somebody several hours next time. I want to contribute that back to Solr WIKI for next person. Do I just wade in

multiple filter queries and boolean operators in SolrJ

2012-09-11 Thread Rajarshi Guha
Hi, I am accessing our Solr installation via SolrJ. Currently, we are supporting filter queries via the addFilterQuery() method of SolrQuery. However as far as I can see, the resultant documents that come out of the query are the intersection of all the filters. Ideally, what I'd like to happen is

Re: Semantic document format... standards?

2012-09-11 Thread Paul Libbrecht
As Michael hinted, I believe RDF would be the de-factor answer. Within it, things such as OWL or SKOS certainly represent classical formats. Processors such as OWLAPI can go pretty far there. As Péter hinted, schema.org might provide a way to complement an existing XML with semantic information.

Getting more proper results

2012-09-11 Thread Ramo Karahasan
Hi, i'm using solr 3.5 with the following configuration: I'm using it that way, because I want to have an autocompletion list. Now I'm wondering if I can influence some of the results I'm getting. I have a l

Re: PatternTokenizerFactory not working to split comma separated value

2012-09-11 Thread Jack Krupansky
Is it possible that you may have indexed with an earlier pattern, changed the pattern, and then tried to query? If so, you need to fully re-index to see the change take effect. I don't know of anything in solrconfig that should affect PatternTokenizerFactory. -- Jack Krupansky -Original

Re: Semantic document format... standards?

2012-09-11 Thread Péter Király
Hi, I guess the most common format today is using the schema.org's ontologies. It provides a couple of definitions, and it is supported by big players, such as Google, Yahoo, Microsoft. See http://schema.org/. Hope it helps, Péter wrote: > > Hello, > > > > If I'm extracting named entities, top

Re: PatternTokenizerFactory not working to split comma separated value

2012-09-11 Thread Suneel Pandey
Hi Jack, This is happening on only lucid cloud server not splitting comma separated value. but on solr server this fix is working perfectly. Is any configuration changes in solrconfig.xml which can enable and disable PatternTokenizerFactory? - Regards, Suneel Pandey Sr. Software Developer

Re: PatternTokenizerFactory not working to split comma separated value

2012-09-11 Thread Jack Krupansky
I tried your field type in Solr 4.0-BETA and it works fine for input such as: cat,dog|fox,bat|frog What do you see when you use the Solr Admin Analysis web page for that text for your field type? I would note that your pattern does not permit spaces as delimiters or after delimiters, so if

Re: SolrCloud vs SolrReplication

2012-09-11 Thread thaihai
Thanks for the answer Erick. thaihai -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-vs-SolrReplication-tp4006327p4007019.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Term searches with colon(:)

2012-09-11 Thread Chris Hostetter
: Thank you for the reply and help. The description field is part of the : defaultHandler's eDisMax search list (qf): ... : Similar queries for other escaped characters in "description" using term : search return correctly as shown from the logs correctly. Ok ... but you haven't really a

Re: Solr request/response lifecycle and logging full response time

2012-09-11 Thread Chris Hostetter
: I'd still love to see a query lifecycle flowchart, but, in case it : helps any future users or in case this is still incorrect, here's how : I'm tackling this: part of the problem is deciding what you mean by "lifecycle" - as far as the SolrCore is concerned, the lifecycle of hte request is it

PatternTokenizerFactory not working to split comma separated value

2012-09-11 Thread Suneel
Hello , I am using following field type for comma separated value but it is not working. Please suggested what i did wrong. - Regards, Suneel Pandey Sr. Software Developer -- View this message in context: http://lucene.472066.n3.nabble.com/PatternTokenizer

Re: Use field as bool flag for another, not indexed, field

2012-09-11 Thread Erick Erickson
You've outlined the possibilities pretty well. I don't think you want a custom analyzer though, consider a custom UpdateHandler and overriding the addDoc command. You can freely manipulate the document at this point, adding or removing fields etc. So see if the incoming doc has your original field

Re: Semantic document format... standards?

2012-09-11 Thread Michael Della Bitta
I'm probably a little unclear about the breadth of what you want to do, but I would recommend DC at the extremely lightweight end, and TEI at the very heavyweight end. Perhaps you could come up with a mashup of DC and your own fields in RDF as well. Michael Della Bitta ---

Re: Facet Sort by Index, missing indexes

2012-09-11 Thread Chris Hostetter
: I did the query twice, once with the sorting and once without the sort: : : 1. Without f.a.facet.sort=index : I have all l1, l2, l3 in count order : (all l1, l2, and l3 facets have counts on them) : : 2. With f.a.facet.sort=index : The facet is sorted accordingly : "l1:..","l2:.." but "l3"

Re: RES: RES: RES: RES: Problem with accented words sorting

2012-09-11 Thread Toke Eskildsen
On Tue, 2012-09-11 at 14:21 +0200, Claudio Ranieri wrote: > Ok Toke, Is it worth opening a ticket in jira to implement the > collator-key + original in facet? I think it would be best to discuss it on the developer mailing list first. I have send a mail there: "Collator-based facet sorting in Solr

Solr on https

2012-09-11 Thread Cool Techi
Hi, We are trying to run solr on https, these are few of the issues or problems that are coming up. Just wanted to understand if anyone else is facing these problems, we have some shards running on https, but in shards parameter in solr we don't specify the protocol, how can we achieve thisWil

Re: [Solr4 beta] error 503 on commit

2012-09-11 Thread Antoine LE FLOC'H
Hoss, After investigating more, here is the tomcat log herebelow. It is indeed the same problem: "exceeded limit of maxWarmingSearchers=2,". It is an indexing box and the comment says that we could rise this number to 4 or something. I can do that but I have four questions though: - is it someth

Re: Running Solr Unit Test in Eclipse

2012-09-11 Thread Jack Krupansky
Generally, source folders come in pairs - "java" for the actual code source, and "test" for the unit tests. So, make sure that "/test" appears in the source folder name. And unit test file names either begin or end with "Test". If you right click on a unit test and select "Run As", you should

Re: RES: RES: RES: Problem with accented words sorting

2012-09-11 Thread Ahmet Arslan
> This is an interesting feature to be implemented, because we > can sort the results correctly, but not in the facets. > The facets also does not bring the total count for > pagination. > I'm using the facets to get the distinct values ​​of a > field. I wish to sort and pagination them. Distinct

Re: Solr 3.6.1 Source Code

2012-09-11 Thread Jack Krupansky
The "branch" will be the source for the next release (3.6.2), if there is one. To get the exact source for a release, go to "tags" rather than "branches". Use: http://svn.apache.org/viewvc/lucene/dev/tags/lucene_solr_3_6_1/ -- Jack Krupansky -Original Message- From: mechravi25 Sent:

Re: Solr 3.6.1 Source Code

2012-09-11 Thread Thomas Matthijs
On Tue, Sep 11, 2012 at 2:43 PM, mechravi25 wrote: > Hi, > > I would like to know the base lined version of Solr 3.6.1 Source code for > svn Check out. We tried to check out from the following link and found many > base lined versions related to Solr 3.6.x version. > > https://svn.apache.org/repos

Solr 3.6.1 Source Code

2012-09-11 Thread mechravi25
Hi, I would like to know the base lined version of Solr 3.6.1 Source code for svn Check out. We tried to check out from the following link and found many base lined versions related to Solr 3.6.x version. https://svn.apache.org/repos/asf/lucene/dev/branches/ Can anyone tell me the exact svn chec

RES: RES: RES: RES: Problem with accented words sorting

2012-09-11 Thread Claudio Ranieri
Ok Toke, Is it worth opening a ticket in jira to implement the collator-key + original in facet? -Mensagem original- De: Toke Eskildsen [mailto:t...@statsbiblioteket.dk] Enviada em: terça-feira, 11 de setembro de 2012 08:46 Para: solr-user@lucene.apache.org Assunto: Re: RES: RES: RES: Pr

Re: RES: RES: RES: Problem with accented words sorting

2012-09-11 Thread Toke Eskildsen
On Tue, 2012-09-11 at 12:14 +0200, Claudio Ranieri wrote: > This is an interesting feature to be implemented, because we can sort > the results correctly, but not in the facets. At work (State and University Library, Denmark) we use collator-ordered faceting for author & title, but out current imp

Return only matched multiValued field

2012-09-11 Thread Dotan Cohen
Assuming a multivalued, stored and indexed field with name "comment". When performing a search, I would like to return only the values of "comment" which contain the match. For example: When searching for "gold" instead of getting this result: Theres a lady whos sure all that

Re: Re: Get parent when the child is a search hit

2012-09-11 Thread Stein Gran
Hi, Thanks for all the suggestions :-) Seems like denormalization is the way to go to do this without losing scalability and speed. BlockJoins seems so solve another requirement I have, and that is for the parent-child relationship between for instance an email and email attachments. This relati

RES: RES: RES: Problem with accented words sorting

2012-09-11 Thread Claudio Ranieri
Ok Toke. Thanks for your explanation. This is an interesting feature to be implemented, because we can sort the results correctly, but not in the facets. The facets also does not bring the total count for pagination. I'm using the facets to get the distinct values ​​of a field. I wish to sort and

Solr backup replication -> restore from snapshot

2012-09-11 Thread roySolr
Hello, I have some question about restoring from a snapshot backup. I have a master and do the following command: http://solr.test.uk:/solr/replication?command=backup It created a directory in my data directory: snapshot.20120911224532 When i want to use this backup on master i replace the

Re: use of filter queries in Lucene/Solr Alpha40 and Beta4.0

2012-09-11 Thread guenter.hip...@unibas.ch
feedback to patch: I used build #85 (Revision: 1382192) to test the same use case (build up an initial index of 18 Mio and run updates with around 200.000 documents) result: the use of fq to drill down facets is now consistent! (available under http://sb-tp1.swissbib.unibas.ch) Thanks for pr

Re: RES: RES: Problem with accented words sorting

2012-09-11 Thread Toke Eskildsen
On Mon, 2012-09-10 at 16:04 +0200, Claudio Ranieri wrote: > When I used the CollationKeyFilterFactory in my facet (example below), > the value of facet went wrong. When I remove the > CollationKeyFilterFactory of type of facet, the value went correct. As Ahmed wrote, CollationKeyFilter is meant fo

Re: Solr Sorting & Caching

2012-09-11 Thread Toke Eskildsen
On Tue, 2012-09-11 at 08:00 +0200, Amey Patil wrote: > Our solr index (Solr 3.4) has over 100 million docuemnts. [...] > *((keyword1 AND keyword2...) OR (keyword3 AND keyword4...) OR ...) AND > date:[date1 TO *]* > No. of keywords can be in the range of 100 - 1000. > We are adding sort parameter *'