Re: Solr WIKI seems really dead this time

2014-12-10 Thread Alexandre Rafalovitch
Thanks. I guess we'll see tomorrow. I just find it funny that the status page says: "Everything is supercalifragilisticexpialidocious!". One has to wonder what would cause the change to that message :-) I - second - guess that their monitoring tools are not pointing at the Wiki, only at the Conf

Re: Solr WIKI seems really dead this time

2014-12-10 Thread Steve Rowe
@infrabot tweeted 4 hours ago: "Currently experiencing some issues with our OSUOSL colo. Current cause is unknown, and no ETA." 45 minutes ago David Nalley (VP Infra) emailed infra@a.o: That network issue took down our entire presence in Oregon for almost 2 hours. The colo staff has gone

Solr WIKI seems really dead this time

2014-12-10 Thread Alexandre Rafalovitch
Solr WIKI seems to die periodically. But usually it is back up in 30 minutes or so. This time it (the whole http://wiki.apache.org/ ) seems to have been down for hours. Does anybody know what's up/down with that? Regards, Alex. I can't find any mention on Status or JIRA about it, so not sure whether Per

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread Chris Hostetter
: We are trying out solr 4.10.2 (as an upgrade from 4.0) and are seeing an odd : issue. ... : Note that we are testing under Windows 7, and that the sample solr in 4.10.2 : runs fine (with the same folder structure, etc tho with the default schema : and solrconfig.xml). I have run thru t

Re: SOLR shards stay down forever

2014-12-10 Thread Shalin Shekhar Mangar
If you send explicit commits to the cluster then SOLR-6530 can cause shards to be put into down state during network partitions. If you rely only on configured autocommits then you won't be affected by this bug. This is fixed in 4.10.2 On Wed, Dec 10, 2014 at 5:02 AM, Norgorn wrote: > The proble

Has anyone used Automatic Phrase Tokenization (AutoPhrasingTokenFilterFactory) ?

2014-12-10 Thread Shamik Bandopadhyay
Hi, I'm trying to use AutoPhrasingTokenFilterFactory which seems to be a great solution to our phrase query issues. But doesn't seem to work as mentioned in the blog : https://lucidworks.com/blog/automatic-phrase-tokenization-improving-lucene-search-precision-by-more-precise-linguistic-analysis

Re: How I raise the maxUpdateConnections under Solr 4.6.1

2014-12-10 Thread Shawn Heisey
On 12/10/2014 8:51 AM, yriveiro wrote: How I can raise this two variables: maxUpdateConnections, maxUpdateConnectionsPerHost in Solr 4.6.1 with the old solr.xml style? One way to handle this is the shardHandlerFactory setting in solrconfig.xml. I think you'd need it on the /update handler and

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-10 Thread Erick Erickson
The process if you don't use CUSS is this: 1> assemble the packet of docs 2> send it to Solr 3> wait until Solr is done indexing it 4> start assembling the second doc. So, several things are going on here. 1> the client is sitting idle while Solr is indexing and 2> Solr is sitting idle when t
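The serial loop described above, and the queue-plus-background-threads alternative that CUSS is generally understood to use, can be sketched in a few lines. This is a simulation only, not SolrJ: `send_batch`, `index_serially`, `index_concurrently`, and the sleep that stands in for Solr's indexing time are all hypothetical names and assumptions made for illustration.

```python
import queue
import threading
import time

INDEX_DELAY = 0.005  # assumed stand-in for the time Solr spends indexing one batch


def send_batch(batch, indexed, lock):
    """Hypothetical stand-in for an update request: wait, then record the docs."""
    time.sleep(INDEX_DELAY)
    with lock:
        indexed.extend(batch)


def index_serially(batches):
    """Send a batch, wait for it to finish, then move on to the next one."""
    indexed, lock = [], threading.Lock()
    for batch in batches:
        send_batch(batch, indexed, lock)  # client is idle while this call waits
    return indexed


def index_concurrently(batches, workers=4, queue_size=8):
    """Put batches on a bounded queue that background threads drain in parallel."""
    indexed, lock = [], threading.Lock()
    pending = queue.Queue(maxsize=queue_size)

    def worker():
        while True:
            batch = pending.get()
            if batch is None:  # sentinel: no more work for this thread
                break
            send_batch(batch, indexed, lock)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for batch in batches:
        pending.put(batch)  # blocks only when the queue is full
    for _ in threads:
        pending.put(None)
    for t in threads:
        t.join()
    return indexed
```

Both functions index the same documents; the concurrent version simply keeps assembling and queueing batches while earlier ones are still being processed, so neither side sits idle for long. Arrival order is not preserved in the concurrent case.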

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-10 Thread Guido Medina
Hi Mathaix, David could be right, we have geo-polygons and stuff and what we do is that we put the JTS jar inside solr.war (and other jars using our own automated scripts to add extra jars inside it) Here is the link of latest maven JTS dependency where you can download it directly: http:/

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-10 Thread david.w.smi...@gmail.com
Mathaix, I bet you don’t have JTS on your classpath. See the spatial page in the Solr ref guide. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Dec 10, 2014 at 4:50 PM, mathaix wrote: > So I downloaded and attached the sp

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-10 Thread mathaix
So I downloaded and attached the spatial4j source. (where it is breaking). Notice that there is no POLYGON type in the WktShapeParser file. (See attached source). https://github.com/spatial4j/spatial4j/blob/master/src/main/java/com/spatial4j/core/io/WktShapeParser.java

Suggester: weight (term frequency) and 'mm' feasibility

2014-12-10 Thread Boon Low
Hi, Solr suggester is wonderful. We have been testing the built-in dictionary implementations for some large-ish datasets (36m, 132m), and getting single/teen milli-seconds response times with 9 multiple dictionaries per request. Most of the resulting dictionaries have millions entries too. In

Re: CloudSolrServer, concurrency and too many connections

2014-12-10 Thread Greg Solovyev
This was a user error. My code was re-instantiating CloudSolrServer for each request and never calling CloudSolrServer::shutdown(). Thanks, Greg - Original Message - From: "Greg Solovyev" To: solr-user@lucene.apache.org Sent: Wednesday, December 10, 2014 11:08:10 AM Subject: Re: CloudS
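The mistake described above (a new client per request, never shut down) can be illustrated with a small stand-in. Nothing here is SolrJ: `FakeCloudClient` and both handler functions are hypothetical, and the class-level counter only mimics connections held open against ZooKeeper.

```python
class FakeCloudClient:
    """Hypothetical stand-in for a client that holds one ZooKeeper connection."""

    open_connections = 0  # simulated count of connections currently held open

    def __init__(self):
        FakeCloudClient.open_connections += 1
        self._closed = False

    def query(self, q):
        return "results for " + q

    def shutdown(self):
        # Release the simulated connection exactly once.
        if not self._closed:
            FakeCloudClient.open_connections -= 1
            self._closed = True


def leaky_handler(q):
    """Anti-pattern: create a client per request and never shut it down."""
    client = FakeCloudClient()
    return client.query(q)


def shared_handler(client, q):
    """Preferred pattern: reuse one long-lived client for every request."""
    return client.query(q)
```

With the leaky handler the simulated connection count grows by one per request; with a shared client it stays at one until `shutdown()` is called when the application exits.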

Solr Error when making GeoPrefixTree polygon filter search

2014-12-10 Thread mathaix
I am trying to make a Solr query using the following filter: location_rpt:"IsWithin( POLYGON((-122.42048263549805 37.79762790889688, -122.42048263549805 37.787860934698, -122.44726181030273 37.787860934698, -122.44726181030273 37.79762790889688, -122.42048263549805 37.79762790889688)) ) distErrPct=0"
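A filter string of this shape can be assembled from a list of coordinates, which makes it easier to guarantee that the ring is closed (first point repeated at the end) and that each pair is written as "longitude latitude". The helper below is a minimal sketch; `polygon_filter` and its parameters are hypothetical names, and it only builds the string. Per the replies in this thread, the server side may additionally need JTS on the classpath for polygons to parse.

```python
def polygon_filter(field, ring, predicate="IsWithin", dist_err_pct=0):
    """Build a filter string like field:"IsWithin(POLYGON((...))) distErrPct=0".

    ring is a sequence of (lon, lat) pairs; it is closed automatically.
    """
    points = list(ring)
    if len(points) < 3:
        raise ValueError("a polygon needs at least three distinct points")
    if points[0] != points[-1]:
        points.append(points[0])  # WKT rings must end where they start
    coords = ", ".join("%s %s" % (lon, lat) for lon, lat in points)
    return '%s:"%s(POLYGON((%s))) distErrPct=%s"' % (
        field, predicate, coords, dist_err_pct)


# Corner points taken from the query above, without the closing point.
corners = [
    (-122.42048263549805, 37.79762790889688),
    (-122.42048263549805, 37.787860934698),
    (-122.44726181030273, 37.787860934698),
    (-122.44726181030273, 37.79762790889688),
]
fq = polygon_filter("location_rpt", corners)
```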

Re: Question regarding Solr 4.7 solr joining across multiple cores and sorting

2014-12-10 Thread Mikhail Khludnev
https://issues.apache.org/jira/browse/SOLR-6234 {!scorejoin} which is a Solr QParser brings Lucene JoinUtil, for sure. replying into appropriate list. On Wed, Dec 10, 2014 at 10:14 PM, Parnit Pooni wrote: > Hi, > I'm running into an issue attempting to sort, here is the scenario. > > I have my

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread solr-user
The log tab shows "No Events available", and there are no errors at all in the CMD console. My test version hasn't got any logging changes that are already in the default Solr 4.10.2 package. Some kind of warning or error message would have been helpful...

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread Alexandre Rafalovitch
Anything on the log tab? Sometimes there are issues with loading libraries. Then you should get bright red messages. Ignore the ones about admin-extra though, they are not the cause. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread solr-user
Definitely puzzling. I am running this on my local box (i.e. using http://localhost:8086/solr) and it is the only running instance of any Solr.

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread Alexandre Rafalovitch
On 10 December 2014 at 13:56, solr-user wrote: > When Solr starts up it finds the core ("coreA") where we put it (we see > "Found core coreA" in the solr console) but we see "No cores available" when > we go to the solr dashboard. Looks ok to me from a glance. Are you absolutely sure you are not

Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-10 Thread Tom Burton-West
Hello all, In the example schema.xml for Solr 4.10.2 this comment is listed under the "PERFORMANCE NOTE": "For maximum indexing performance, use the ConcurrentUpdateSolrServer java client." Is there some documentation somewhere that explains why this will maximize indexing performance? In par

Re: CloudSolrServer, concurrency and too many connections

2014-12-10 Thread Greg Solovyev
I am seeing this problem with Java 1.8.0_25-b17 on Ubuntu 14.04.1 LTS ZK 3.4.6, Solr 4.10.2 Thanks, Greg - Original Message - From: "JoeSmith" To: "solr-user" Sent: Monday, December 8, 2014 6:19:08 PM Subject: Re: CloudSolrServer, concurrency and too many connections Thanks, Shawn. I

Re: CloudSolrServer, concurrency and too many connections

2014-12-10 Thread Greg Solovyev
I am seeing the same problem with 4.10.2 and 4.9.0. CloudSolrServer keeps opening connections to ZK and never closes them. Eventually (very soon) ZK runs out of connections and stops accepting new ones. Thanks, Greg - Original Message - From: "JoeSmith" To: "solr-user" Sent: Sunday,

Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-10 Thread solr-user
hi all We are trying out solr 4.10.2 (as an upgrade from 4.0) and are seeing an odd issue. When Solr starts up it finds the core ("coreA") where we put it (we see "Found core coreA" in the solr console) but we see "No cores available" when we go to the solr dashboard. I also noticed that the i

Fwd: WordBreakSolrSpellChecker Usage

2014-12-10 Thread Matt Mongeau
If I have my search component setup like this https://gist.github.com/halogenandtoast/cf9f296d01527080f18c and I have an entry for “Rockpoint” shouldn’t “Rock point” generate suggestions? This doesn't seem to be the case, but it works for "Blackstone" with "Black stone". Any ideas on what I might

Re: How to stop Solr tokenising search terms with spaces

2014-12-10 Thread Jack Krupansky
If possible, please post your field type for others to see the final solution. Thanks! -- Jack Krupansky -Original Message- From: Dinesh Babu Sent: Wednesday, December 10, 2014 9:54 AM To: solr-user@lucene.apache.org ; Ahmet Arslan Subject: RE: How to stop Solr tokenising search terms

Re: Duplicate unique ID in implicit collection - Illegal?

2014-12-10 Thread Chris Hostetter
: With an implicit collection, is it legal to index the same document : (same unique ID) in 2 different shards? I know, it kind of defeats the : purpose of having a unique ID... Each doc (defined by uniqueKey) must exist in one and only one shard ... when this constraint is violated, you'll star

Re: Duplicate unique ID in implicit collection - Illegal?

2014-12-10 Thread Alexandre Rafalovitch
On 10 December 2014 at 10:53, Damien Dykman wrote: > The facets do take into > account the duplicate nature but the number of results varies, for > instance depending on parameter row=xx. The facets take deleted but not yet expunged (merged segment) documents into account. One of the limitations

Re: Boosting the score using edismax for a non empty and non indexed field.

2014-12-10 Thread Meraj A. Khan
Thanks Erik, I followed this approach. On Tue, Dec 9, 2014 at 4:21 AM, Erik Hatcher wrote: > Boosting will need to be done off an indexed field. But maybe rather than > indexing the url value, maybe index another new hasImage field as a boolean > true. No need to index the false values even.

Duplicate unique ID in implicit collection - Illegal?

2014-12-10 Thread Damien Dykman
Hi all, With an implicit collection, is it legal to index the same document (same unique ID) in 2 different shards? I know, it kind of defeats the purpose of having a unique ID... The reason I'm doing this is that I want to "move" a single document from one shard to another. During the transit

How I raise the maxUpdateConnections under Solr 4.6.1

2014-12-10 Thread yriveiro
Hi, How can I raise these two variables: maxUpdateConnections, maxUpdateConnectionsPerHost in Solr 4.6.1 with the old solr.xml style? /Yago

Re: Priority in search an synonyms

2014-12-10 Thread Alexandre Rafalovitch
This might be written just for you: http://opensourceconnections.com/blog/2014/12/08/title-search-when-relevancy-is-only-skin-deep/ Merchant would be same as title = short text Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-s

Re: Priority in search an synonyms

2014-12-10 Thread Ahmet Arslan
Hi, This could be due to idf drift. I suggest a simple and easy-to-understand solution: create an additional field, let's call it text_intact, and use a minimal analysis on it. By minimal, I mean exclude stemming, synonyms, etc. Use a tokenizer, lowercase, and maybe ASCII folding. Use th

Priority in search an synonyms

2014-12-10 Thread Antoine REBOUL
Hello, I have a question; I do not know if there is a solution... I will index and search a field named "Libel". I use a "synonyms" file. I have for example the following line in my synonyms file: "ipad => Apple, Priceminister, Amazon". A search on iPad gives me much Apple, and Amazon P

RE: How to stop Solr tokenising search terms with spaces

2014-12-10 Thread Dinesh Babu
Hi Ahmet, We have gone for the Ngram solution. Thanks Regards, Dinesh Babu. -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com.INVALID] Sent: 08 December 2014 15:27 To: solr-user@lucene.apache.org Subject: Re: How to stop Solr tokenising search terms with spaces Hi, May be

Re: SOLR shards stay down forever

2014-12-10 Thread Erick Erickson
Your shards shouldn't mysteriously go down and stay down. But tlogs shouldn't be that big either, there's not much point in having that much info. It's a long story and I have to run, but here's a discussion of that topic. https://lucidworks.com/blog/understanding-transaction-logs-softcommit-and-c

Re: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-10 Thread Alexandre Rafalovitch
On 10 December 2014 at 09:32, Erick Erickson wrote: > That said, I've seen some excellent answers posted over there come up > on Google searches and the like That's a big one. The Solr mailing list is not well optimized for discovery. There are third party services that do some and game Google to be

Re: Does anybody asks/answer Solr questions on Stack Overflow? Why?

2014-12-10 Thread Erick Erickson
I barely keep up with the current message boards, so I really don't have time to participate in SO as well. That said, I've seen some excellent answers posted over there come up on Google searches and the like, I've just got to be kind of brutal in my prioritizing On Tue, Dec 9, 2014 at 1:37

Re: Can I select dummy field(for count) from solr?

2014-12-10 Thread aadel
You can use Hits panel to display total query result count. You can also use terms panel to display count (and even other aggregate functions, such as min, max, mean, ...) for top N terms.

edismax and default boolean operator.

2014-12-10 Thread Modassar Ather
Hi, The default boolean operator is hard-coded in ExtendedDismaxQParser. Solr version is 4.10.1. Below is the code snippet from ExtendedDismaxQParser. public ExtendedSolrQueryParser(QParser parser, String defaultField) { super(parser, defaultField); // don't trust that our parent clas

Re: Length norm not functioning in solr queries.

2014-12-10 Thread Mikhail Khludnev
S.L, I briefly skimmed Lucene50NormsConsumer.writeNormsField(); my conclusion is: if you supply your own similarity, which just avoids putting float to byte in Similarity.computeNorm(FieldInvertState), you get exactly this value in Similarity.decodeNormValue(long). You may wonder but this is what's exa

Re: Length norm not functioning in solr queries.

2014-12-10 Thread Ahmet Arslan
Hi, Or even better, you can use your new field for tie break purposes. Where scores are identical. e.g. sort=score desc, wordCount asc Ahmet On Wednesday, December 10, 2014 11:29 AM, Ahmet Arslan wrote: Hi, You mean update processor factory? Here is augmented (wordCount field added) versio

Re: Length norm not functioning in solr queries.

2014-12-10 Thread Ahmet Arslan
Hi, You mean update processor factory? Here is augmented (wordCount field added) version of your example : doc1: phoneName:"Details about Apple iPhone 4s - 16GB - White (Verizon) Smartphone Factory Unlocked" wordCount: 11 doc2: phoneName:"Apple iPhone 4S 16GB for Net10, No Contract, White" w
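The wordCount idea, combined with the tie-break sort suggested elsewhere in this thread (score desc, wordCount asc), can be tried out on the two example titles. This is only a sketch: the thread does not say how wordCount is computed, so the regex tokenization in `word_count` is an assumption (it happens to yield 11 for the first title), and the equal scores are made-up values. In Solr the count would be computed at index time rather than in client code like this.

```python
import re


def word_count(text):
    """Count alphanumeric tokens; punctuation-only tokens such as '-' are ignored."""
    return len(re.findall(r"\w+", text))


def rank(docs):
    """Order by score descending, then by wordCount ascending as a tie-break."""
    return sorted(docs, key=lambda d: (-d["score"], d["wordCount"]))


titles = [
    "Details about Apple iPhone 4s - 16GB - White (Verizon) Smartphone Factory Unlocked",
    "Apple iPhone 4S 16GB for Net10, No Contract, White",
]

# Hypothetical equal scores, to show the tie-break in action.
docs = [
    {"phoneName": t, "wordCount": word_count(t), "score": 1.0} for t in titles
]
ranked = rank(docs)
```

With identical scores, the shorter title sorts first, which is the effect the length norm was expected to provide.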

Re: Length norm not functioning in solr queries.

2014-12-10 Thread S.L
Hi Ahmet, Is there already an implementation of the suggested work around ? Thanks. On Tue, Dec 9, 2014 at 6:41 AM, Ahmet Arslan wrote: > Hi, > > Default length norm is not best option for differentiating very short > documents, like product names. > Please see : > http://find.searchhub.org/doc