Turning on KeywordRepeat and RemoveDups on an existing fieldType.

2014-05-05 Thread Michael Tracey
As per the stemming docs ( https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#Stemming ), I want to score the original term higher than the stemmed version by adding the KeywordRepeat and RemoveDups filters to a field type that is already created (with Stemming). I have 100M documents in this index, and it gets slo

SolrCloud Nodes autoSoftCommit and (temporary) missing documents

2014-05-23 Thread Michael Tracey
Hey all, I've got a number of nodes (Solr 4.4 Cloud) that I'm balancing with HaProxy for queries. I'm indexing pretty much constantly, and have autoCommit and autoSoftCommit on for Near Realtime Searching. All works nicely, except that occasionally the auto-commit cycles are far enough off th

Solr 4.5.1 and Illegal to have multiple roots (start tag in epilog?). (perhaps SOLR-4327 bug?)

2013-10-24 Thread Michael Tracey
Hey Solr-users, I've got a single solr 4.5.1 node with 96GB ram, a 65GB index (105 million records) and a lot of daily churn of newly indexed files (auto softcommit and commits). I'm trying to bring another matching node into the mix, and am getting these errors on the new node: org.apache.so

Re: Solr 4.5.1 and Illegal to have multiple roots (start tag in epilog?). (perhaps SOLR-4327 bug?)

2013-10-25 Thread Michael Tracey
ing Tomcat, and since increasing some limits on the connector, I haven't had any issues since. I'll close that ticket. connectionTimeout="6" redirectPort="8443" maxPostSize="1048

Re: Solr 4.5.1 replication Bug? "Illegal to have multiple roots (start tag in epilog?)."

2013-10-28 Thread Michael Tracey
Hey, this is Michael, who was having the exact error on the Jetty side with an update. I've upgraded jetty from the 4.5.1 embedded version (in the example directory) to version 9.0.6, which means I had to upgrade my OpenJDK from 1.6 to 1.7.0_45. Also, I added the suggested (very large) setting

SolrCloud (4.4) and CurrencyField refresh intervals

2013-11-04 Thread Michael Tracey
I've got a 4.4 SolrCloud cluster running, and have an external process that rebuilds the currency.xml file and uploads the latest version to ZooKeeper every X minutes. It looks like with CurrencyField the OpenExchangeRatesOrgProvider has a refreshInterval setting, but the documentation

Is this a reasonable way to boost?

2013-11-07 Thread Michael Tracey
I'm trying to slightly boost results whose value in a price (not currency) field is closer to a certain value. I want results that are not too expensive or too inexpensive to be favored. Here is what we are currently trying: bf=sub(1,abs(sub(15,price)))^0.2 where 15 is that "median" I want to boost
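To see how the function in that bf parameter behaves, here is a minimal Python sketch that mirrors sub(1,abs(sub(15,price))) as plain arithmetic, i.e. 1 - |15 - price|. The helper name price_boost and the sample prices are illustrative assumptions, not part of the original post, and the ^0.2 weight is deliberately left out so that only the shape of the function is shown.

```python
def price_boost(price: float, median: float = 15.0) -> float:
    """Plain-arithmetic mirror of sub(1,abs(sub(median,price))).

    Hypothetical helper for illustration only; it does not call Solr.
    """
    return 1.0 - abs(median - price)


if __name__ == "__main__":
    # Sample prices are made up to show values at, near, and far from the median.
    for price in (15.0, 14.5, 16.0, 10.0, 25.0):
        print(f"price={price:>5}: function value = {price_boost(price):+.1f}")
```

The value peaks at 1 when the price equals 15, reaches 0 at a distance of 1, and is negative for any price further away than that, so the function as written subtracts from the score of most documents rather than only favoring the ones near the median.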

qf match density?

2013-11-11 Thread Michael Tracey
While doing a search like: q=great+gatsby&defType=edismax&qf=title^1.8 records with a title of "great gatsby / great gatsby" always score higher than "great gatsby" just a single time. How do I express that a single match should be just as important as having the query match multiple times in

Re: Problems bulk adding documents to Solr Cloud in 4.5.1

2013-11-19 Thread Michael Tracey
Dave, those are the exact symptoms we all have had in SOLR-5402. After many attempted fixes (including upgrading jetty, switching to tomcat, messing with buffer settings) my solution was to fall back to 4.4 and await a fix. - Original Message - From: "Dave Seltzer" To: solr-user@lucene.ap

Controlling traffic between solr 4.1 nodes

2013-02-05 Thread Michael Tracey
Hey all, I'm new to Solr 4.x, and am wondering if there is any way that I could have a single collection (single or multiple shards) replicated into two datacenters, where only 1 solr instance in each datacenter communicates. (for example, 4 servers in one DC, 4 servers in another datacenter and onl

Sort order, return the first 20 results, and the last 80 results

2019-02-12 Thread Michael Tracey
Hey all, I'm interested in returning 100 rows in a query, with a sort order on a tfloat field, but returning the first 20 results, then the last 80 results. I'd like to do this without two requests, to keep down requests per second. Is there any way to do this in one query with function queries or
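To pin down the result shape being asked for, here is a small Python sketch that takes an already sorted list and keeps the first 20 and the last 80 entries. It is only a client-side illustration under assumed names (first_and_last, head, tail are hypothetical); it does not show a Solr feature and does not answer whether a single query can do this.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def first_and_last(sorted_rows: Sequence[T], head: int = 20, tail: int = 80) -> List[T]:
    """Return the first `head` and last `tail` items of an already sorted sequence.

    Hypothetical helper for illustration; if the input has no more than
    head + tail items, everything is returned once, with no duplicates.
    """
    n = len(sorted_rows)
    if n <= head + tail:
        return list(sorted_rows)
    # Slice from an explicit index so that tail=0 yields an empty tail.
    return list(sorted_rows[:head]) + list(sorted_rows[n - tail:])


if __name__ == "__main__":
    rows = list(range(1000))  # stand-in for documents sorted on the tfloat field
    picked = first_and_last(rows)
    print(len(picked), picked[:3], picked[-3:])
```

The short-input guard matters because naive slicing would return overlapping items when fewer than 100 documents match.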

SolrCloud (7.3) and Legacy replication slaves

2019-05-21 Thread Michael Tracey
Is it possible to set up an existing SolrCloud cluster as the master for legacy replication to a slave server or two? It looks like another option is to use unidirectional CDCR, but I'm not sure which is the best option in this case. -- Michael Tracey