[Simplified my question] How to enhance solr.StandardTokenizerFactory? (was: Why is Standard Tokenizer not separating at this comma?)

2017-05-24 Thread Robert Hume
Hi, Following up on my last email question ... I've learned more and I simplified by question ... I have a Solr 3.6 deployment. Currently I'm using solr.StandardTokenizerFactory to parse tokens during indexing. Here's two example streams that demonstrate my issue: Example 1: `bob,a-z,000123,xy

Why is Standard Tokenizer not separating at this comma?

2017-05-24 Thread Robert Hume
I have a Solr 3.6 deployment I inherited. The schema.xml specifies the use of StandardTokenizerFactory like so ... ... ... According to this reference guide ( https://home.apache.org/~ctargett/RefGuidePOC/jekyll/Tokenizers.html) ... the StandardTokenizer will treat punctuati

Re: Question about best way to architect a Solr application with many data sources

2017-02-21 Thread Robert Hume
se) for > control/display and enhance with the fast model (solr) for retrieval/search > > > > > On Feb 21, 2017, at 7:57 PM, Robert Hume wrote: > > > > To learn how to properly use Solr, I'm building a little experimental > > project with it to search

Question about best way to architect a Solr application with many data sources

2017-02-21 Thread Robert Hume
To learn how to properly use Solr, I'm building a little experimental project with it to search for used car listings. Car listings appear on a variety of different places ... central places Craigslist and also many many individual Used Car dealership websites. I am wondering, should I: (a) depl

how to tell SolrHttpServer client to accept/ignore all certs?

2016-11-14 Thread Robert Hume
I'm using HttpSolrServer (in Solr 3.6) to connect to a Solr web service and perform a query. The certificate at the other end has expired and so connections now fail. It will take the IT at the other end too many days to replace the cert (this is out of my control). How can I tell the HttpSolrSe

[Newbie question] what is a "core" and are they different from 3.x to 5.x ?

2015-11-05 Thread Robert Hume
Trying to learn about SOLR. I can see there is something called a "core" ... it appears there can be many cores for a single SOLR server. Can someone "explain like I'm five" -- what is a core? And how do "cores" differ from 3.x to 5.x. Any pointers in the right direction are helpful! Thanks! R

[Newbie question] in SOLR 5, would I have a "master-to-slave" relationship for two servers?

2015-11-05 Thread Robert Hume
Hi, In my SOLR 3 deployment (inherited it), I have (1) one SOLR server that is used by my web application, and (2) a second SOLR server that is used to index documents via a customer datasource. The database of server 2 is considered the "master" and it is replicated regularly to server 1, the "s

Re: Should I install 4.x or 5.x? Book recommendations?

2015-10-23 Thread Robert Hume
t; publisher pushed the price up into the ridiculous territory last time > I checked. So, don't buy it. But if you have O'Reilly Safari account > of some other ways to get to it, give it a glance too. > > Regards, >Alex. > > Solr Analyzers, Tokenizers, Filters,

Should I install 4.x or 5.x? Book recommendations?

2015-10-23 Thread Robert Hume
Hi, I'm investigating installing a new Solr deployment to be able to search about two million documents (mostly HTML and PDF). QUESTIONS: A. Should I use Solr 4.x or 5.x? My concerns are mostly to do with support. Is 5.x too new to be able to get good answers and advice from the community? Or

[newbie] questions about 3.6.0 and 4.x or 5.x ?

2015-10-21 Thread Robert Hume
Hello, I'm hoping to get some quick advice from the Solr gurus out there ... I’ve inherited a project that uses a Solr 3.6.0 deployment. (Several masters and several slaves – I think there are 6 Solr instances in total.) I’ve been tasked with investigating if upgrading our 3.6.0 deployment