localizing 'display names' for facet values

2014-05-30 Thread Will Milspec
Hi all, What's the cleanest way to solve this problem: localize the 'display names' for facet values without storing the localized names in solr. Example: -store 'country code' field in solr document -facet on country code -translate the country code based on the user's locale For the facets
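
One way to approach the question above is to keep only the country code in the index and translate it in the client layer when rendering facets. The following is a minimal sketch, assuming the facet values are ISO 3166 two-letter codes and that the JDK's built-in locale data is acceptable for display names. The class and method names (`FacetLabels`, `countryLabel`, `localize`) are hypothetical and are not part of Solr or SolrJ.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

public class FacetLabels {
    // Translate a stored facet value (assumed: ISO 3166 alpha-2 code) into a
    // display name for the user's locale, using only JDK locale data.
    static String countryLabel(String countryCode, Locale userLocale) {
        Locale country = new Locale.Builder().setRegion(countryCode).build();
        return country.getDisplayCountry(userLocale);
    }

    // Re-key facet counts from code to localized label, preserving facet order.
    static Map<String, Long> localize(Map<String, Long> facetCounts, Locale userLocale) {
        Map<String, Long> labeled = new LinkedHashMap<>();
        for (Map.Entry<String, Long> e : facetCounts.entrySet()) {
            labeled.put(countryLabel(e.getKey(), userLocale), e.getValue());
        }
        return labeled;
    }

    public static void main(String[] args) {
        // Hypothetical facet counts, as they might come back keyed by code.
        Map<String, Long> counts = new LinkedHashMap<>();
        counts.put("DE", 12L);
        counts.put("FR", 7L);
        System.out.println(localize(counts, Locale.ENGLISH));
        System.out.println(localize(counts, Locale.FRENCH));
    }
}
```

Filter queries would still be sent with the raw code, so the index stays locale-independent; only the label shown to the user changes.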

solr multi-tenant: anyone use per-tenant synonyms file?

2014-06-02 Thread Will Milspec
Hi all, I've been reading up on solr cloud (via solr in action) with an eye toward multi-tenancy. (Read: "solrcloud newbie") One question that came up: what if a "one size fits all" synonyms file does not work for all customers? i.e. different customers/industries use different sets of synonym

null pointer on FSTCompletionLookup

2014-06-04 Thread Will Milspec
Hi all, Someone posted this problem over a year ago but I did not see a clear resolution in the thread. Intermittently--i.e. for some searches, not others--the 'suggest/spellcheck' component throws a NullPointerException (NPE) when a user executes a search. It fails on FSTCompletionLookup (li

Re: null pointer on FSTCompletionLookup

2014-06-04 Thread Will Milspec
n executed my 'problematic searches', the problem went away. Why would this happen? Thanks in advance. On Wed, Jun 4, 2014 at 9:32 AM, Will Milspec wrote: > Hi all, > > Someone posted this problem over a year ago but I did not see a clear > resolution in the thread.

Re: null pointer on FSTCompletionLookup

2014-06-05 Thread Will Milspec
Update: this was a configuration error. In my haste/carelessness, instead of defining separate "spellcheck" and "suggest" components, I defined only "suggest". (More specifically, I copied over the ch10 examples from "solr in action", but did not copy the "spellcheck" component.) When solr com

solrcloud without faceting, i.e. for failover only

2015-01-06 Thread Will Milspec
Hi all, We have a smallish index that performs well for searches and are considering using solrcloud --but just for high availability/redundancy, i.e. without any sharding. The indexes would be replicated, but not distributed. I know that "there are no stupid questions..Only stupid people"...but

how to get TermVectorComponent using xml , vs. SOLR-949

2010-11-02 Thread Will Milspec
Hi all, This seems a basic question: what's the best way to get TermVectorComponents from the Solr XML response? SolrJ does not include TermVectorComponents in its api; the SOLR-949 patch adds this ability, but after 2 years it's still not in the mainline. (And doesn't patch cleanly to the curre

Override SynonymFilterFactory to load synonyms from alternate data source

2010-11-03 Thread Will Milspec
Hi all, Can anyone comment on the ease/merit of overriding the shipped SynonymFilterFactory with a version that could load the synonyms from an alternate data source? Our application currently maintains synonyms in its database; we could export this data to 'synonyms.txt', but would prefer a db

Any Copy Field Caveats?

2010-11-05 Thread Will Milspec
Hi all, we're moving from an old lucene version to solr and plan to use the "Copy Field" functionality. Previously we had "rolled our own" implementation, sticking title, description, etc. in a field called 'content'. We lose some flexibility (i.e. java layer can no longer control what gets in t

Solr Git Tags

2010-11-08 Thread Will Milspec
Hi all, (This question is more oriented to the developer but may be relevant to the solr user interested in perusing the source.) I've cloned the git lucene-solr repository and was surprised to find no tags. empty here: http://git.apache.org/lucene-solr.git/refs/tags/ Whereas the 'older' git

best way to get maxDocs in java (i.e. as on stats.jsp page).

2010-12-01 Thread Will Milspec
hi all, What's the best way to programmatically-in-java get the 'maxDoc' attribute (as seen on the stats.jsp page). I don't see any hooks on the solrj api. Currently I plan to use an http client to get stats.jsp (which returns xml) and parse it using xpath. If anyone can recommend a better appr

nexus of synonyms and stemming, take 2

2010-12-03 Thread Will Milspec
hi all, [This is a second attempt at emailing. The apache mailing list spam filter apparently did not like my synonyms entry, i.e. classified my email as spam. I have replaced phone with 'foo', 'cell' with 'sell' and 'mobile' with 'nubile'] This is a fairly basic synonyms question: how does syn

can solrj swap cores?

2010-12-03 Thread Will Milspec
hi all, Does solrj support "swapping cores"? One of our developers had initially tried swapping solr cores (e.g. core0 and core1) using the solrj api, but it failed. (don't have the exact error) He subsequently replaced the call with straight http (i.e. http client). Unfortunately I don't have th

How badly does NTFS file fragmentation impact search performance? 1.1X? 10X? 100X?

2010-12-08 Thread Will Milspec
Hi all, Pardon if this isn't the best place to post this email...maybe it belongs on the lucene-user list. Also, it's basically Windows-specific, so not of use to everyone... The question: does NTFS fragmentation affect search performance "a little bit" or "a lot"? It's obvious that "fragmentat

Tips for 'staggered date facets', i.e. 'last 24 hours, last week, last month, last year' , ala google news?

2010-12-10 Thread Will Milspec
hi all, We wish to implement date faceting with a 'sliding date range': 'last 24 hours, last week, last month, last year'. Google News currently implements such faceting when you search for a topic. As Solr's standard date faceting does not appear to meet this need, we will need to use facetin
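
The post above is cut off, so the approach the author had in mind is not known. One common way to get overlapping 'sliding' ranges is to send one `facet.query` per range, each using Solr date math relative to `NOW`. The following is a minimal sketch under that assumption. The field name `pub_date` and the class `SlidingDateFacets` are hypothetical, and the code only builds the parameter values; it does not call Solr.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class SlidingDateFacets {
    // Build one range query per display label over the given date field,
    // using Solr date math (NOW-1DAY, NOW-7DAYS, NOW-1MONTH, NOW-1YEAR).
    static Map<String, String> facetQueries(String dateField) {
        Map<String, String> queries = new LinkedHashMap<>();
        queries.put("last 24 hours", dateField + ":[NOW-1DAY TO NOW]");
        queries.put("last week", dateField + ":[NOW-7DAYS TO NOW]");
        queries.put("last month", dateField + ":[NOW-1MONTH TO NOW]");
        queries.put("last year", dateField + ":[NOW-1YEAR TO NOW]");
        return queries;
    }

    public static void main(String[] args) {
        // "pub_date" is a hypothetical field name; substitute the schema's date field.
        for (Map.Entry<String, String> e : facetQueries("pub_date").entrySet()) {
            System.out.println(e.getKey() + " -> facet.query=" + e.getValue());
        }
    }
}
```

Because the ranges overlap (a document from yesterday counts in all four), separate facet queries fit better than contiguous date-range buckets. Each value would be passed as its own `facet.query` parameter alongside `facet=true`, and URL-encoded when sent over HTTP.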

Where does admin UI visually distinguish between "master" and "slave"?

2011-01-12 Thread Will Milspec
Hi all, I'm getting started with a master/slave configuration for two solr instances. To distinguish between 'master' and 'slave', I've set the system properties (e.g. "-Dmaster.enabled") and am using the same 'solrconfig.xml'. I can see via the system properties admin UI that the jvm (and thus sol

Re: Where does admin UI visually distinguish between "master" and "slave"?

2011-01-12 Thread Will Milspec
2011 17:24:57 Otis Gospodnetic wrote: > > Hi Will, > > > > I don't think we have a clean "master" or "slave" label anywhere in the > > Admin UI. > > > > Otis > > > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch >

Anyone seen measurable performance improvement using Apache Portable Runtime (APR) with Solr and Tomcat

2011-01-12 Thread Will Milspec
Hi all, Has anyone used Apache Portable Runtime (APR) in conjunction with Solr and Tomcat? Has anyone seen (or better, measured) performance improvements when using APR? APR is a library that implements some functionality using Native C (see http://apr.apache.org/ and http://en.wikipedia.o

SOLR-236 (Field Collapsing) patch and 3.1

2011-04-08 Thread Will Milspec
Hi all, We're using the solr-236 (field collapsing) patch on solr 1.4.1 and wish to upgrade to 3.1 Has anyone applied this patch to 3.1, successfully or unsuccessfully? [ftr, Solr 4.x includes field collapsing; 3.1 does not ] The issue has several patch files, including some for 1.4.1 specifica

Git tag for 3.1 release?

2011-04-18 Thread Will Milspec
Hi all, Does the lucene-solr git repository have a tag that marks the 3.1 release? Context: I want to apply a patch to 3.1 and wish to start from a well-defined point (i.e. official 3.1 release) Executing these commands, I would have expected to see a tag marking the 3.1 release. I only see "b

Any chance of getting SOLR-949 into the application

2011-07-07 Thread Will Milspec
hi all, Our application requires term vectors and uses the SOLR-949 solrj patch to simplify the client layer. This patch eliminates the need to manually parse the xml returned by the tvrh (term vector response handler) https://issues.apache.org/jira/browse/SOLR-949 Can we get this in the head/tru

how to build lucene-solr (especially if behind a firewall)?

2011-07-12 Thread Will Milspec
hi all, building lucene/solr behind the firewall fails for us due to proxy errors. I tried setting the ant_opts -Dhttp.proxyHost, etc, but found the "lucene" portion still failed on javadoc links. I worked round this by changing failonjavadocerror to 'false' in lucene/common-build.xml (or altern

Synonym and Whitespaces and optional TokenizerFactory

2011-08-17 Thread Will Milspec
Hi all, This may be obvious. My question pertains to use of tokenizerFactory together with SynonymFilterFactory. Which tokenizerFactory does one use to treat "synonyms with spaces" as one token? Example: these two entries are synonyms: "lms", "learning management system" index time expansion wou

overhead of empty, unused fields

2011-08-18 Thread Will Milspec
hi all, What is the cost of unused field types? Our application supports multiple languages. We envision separate Lucene/Solr fields (and field types) per language (content_en, content_fr, content_zh_CN, etc.). We thought of a few options: a) auto-generating the 'multilingual' portion of the schema