RE: Newbie SolR - Need advice

2013-07-02 Thread David Quarterman
Hi Fabio, Like Jack says, try the tutorial. But to answer your question, SOLR isn't a bolt on to SQLServer or any other DB. It's a fantastically fast indexing/searching tool. You'll need to use the DataImportHandler (see the tutorial) to import your data from the DB into the indices that SOLR u

RE: Newbie SolR - Need advice

2013-07-02 Thread David Quarterman
obile ---- Original message From: "David Quarterman [via Lucene]" Date: 02/07/2013 16:57 (GMT+00:00) To: fabio1605 Subject: RE: Newbie SolR - Need advice Hi Fabio, Like Jack says, try the tutorial. But to answer your question, SOLR isn't a bolt on to S

RE: Newbie SolR - Need advice

2013-07-03 Thread David Quarterman
Hi Fabio, Sandeep is right - it'll take time. SOLR isn't straightforward when you first start out but the tutorial is the best first step. You can then adapt the various config files in the tutorial to adapt to your situation. I'd recommend a simple approach to get the hang of it and just index

SOLR 4.0 frequent admin problem

2013-07-04 Thread David Quarterman
Hi, About once a week the admin system comes up with SolrCore Initialization Failures. There's nothing in the logs and SOLR continues to work in the application it's supporting and in the 'direct access' mode (i.e. http://123.465.789.100:8080/solr/collection1/select?q=bingo:*). The cure is to

RE: SOLR 4.0 frequent admin problem

2013-07-04 Thread David Quarterman
m Yes :-) see SOLR-118, seems an old issue... On 4 Jul 2013 06:43, "David Quarterman" wrote: > Hi, > > About once a week the admin system comes up with SolrCore > Initialization Failures. There's nothing in the logs and SOLR > continues to work in the application

RE: Commit different database rows to solr with same "id" value?

2013-07-10 Thread David Quarterman
Hi Jason, Assuming you're using DIH, why not build a new, unique id within the query to use as the 'doc_id' for SOLR? We do something like this in one of our collections. In MySQL, try this (don't know what it would be for any other db but there must be equivalents): select @rownum:=@rownum+1

RE: Facet sorting seems weird

2013-07-15 Thread David Quarterman
Hi Henrik, Try setting up a copyfield in your schema and set the copied field to use something like 'text_ws' which implements LowerCaseFilterFactory. Then sort on the copyfield. Regards, DQ -Original Message- From: Henrik Ossipoff Hansen [mailto:h...@entertainment-trading.com] Sent:

Edismax odd results

2013-02-19 Thread David Quarterman
Hi all, We have an index of boots which contains harness boots, engineer boots , ankle boots, etc. An edismax search on the index for 'harness boots' brings back 2,175 boots with 'harness' results at the top. 'Searching 'engineer boots' brings back everything but 'engineer boots', same for 'ank

RE: Edismax odd results

2013-02-19 Thread David Quarterman
ing scored - you may have to add some specific query phrases to force "engineer boot" into the top results to comparing the scoring. -- Jack Krupansky -Original Message- From: David Quarterman Sent: Tuesday, February 19, 2013 6:21 AM To: solr-user@lucene.apache.org Subject: Edis

RE: Edismax odd results

2013-02-19 Thread David Quarterman
em there, then the Solr log (assuming you haven't changed the default log level of INFO) should have a record of what parameters were actually received when the query was made. Thanks, Shawn On 2/19/2013 9:14 AM, David Quarterman wrote: > Hi Jack, > > Here's q test query we&

RE: Edismax odd results

2013-02-19 Thread David Quarterman
Hi Shawn/Jack, The log shows the query going in okay, nothing gets stripped out so we're still at a loss to understand this. Could it be theta Snowball stemming is too invasive? Regards, DQ -Original Message- From: David Quarterman [mailto:da...@corexe.com] Sent: 19 February 20

RE: Edismax odd results

2013-02-19 Thread David Quarterman
mming reduces 'engineer' to 'engin' so I'd have expected a lot more results. Anyone got any ideas? Regards, DQ -----Original Message- From: David Quarterman [mailto:da...@corexe.com] Sent: 19 February 2013 17:09 To: solr-user@lucene.apache.org Subject: RE: Edismax od

Re: Edismax odd results

2013-02-19 Thread David Quarterman
Hi Shawn, Now finished for the day but will post the schema tomorrow. Thanks for the help (and Jack too). Regards, DQ P.S. did reindex after changing schema and the analyzer/query stuff matches precisely!! Shawn Heisey wrote: On 2/19/2013 11:16 AM, David Quarterman wrote: > This

RE: Edismax odd results

2013-02-20 Thread David Quarterman
2013 11:16 AM, David Quarterman wrote: > This is definitely driving us mad now! Changed to PorterStemming and there's > very little difference. > > If we add fq=engineer, we get 0 results. Add fq=engineer* and we get the 90 > in the system. Try with fq=ankle* and we get 2

RE: Edismax odd results

2013-02-20 Thread David Quarterman
rg Subject: Re: Edismax odd results When you get back to this tomorrow, also try and paste the parsed query bits you get back when you append &debug=all. Sometimes it's surprising what the parsed query _really_ looks like Best Erick On Tue, Feb 19, 2013 at 3:13 PM, David Quart

RE: Edismax odd results

2013-02-20 Thread David Quarterman
ts you get back when you append &debug=all. Sometimes it's > surprising what the parsed query _really_ looks like > > Best > Erick > > > On Tue, Feb 19, 2013 at 3:13 PM, David Quarterman > wrote: > > > Hi Shawn, > > > > Now finished for

RE: If we Open Source our platform, would it be interesting to you?

2013-02-21 Thread David Quarterman
Hi Marcelo, Looked through your site and the framework looks very powerful as an aggregator. We do a lot of data aggregation from many different sources in many different formats (XML, JSON, text, CSV, etc) using RDBMS as the main repository for eventual SOLR indexing. A 'one-stop-shop' for all

RE: Edismax odd results

2013-02-22 Thread David Quarterman
Hi Erick, Funnily enough, I cracked it about 5 minutes before your email arrived! Problem was using WhiteSpaceTokenizer instead of Standard AND had the LowerCaseFilter after the PorterStemmingFilter. Getting them in the right order has solved all the problems and we get all our engineer boots,

RE: Building a central index with Lucene + Solr

2013-03-05 Thread David Quarterman
Hi Alvaro, I agree with Otis & Alexandre (esp. Windows + PHP!). However, there are plenty of people using Solr & PHP out there very successfully. There's another good package at http://code.google.com/p/solr-php-client/ which is easy to implement and has some example usage. Regards, DQ Fr

SOLR 4.0 Beta documents being duplicated

2012-10-05 Thread David Quarterman
Hi, We've been using V4.x of SOLR since last November without too much trouble. Our MySQL database is refreshed daily and a full import is run automatically after the refresh and generally produces around 86,000 products, obviously on unique doc_id's. So, we upgraded to 4.0 Beta a few days ago

RE: SOLR 4.0 Beta documents being duplicated

2012-10-05 Thread David Quarterman
staging_area/lucene-solr-4.0RC1-rev13911 44/solr/ There will be an RC2 sometime, a couple of problems have been found, but using RC1 should minimize any update to the official 4.0 plus have a lot of improvements over BETA... Best Erick On Fri, Oct 5, 2012 at 10:25 AM, David Quarterman wrote:

RE: Feature & design question: use autocomple?te to search on 2 different fields, and return 2 different data groups

2012-11-01 Thread David Quarterman
We had a similar requirement and found the best solution (unfortunately) was to spend a small amount of money. Have a look at Sematext's site (www.sematext.com). Their Autocomplete is awesome and we have a fantastic looking AC now on our development site, grouped by category, product & brand with p

RE: Feature & design question: use autocomple?te to search on 2 different fields, and return 2 different data groups

2012-11-01 Thread David Quarterman
Fernando, Pretty much the problem we came up against. We had a basic AC running using SpellChecker a while ago but it was the grouping that floored us and sent us elsewhere. Again, multiple queries seemed like the only possible answer but in an AC scenario, even with SOLR's speed, probably too slo