RE: Indexing data from pdf

2012-05-11 Thread Dyer, James
get away with changing this: to this: This assumes, of course, for these pdf documents the "fake_id" should always be the same as the "id". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: anarchos78 [mailto:rigasathan

RE: Issue in Applying patch file

2012-05-15 Thread Dyer, James
1.6 or 1.7 JDK (the default JDK on some linux distributions is often inadequate) Issue "javac -version" from the command line as an insanity check. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: mechravi25 [mailto:mechrav...@yahoo.c

RE: Exception in DataImportHandler (stack overflow)

2012-05-15 Thread Dyer, James
nd up with strange conditions like the one you're getting? It could be a case where your data has grown just over the limit your setup can handle under such an unfortunate circumstance. Let me know if this solves it. If so, we probably should open a bug report and get this fixed in DIH

RE: Issue in Applying patch file

2012-05-17 Thread Dyer, James
the Maven repository. Possibly your internet connection was down or you're behind a proxy that doesn't allow it? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: mechravi25 [mailto:mechrav...@yahoo.co.in] Sent: Thursday, May 17, 2012 7

RE: Use DIH with more than one entity at the same time

2012-05-17 Thread Dyer, James
But the approach taken in the link Jack cites below does work. It's a pain to set it up though. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Thursday, May 17, 2012 10:21 AM To: solr-user@lucene.apache.org Subject:

RE: Exception in DataImportHandler (stack overflow)

2012-05-17 Thread Dyer, James
Shawn, Do you think this behavior is because, while the indexing is paused, you reach some type of timeout so either your db or the jdbc cuts the connection? Or, are you thinking something in the DIH/JDBCDataSource code is causing the connection to drop under these circumstances? James Dyer E

RE: index-time boosting using DIH

2012-05-22 Thread Dyer, James
See http://wiki.apache.org/solr/DataImportHandler#Special_Commands and the $docBoost pseudo-field name. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Tuesday, May 22, 2012 2:12 PM To: solr-user

RE: index-time boosting using DIH

2012-05-22 Thread Dyer, James
ELSE null END as '$docBoost', ...other fields... from some_table, etc Either way I wouldn't expect it to make the indexing noticeably slower. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.co

RE: problem on running fullimport

2012-05-24 Thread Dyer, James
chSize%28int%29 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: pla [mailto:patrick.archib...@gmail.com] Sent: Thursday, May 24, 2012 9:10 AM To: solr-user@lucene.apache.org Subject: Re: problem on running fullimport Thanks Alexey Serba. I

RE: spellcheck collate with fq parameters SOLR-2010

2012-05-31 Thread Dyer, James
ue you set). You can verify the correctness of this by setting "spellcheck.maxCollationTries" to zero (no checking) and then re-trying the collation(s) it suggests by hand (with the same "fq" params, etc). James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -

RE: possible status codes from solr during a (DIH) data import process

2012-05-31 Thread Dyer, James
t. Someday it'll be nice if we can come up with a better way than this to programmatically interact with DIH... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Thursday, May 31, 2012 2:43 PM To: solr

RE: why DIH works in normal mode,error in debug mode

2012-06-01 Thread Dyer, James
ally changes it to "Integer.MIN_VALUE". According to the wiki this is to fix this error.) The value of "batchSize" is used on the java.sql.Statement objects with "setFetchSize(batchSize)". example: see: http://wiki.apache.org/solr/DataImportHandler#Configuring_

RE: Data Import Handler fields with different values in column and name

2012-06-01 Thread Dyer, James
a-config.xml, then go ahead and double-check spelling and case in all your config files. Besides a typo somewhere, I'm not sure what else would cause this not to map.) James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Rafael Taboada [mailto:kalim

RE: why DIH works in normal mode,error in debug mode

2012-06-01 Thread Dyer, James
Try setting it to 0 or -1. Or check the Mysql JDBC driver documentation about valid values for "Statement.setFetchSize()" I think someone else recently asked on this same list about problems with the latest Mysql driver and fetch sizes, so this driver may be particularly finicky.

RE: Data Import Handler fields with different values in column and name

2012-06-01 Thread Dyer, James
one, I'd definitely try it without multiple mappings to see if it fixes it. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Friday, June 01, 2012 10:34 AM To: solr-user@lucene.apache.org

RE: Can't index sub-entitties in DIH

2012-06-05 Thread Dyer, James
ELECT with "AS" clauses. - in cases the columns are the same in SELECT as what you have in schema.xml, omit the tags (see http://wiki.apache.org/solr/DataImportHandler#A_shorter_data-config) These are shot-in-the-dark guesses. I wouldn't expect this to matter but you might as w

RE: issues with spellcheck.maxCollationTries and spellcheck.collateExtendedResults

2012-06-06 Thread Dyer, James
o remove them from the dictionary). For that there is https://issues.apache.org/jira/browse/SOLR-2585 which is part of Solr 4. The only other workaround is "onlyMorePopular" which has its own issues. (see http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.alternativeTerm

RE: Writing custom data import handler for Solr.

2012-06-11 Thread Dyer, James
essor: http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_6/solr/contrib/dataimporthandler/src/java/org/apache/solr/handler/dataimport/SolrEntityProcessor.java James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Lance Norskog [mailto

RE: PageRanking with DIH

2012-06-12 Thread Dyer, James
29 . Also, the Packt Solr book (Smiley&Pugh) has a nice section about boosting scores based on page-rank or popularity type fields. In the old first edition it's chapter 5, "enhanced searching". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Mess

RE: DIH idle in transaction forever

2012-06-14 Thread Dyer, James
he least caused some log warnings, and it wouldn't work at all without readOnly=false. Not sure about PostgreSql. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Jasper Floor [mailto:jasper.fl...@m4n.nl] Sent: Thursday, June 14, 2012 8

RE: WordBreak and default dictionary crash Solr

2012-06-15 Thread Dyer, James
? https://issues.apache.org/jira/browse/SOLR-2993 Shortly after the initial commit of this feature, a bug similar to the one you're reporting was later fixed with these subsequent revisions. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From:

Containers running SOLR: supported or unsupported?

2010-11-18 Thread Dyer, James
t. I guess if anyone has a story of how they had difficulty with SOLR and the container it ran and having Support helped, or if you wish you had Support and didn't, I'd like to hear it. I appreciate your feedback and advice. Thanks. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311

RE: Dataimport performance

2010-12-16 Thread Dyer, James
e process. However, I have it running fairly fast by partitioning the data by a modulus of the primary key and then running several jobs in parallel. The trick is not to get I/O bound. Things run fast if you can set it up to maximize CPU. James Dyer E-Commerce Systems Ingram Content Group (615)
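The partitioning idea in the snippet above can be sketched as follows. This is only an illustration under stated assumptions: the table name `items`, the integer key column `id`, and the availability of a SQL `MOD()` function are hypothetical and do not come from the original message. Each generated query would feed one of the parallel import jobs.

```python
def partition_queries(table, key, num_partitions):
    """Build one SELECT per partition, splitting rows by key modulus."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    return [
        # Each query selects the rows whose key leaves this remainder.
        f"SELECT * FROM {table} WHERE MOD({key}, {num_partitions}) = {remainder}"
        for remainder in range(num_partitions)
    ]


# Hypothetical table and column names, four parallel jobs.
queries = partition_queries("items", "id", 4)
for query in queries:
    print(query)
```

Every row falls into exactly one partition, so the jobs do not overlap; how many partitions to use depends on where the I/O and CPU limits are, as the message notes.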

RE: A schema inside a Solr Schema (Schema in a can)

2010-12-17 Thread Dyer, James
our old system, we had to do something like this: contrib: dyer, james|author|123 contrib: smith, sam|editor|456 But Lucene/Solr will guarantee that multivalued fields return in exactly the same order you put them in. So with SOLR we can do this: contrib_name: dyer, james contrib_n

RE: A schema inside a Solr Schema (Schema in a can)

2010-12-17 Thread Dyer, James
There's also one "gotcha" we've experienced when searching across multi-valued fields: SOLR will match across field occurrences. In the example below, if you were to search q=contrib_name:(james AND smith), you will get this record back. It matches one name from one con

RE: A schema inside a Solr Schema (Schema in a can)

2010-12-20 Thread Dyer, James
Dennis, If you need to search a key/value pair, you'll have to put them both in the same field, somehow. One way is to re-index them using the key in the fieldname. For instance, suppose you have: contributor: dyer, james contributor: smith, sam role: author role: editor ...but you

RE: White space in facet values

2010-12-22 Thread Dyer, James
The phrase solution works as does escaping the space with a backslash: fq=Product:Electric\ Guitar ... actually a lot of characters need to be escaped like this (ampersands and parentheses come to mind)... I assume you already have this indexed as "string", not "text"
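The backslash-escaping approach described above can be sketched like this. It is a minimal illustration, not part of the original message: the helper name `escape_term` is hypothetical, and the character set is an assumption based on the characters the Lucene query parser treats as syntax.

```python
# Characters assumed to be query-parser syntax; whitespace is handled
# separately in the loop below.
SPECIAL_CHARS = set('+-&|!(){}[]^"~*?:\\/')


def escape_term(value):
    """Backslash-escape query-syntax characters and whitespace in a term."""
    out = []
    for ch in value:
        if ch in SPECIAL_CHARS or ch.isspace():
            out.append("\\")
        out.append(ch)
    return "".join(out)


fq = "Product:" + escape_term("Electric Guitar")
print(fq)
```

The escaped value still has to be URL-encoded when it is sent as an HTTP parameter; that step is separate from query-syntax escaping.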

edismax inconsistency -- AND/OR

2010-12-22 Thread Dyer, James
t always insert the operator (as in examples 6 & 7). However, this is an extra burden on our clients that I'd like to avoid if at all possible. See below for more configuration information. Any ideas are appreciated. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 Snippets from schema.xml: ... ...

Using two request handlers in the same query...

2010-12-22 Thread Dyer, James
_SI}faith") , using the technique found at http://www.lucidimagination.com/blog/2009/03/31/nested-queries-in-solr/ Is there any way to do this? I'm using version 1.4.1 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311

RE: edismax inconsistency -- AND/OR

2010-12-22 Thread Dyer, James
eturns an instance of "BooleanQuery", right? Any more advice anyone can give is appreciated! For the client I'm responsible for, I'm just inserting explicit operators between all of the user's queries. But for the client I'm not responsible for I would love to h

RE: Issue using filter query with spellCheck component

2010-12-30 Thread Dyer, James
1 as a patch. See https://issues.apache.org/jira/browse/SOLR-2010 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: taimurAQ [mailto:taimur_qure...@hotmail.com] Sent: Thursday, December 30, 2010 12:20 AM To: solr-user@lucene.apache.org Subj

StopFilterFactory and "qf" containing some fields that use it and some that do not

2011-01-12 Thread Dyer, James
stemming gets applied regardless. But stop words do not. Does anyone have ideas on what is going on? Is this a feature or possibly a bug? Any known workarounds? Any advice is appreciated. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311

RE: StopFilterFactory and "qf" containing some fields that use it and some that do not

2011-01-12 Thread Dyer, James
asking the mail list! It looks like the only workarounds I have is to either filter out the stopwords in the client when this happens, or enable stop words for all the fields that are used in "qf" with stopword-enabled fields. Unless...someone has a better idea?? James Dyer E-Commerce

RE: StopFilterFactory and "qf" containing some fields that use it and some that do not

2011-01-13 Thread Dyer, James
(e)dismax had a flag you could set that would tell it that if any analyzers removed a term, then that term would become optional for any fields for which it remained? I'm not sure what the development effort would be, but perhaps it would be a nice way to circumvent this problem in a future rele

RE: Spell Checking a multi word phrase

2011-01-17 Thread Dyer, James
on the component's wiki page: http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate This feature is committed to 3.x and 4.x and is available as a patch for 1.4.1 (here: https://issues.apache.org/jira/browse/SOLR-2010). James Dyer E-Commerce Systems Ingram Content Group (615) 213-

RE: spellchecking even the key is true....

2011-01-17 Thread Dyer, James
st of the time. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: satya swaroop [mailto:satya.yada...@gmail.com] Sent: Monday, January 17, 2011 10:32 AM To: solr-user@lucene.apache.org Subject: spellchecking even the key is true Hi All,

RE: Spell Checking a multi word phrase

2011-01-17 Thread Dyer, James
g "spellcheck.q" in this case in order to get the analysis right. I'm not sure if this is the solution but I thought I'd mention it. I never tried spell checking this way because it seemed very limited and possibly quite expensive. James Dyer E-Commerce Syst

RE: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Dyer, James
mirrors them internally or via a downstream project) James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Grant Ingersoll [mailto:gsing...@apache.org] Sent: Tuesday, January 18, 2011 3:04 PM To: java-u...@lucene.apache.org; solr-user@lucene.apache.org

RE: Spellcheck Phrases

2011-02-23 Thread Dyer, James
astException on the float value: text_spelling spellchecker Spelling_Dictionary text_spelling true .001 I have it on my to-do list to look into this further but haven't yet. If you decide to try it and can get it to work, please let me know how you do it. James Dyer E

RE: setting different solrconfig.xml for a core

2011-02-28 Thread Dyer, James
"solrconfig.xml". But I primed it by giving it its config file in-sync to start with. I ended up going the same-config-file-everywhere route though because we're using our master to handle requests when it's not indexing (one less server to buy)... James Dyer E-Commerce Syst

RE: setting different solrconfig.xml for a core

2011-02-28 Thread Dyer, James
H will honor it. Also see the unit test "TestPluginEnable.java", which seems to show that the StandardRequestHandler will honor it. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Jonathan Rochkind [mailto:rochk...@jhu.edu] Sent: Monday, Februar

noobie question: sorting

2011-03-15 Thread James Lin
Hi Guys, came across this sorting query query ({!v="category: 445"}) desc I understand it is sorting on exact match of category = 445, I don't quite understand the syntax, could someone please elaborate a bit for me? So I can reuse this syntax in the future. Regards James

Re: noobie question: sorting

2011-03-16 Thread James Lin
AWESOME, thanks for your time! Regards James On Wed, Mar 16, 2011 at 6:14 PM, David Smiley (@MITRE.org) < dsmi...@mitre.org> wrote: > Hi. Where did you find such an obtuse example? > > Recently, Solr supports sorting by function query. One such function is > named "que

Helpful new JVM parameters

2011-03-17 Thread Dyer, James
" pauses during CMS much shorter: -XX:+CMSParallelRemarkEnabled We use these in conjunction with CMS/ParNew and a 22gb heap (64gb total on the box), with a 1.2G newSize/maxNewSize. In case anyone else is having similar issues, we thought we would share our experience with these newer options. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311

RE: Helpful new JVM parameters

2011-03-17 Thread Dyer, James
all depends on your situation and you need to test. In our case, this flag solved a real problem we were having. Whoever wrote the JRockit book you refer to no doubt had other scenarios in mind... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From:

RE: Spellcheck suggestions as solr docs

2011-04-14 Thread Dyer, James
tps://issues.apache.org/jira/browse/SOLR-2462 for information.) This does not affect Solr 1.4 unless you've applied SOLR-2010. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: sivaprasad [mailto:sivaprasa...@echidnainc.com] Sent: Th

solr sorting on multiple conditions, please help

2011-04-25 Thread James Lin
the actual score even the results all matches area_id 78153 I am getting results like this Area 2, score 0.21 Area 3, score 0.38 Area 4, score 0.23 but the result should be like this Area 3, score 0.38 Area 4, score 0.23 Area 2, score 0.21 Thanks heaps in advanced. Regards James

RE: Patch problems solr 1.4 - solr-2010

2011-05-04 Thread Dyer, James
egment, you can get it to apply the change to "PossibilityIterator.java", which is all that is necessary to fix the bug. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: roySolr [mailto:royrutten1...@gmail.com] Sent: Wednesday, May 04,

RE: Patch problems solr 1.4 - solr-2010

2011-05-04 Thread Dyer, James
I also should mention that solr-2010 is incorporated in Solr 3.1, so if you can upgrade you won't need a patch. Note, however, that you will still want to apply the fix in solr-2462 regardless of the version as this fix hasn't been committed anywhere. James Dyer E-Commerce Syst

Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
Hi All, I have solr and tika installed and am happily extracting and indexing various files. Unfortunately on some word documents it blows up since it tries to auto-generate a 'title' field but my title field in the schema is single valued. Here is my config for the extract handler... class

Re: Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
Luker wrote: Hi Emyr, You could try using the "extractOnly=true" parameter [1]. Of course, you'll need to repost the extracted text manually. --jay [1] http://wiki.apache.org/solr/ExtractingRequestHandler#Extract_Only On Thu, May 5, 2011 at 9:36 AM, Emyr James wrote: Hi All,

RE: Patch problems solr 1.4 - solr-2010

2011-05-05 Thread Dyer, James
more popular" terms. To my knowledge, there is no way to get the spellchecker to do that currently. On the other hand, if you're pretty sure "man" is not in the dictionary, try upping spellcheck.count to something higher than the default (20 maybe?)... James Dyer E-Commerc

Re: Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
n help you for the meta data fields. References- http://wiki.apache.org/solr/SchemaXml#Dynamic_fields http://wiki.apache.org/solr/ExtractingRequestHandler#Input_Parameters http://wiki.apache.org/solr/TikaExtractOnlyExampleOutput Regards, Anuj On Thu, May 5, 2011 at 7:28 PM, Emyr James wrote:

RE: SolrQuery API for adding group filter

2011-05-10 Thread Dyer, James
;group.main=true", then the response will come back with a normal SolrDocumentList and you won't need code like above. One caveat to this is my testing showed a significant performance and/or memory usage hit from using these two parameters. I didn't dig any deeper to try and fig

RE: spellcheck.collate returning all results

2011-05-23 Thread Dyer, James
ionTries") you should apply SOLR-2462 first (see https://issues.apache.org/jira/browse/SOLR-2462 for information & a patch). It is likely this (or a similar fix) will eventually get committed and included in the next bug-fix release, should there be one. James Dyer E-Commerce Sys

RE: DIH import and postImportDeleteQuery

2011-05-25 Thread Dyer, James
y only needs minor tweaking. The jira ticket is here: https://issues.apache.org/jira/browse/SOLR-2492 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Alexandre Rocco [mailto:alel...@gmail.com] Sent: Wednesday, May 25, 2011 12:54 PM To: solr-us

RE: DIH import and postImportDeleteQuery

2011-05-25 Thread Dyer, James
Great. I wasn't aware of the other issue. I put a link on the 2 issues in JIRA so people can know in the future. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Alexandre Rocco [mailto:alel...@gmail.com] Sent: Wednesday, May 25, 2011

RE: Spellcheck: Two dictionaries

2011-05-26 Thread Dyer, James
ry. Create a new field and "copyField" the values into it. Base your uber-dictionary on this new field. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: roySolr [mailto:royrutten1...@gmail.com] Sent: Thursday, May 26, 2011 3:24 AM T

RE: Returning documents using multi-valued field

2011-05-26 Thread Dyer, James
the position of the next occurrence by the positionIncrementGap value, this will effectively prevent Lucene/Solr from matching across occurrences. The downside to this workaround is that wildcards are not permitted in phrase searches. So if you need wildcard support also, then you're o

RE: Spellcheck: Two dictionaries

2011-05-27 Thread Dyer, James
llation queries and testing them against the index prior to returning them. SOLR-2010 (included in 3.1) got us part of the way there but there is still more work to do. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: roySolr [mailto:royrutten1..

RE: Spellcheck Phrases

2011-06-01 Thread Dyer, James
ling_Dictionary text ./spellchecker .01 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Tanner Postert [mailto:tanner.post...@gmail.com] Sent: Friday, May 27, 2011 6:04 PM To: solr-user@lucene.apache.org Subject: Re: Spellcheck Phrases

RE: Spellcheck Phrases

2011-06-02 Thread Dyer, James
s the quality of our spell suggestions, now that I know how to use it properly. Sorry about the mis-information earlier. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Dyer, James Sent: Wednesday, June 01, 2011 3:02 PM To: solr

RE: SpellCheckComponent performance

2011-06-07 Thread Dyer, James
Demian, If you omit "spellcheckIndexDir" from the configuration, it will create an in-memory spelling dictionary. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Demian Katz [mailto:demian.k...@villanova.edu] Sent: Tuesday, June

RE: Processing/Indexing CSV

2011-06-09 Thread Dyer, James
might not work at all ) ... I also didn't do anything with encodings but I'm not sure this will be an issue either... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Helmut Hoffer von Ankershoffen [mailto:helmut...@googlemail.com] Sen

RE: Spellcheck compounded words

2011-07-25 Thread Dyer, James
t might be your best hope if this is a feature you've absolutely got to have. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: O. Klein [mailto:kl...@octoweb.nl] Sent: Friday, July 22, 2011 8:11 PM To: solr-user@lucene.apache.org Subjec

RE: Spellcheck compounded words

2011-07-25 Thread Dyer, James
ution... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Monday, July 25, 2011 10:13 AM To: solr-user@lucene.apache.org Cc: Dyer, James Subject: Re: Spellcheck compounded words This will wor

RE: Spellcheck compounded words

2011-07-26 Thread Dyer, James
ons" to something higher. Just keep in mind that with "maxCollationTries" at zero, the collations aren't guaranteed to return any hits. - I'm not so sure shingles will work with the collation feature at all. - I've heard that when using shingles, you have to put

RE: Spellcheck compounded words

2011-07-26 Thread Dyer, James
to try and figure out if this is a bug and if so, then try and fix it. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: O. Klein [mailto:kl...@octoweb.nl] Sent: Tuesday, July 26, 2011 11:37 AM To: solr-user@lucene.apache.org Subject: RE: Spellcheck

RE: Spellcheck compounded words

2011-07-27 Thread Dyer, James
sn't look like a possibility. If you are able to figure anything more out on this yourself, then please post. If this is a real bug, then we ought to get it fixed. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: O. Klein [mailto:kl...@oct

RE: Solr DataImport with multiple DBs

2011-07-28 Thread Dyer, James
have to run the deltas using "command=full-import / clean=false" as shown here: http://wiki.apache.org/solr/DataImportHandlerFaq#fullimportdelta James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.
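The "command=full-import / clean=false" request mentioned above could be assembled as in the sketch below. The host, port, and `/dataimport` handler path are assumptions for illustration; use whatever path the DataImportHandler is registered under in your own solrconfig.xml.

```python
from urllib.parse import urlencode


def dih_delta_url(handler_url):
    """Build a DIH request that runs a full-import without wiping the index."""
    params = {"command": "full-import", "clean": "false"}
    return f"{handler_url}?{urlencode(params)}"


# Hypothetical handler location.
url = dih_delta_url("http://localhost:8983/solr/dataimport")
print(url)
```

With `clean=false` the existing documents are kept, so the query in data-config.xml is what has to restrict the import to changed rows.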

RE: field with repeated data in index

2011-07-28 Thread Dyer, James
You need to index the field you want to facet on. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mark juszczec [mailto:mark.juszc...@gmail.com] Sent: Thursday, July 28, 2011 3:50 PM To: solr-user@lucene.apache.org Subject: field with repeated

RE: field with repeated data in index

2011-07-28 Thread Dyer, James
"Lucid reference guide". These are both excellent detailed tutorials and should help you get up-to-speed pretty fast. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mark juszczec [mailto:mark.juszc...@gmail.com] Sent: Thursday, Jul

RE: Updating opinion

2011-07-29 Thread Dyer, James
y long as you please (and commit) on the new core. At the stroke of midnight the next day, swap cores. This way you can control (nearly) the exact moment the new data becomes public. See http://wiki.apache.org/solr/CoreAdmin#SWAP James Dyer E-Commerce Systems Ingram Content Group (615)
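A core swap of the kind described above is a single CoreAdmin request. The sketch below only builds the URL; the core names `staging` and `live` and the host are hypothetical, and the admin path may differ in your installation.

```python
from urllib.parse import urlencode


def swap_cores_url(admin_url, core, other):
    """Build a CoreAdmin SWAP request that exchanges two cores."""
    params = {"action": "SWAP", "core": core, "other": other}
    return f"{admin_url}?{urlencode(params)}"


# Hypothetical core names: index into "staging", then swap it with "live".
swap_url = swap_cores_url("http://localhost:8983/solr/admin/cores", "staging", "live")
print(swap_url)
```

Issuing this request from a scheduled job is one way to control the moment the new data becomes public, which is the point the message makes.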

RE: Updating opinion

2011-07-29 Thread Dyer, James
r than the commit, so you want to avoid doing one if you can. You still won't be able to guarantee the new documents are available right at the stroke of midnight but you can probably usually be close. If you need to be precise, you'll probably want to use 2 cores. James Dyer E-Com

RE: Spell Check

2011-08-02 Thread Dyer, James
ictionary, Solr's spellchecker will not try to correct it. See https://issues.apache.org/jira/browse/SOLR-2585 for a potential solution to this problem. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: tamanjit.bin...@yahoo.co.in [mailto

RE: how to enable MMapDirectory in solr 1.4?

2011-08-08 Thread Dyer, James
be Solaris. In Windows, there is a definite advantage for using MMapDirectory on a 64-bit system. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Li Li [mailto:fancye...@gmail.com] Sent: Monday, August 08, 2011 4:09 AM To: solr-user@lucene.apac

RE: Problem with DIH: How to map key value pair stored in 1-N relation from a JDBC Source?

2011-08-09 Thread Dyer, James
If you would rather use a scripting language such as javascript instead of writing your Transformer in java, see http://wiki.apache.org/solr/DataImportHandler#ScriptTransformer . James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Christian Bordi

RE: SolR : eDismax does not always use the defaultOperator "AND"

2011-08-17 Thread Dyer, James
he person who opened the issue reported this for 3.3. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Valentin [mailto:igorlacro...@gmail.com] Sent: Wednesday, August 17, 2011 9:05 AM To: solr-user@lucene.apache.org Subject: Re: SolR : eDisma

RE: Return records based on aggregate functions?

2011-08-17 Thread Dyer, James
Daniel, This looks like a good usecase for FieldCollapsing (see http://wiki.apache.org/solr/FieldCollapsing). Perhaps try something like: &group=true&group.field=documentId&group.limit=1&group.sort=version desc James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311
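The grouping parameters suggested above can be URL-encoded as in this sketch. The parameter names and values are taken from the message; everything else (the variable names, encoding with Python's standard library) is illustrative only.

```python
from urllib.parse import urlencode

# Grouping parameters from the message: one document per documentId,
# choosing the highest version within each group.
group_params = [
    ("group", "true"),
    ("group.field", "documentId"),
    ("group.limit", "1"),
    ("group.sort", "version desc"),
]

query_string = urlencode(group_params)
print(query_string)
```

The space in `version desc` is encoded as `+`; these parameters would be appended to the rest of the query (`q`, `fq`, and so on).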

RE: Return records based on aggregate functions?

2011-08-17 Thread Dyer, James
Field Collapsing than what was committed so it might not be 100% accurate. http://www.lucidimagination.com/search/document/148ba23aec5ee2d8/solrquery_api_for_adding_group_filter James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Daniel Skiles

RE: Return records based on aggregate functions?

2011-08-17 Thread Dyer, James
Yes: solrquery.add("group.main", true); solrquery.add("group.format", "simple"); James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Daniel Skiles [mailto:daniel.ski...@docfinity.com] Sent: Wednesday, August

RE: Solr spellcheck and multiple collations

2011-08-17 Thread Dyer, James
o know how many hits each collation will return, also specify "spellcheck.collateExtendedResults=true" - See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.maxCollations for more information James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Me

RE: Solr spellcheck and multiple collations

2011-08-17 Thread Dyer, James
won't work in 1.4 without the SOLR-2010 patch. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Herman Kiefus [mailto:herm...@angieslist.com] Sent: Wednesday, August 17, 2011 4:55 PM To: solr-user@lucene.apache.org Subject: RE: Solr spellch

RE: Full sentence spellcheck

2011-08-18 Thread Dyer, James
If you use "spellcheck.q", you also still need to specify "q" for the queryhandler, otherwise you'll get an NPE. Not sure that's your problem but it's one thing to check. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Origin

RE: Getting DIH status with SolrJ

2011-08-24 Thread Dyer, James
ot;"); } Not sure this is what you're after, but maybe it'd be helpful. Like I say, I wish [I knew of|there was] a better way to do this... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Shawn Heisey [mailto:s...@elyograg.o

RE: LineEntityProcessor Usage

2012-06-28 Thread Dyer, James
s in fixed-width or delimited format. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: kiran kumar [mailto:kirankumarsm...@gmail.com] Sent: Wednesday, June 27, 2012 11:03 PM To: solr-user@lucene.apache.org Subject: LineEntityProcessor Usage Hello

RE: WordBreakSolrSpellChecker ignores MinBreakWordLength?

2012-06-28 Thread Dyer, James
rks out for you. Any more feedback you can provide on the newer spellcheck features you're using is appreciated. Thanks. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Carrie Coy [mailto:c...@ssww.com] Sent: Thursday, June 28, 2012

RE: DIH - unable to ADD individual new documents

2012-07-03 Thread Dyer, James
batching isn't your answer. But DIH isn't designed at all to work well with NRT either... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Klostermeyer, Michael [mailto:mklosterme...@riskexchange.com] Sent: Tuesday, July 03,

RE: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-06 Thread Dyer, James
pellcheck.collateParam.mm=100%", if your original query had a very low "mm" value. This will make the collations returned more meaningful to the user. - maybe specify "spellcheck.alternativeTermCount" to something >0 if you want the spellchecker to consider that the

RE: Better (and valid) Spellcheck in combination with other parameters with at least one occurance

2012-07-09 Thread Dyer, James
Yes, "maxCollationTries" tests the new (collation) queries with all the same parameters as the original query. Most notably, it uses the same "fq" parameters so it will take in account any filters you were using. James Dyer E-Commerce Systems Ingram Content Group (615) 213-

RE: maxNumberOfBackups does not cleanup - jira 3361

2012-07-10 Thread Dyer, James
I'm also certain that it would apply to both "oncommit" and "onoptimize". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Tuesday, July 10, 2012 8:48 AM To: solr-user@

RE: How To apply transformation in DIH for multivalued numeric field?

2012-07-18 Thread Dyer, James
lds as multi-valued automatically. I guess your approach leads to a lot fewer rows getting sent from your db to Solr though. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Pranav Prakash [mailto:pra...@gmail.com] Sent: Wednesday, Ju

RE: SpellCheckComponent Collation query

2012-08-09 Thread Dyer, James
the client, along with some type of notation that the result is actually an automatically-applied spellcheck response. Then there's the added complexity of doing this all in a distributed setup. It would be a nice feature to have though. James Dyer E-Commerce Systems Ingram Content Group

RE: Dataimport Handler in solr 3.6.1

2012-08-14 Thread Dyer, James
ese really need to be fixed so any testing help would be much appreciated. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: mechravi25 [mailto:mechrav...@yahoo.co.in] Sent: Tuesday, August 14, 2012 8:04 AM To: solr-user@lucene.apache.org Subject

RE: Dataimport Handler in solr 3.6.1

2012-08-30 Thread Dyer, James
nfigured and there could be something that was broken that I have not uncovered myself. Any issues that may exist with SOLR-2382 need to be identified and fixed in the 4.x branch as soon as possible. I apologize for the late response. I was away the past week. James Dyer E-Commerce Systems

RE: Static template column in DIH

2012-08-30 Thread Dyer, James
You might just be missing . See http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Kiran Jayakumar [mailto:kiranjuni...@gmail.com] Sent: Thursday, August 30, 2012 1:54 PM To

RE: DIH jdbc4.MySQLSyntaxErrorException

2012-08-31 Thread Dyer, James
variable is not resolving to anything so your database gets: where blah='' ...and for some reason it doesn't like being passed an empty string like that. Just a shot in the dark here but maybe? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message

RE: LineEntityProcessor process only one file

2012-08-31 Thread Dyer, James
No, it should process all of the files that get listed. I'm taking a look at the issue you opened, SOLR-3779. This is also similar to SOLR-3307, although that was reported as a bug with "threads" in 3.6, which is no longer a feature in 4.0. James Dyer E-Commerce Systems Ingra

RE: deletedPkQuery not work in solr 3.3

2012-09-06 Thread Dyer, James
ecial_Commands for more info on this technique. Finally, you will want to be aware of https://issues.apache.org/jira/browse/SOLR-2492 , a bug which was fixed in Solr 3.4. DIH doesn't automatically do a commit in some cases if your import only does deletes. You need to issue a commit man

RE: DIH problem

2012-09-22 Thread Dyer, James
ecure/Dashboard.jspa James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mikhail Khludnev [mailto:mkhlud...@griddynamics.com] Sent: Friday, September 21, 2012 12:03 PM To: solr-user@lucene.apache.org Subject: Re: DIH problem Gian, The only way to handle it
