RE: Spellchecking and frequency

2010-07-27 Thread Dyer, James
ike to see yours to compare. James Dyer E-Commerce Systems Ingram Book Company (615) 213-4311 -Original Message- From: Mark Holland [mailto:mark.holl...@zoopla.co.uk] Sent: Tuesday, July 27, 2010 1:04 PM To: solr-user@lucene.apache.org Subject: Re: Spellchecking and frequency Hi, I

RE: SpellCheckComponent question

2010-08-19 Thread Dyer, James
This possibly might be a bug. See http://lucene.472066.n3.nabble.com/Spellcheck-help-td951059.html#a990476 James Dyer E-Commerce Systems Ingram Book Company (615) 213-4311 -Original Message- From: fabritw [mailto:fabr...@gmail.com] Sent: Thursday, August 19, 2010 12:51 PM To: solr-user

Re: How to delete documents from a SOLR cloud / balance the shards in the cloud?

2010-09-10 Thread James Liu
Stephan and all, I am evaluating this like you are. You may want to check http://www.tomkleinpeter.com/2008/03/17/programmers-toolbox-part-3-consistent-hashing/. I would appreciate if others can shed some light on this, too. Bests, James On Fri, Sep 10, 2010 at 6:07 AM, Stephan Raemy wrote

Improvements to SpellCheckComponent "Collate" -- Patch available for v1.4.1 (SOLR-2010)

2010-09-22 Thread Dyer, James
provememnts. Help on applying patches is here: http://wiki.apache.org/solr/HowToContribute#TestingPatches James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311

RE: Need help with spellcheck city name

2010-09-28 Thread Dyer, James
You might want to look at SOLR-2010. This patch works with the "collation" feature, having it test the collations it returns to ensure they'll return hits. So if a user types "san jos" it will know that the combination "san jose" is in the index and "

Using separate Analyzers for querying and indexing.

2010-09-28 Thread James Norton
indexing, but I don't know how to specify that a different Analyzer (the case sensitive one) should be used for queries. Is this possible? Thanks, James

Re: Using separate Analyzers for querying and indexing.

2010-09-28 Thread James Norton
Excellent, exactly what I needed. Thanks, James On Sep 28, 2010, at 4:28 PM, Luke Crouch wrote: > Yeah. You can specify two analyzers in the same fieldType: > > > > ... > > > ... > > > > -L > > On Tue, Sep 28, 2010 at 2:31 PM, James Norton

RE: multi level faceting

2010-10-05 Thread Dyer, James
th...) But I do think this would work in a pinch if need be. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Nguyen, Vincent (CDC/OD/OADS) (CTR) [mailto:v...@cdc.gov] Sent: Tuesday, October 05, 2010 8:22 AM To: solr-user@lucene.apache.org Subj

RE: access control for spellcheck suggestions?

2010-10-07 Thread Dyer, James
is would effectively prevent users from seeing sensitive data in their spell suggestions. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Peter Wolanin [mailto:peter.wola...@acquia.com] Sent: Thursday, October 07, 2010 9:00 AM To: solr-user@lucen

RE: spellcheck component does not work with request handler

2010-10-28 Thread Dyer, James
In your "standard" Search Handler, you have the "last-components" array inside . However, it should be outside as in the "/spell" Search Handler. Try this: explicit spellcheck James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -

RE: spellchecker results not as desired

2010-10-29 Thread Dyer, James
are not considered close enough. You can tweak this by setting spellcheck.accuracy. However, I do not believe this option is available in 1.4. The wiki indicates it is 3.x/4.0 only. For more information, look at the "SpellCheckComponent" page on the wiki. James Dyer E-Commerce Systems I

RE: How does DIH multithreading work?

2010-11-01 Thread Dyer, James
;d be in parallel. This would make sense if maybe your database can only pull back so fast, but then you have an intensive transformer. Maybe adding a thread would make your processing no slower than the db... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Mess

RE: Output Search Result in ADD-XML-Format

2010-11-10 Thread Dyer, James
I'm not sure, but SOLR-1499 might have what you want. https://issues.apache.org/jira/browse/SOLR-1499 James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Chantal Ackermann [mailto:chantal.ackerm...@btelligent.de] Sent: Wednesday, Novemb

Re: Can Solr v1.4 and v4.0 co-exist in Tomcat?

2012-11-20 Thread James Jory
Our setup generally follows these instructions: http://wiki.apache.org/solr/SolrTomcat#Multiple_Solr_Webapps BTW, we're running on Tomcat 6. You'll definitely want to undo your changes to the Tomcat lib directory since you don't want anything Solr-related shared across your web apps. On Nov 2

RE: Spellchecker for multiple sites (and languages?)

2012-11-26 Thread Dyer, James
.ingramcontent.com%3E James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Monday, November 26, 2012 9:32 AM To: solr-user@lucene.apache.org Subject: RE: Spellchecker for multiple sites (and

RE: Spellchecker for multiple sites (and languages?)

2012-11-26 Thread Dyer, James
or want to give the users a message about which words were misspelled, etc, you can do so easily. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: André Schild [mailto:a.sch...@aarboard.ch] Sent: Monday, November 26, 2012 11:24 AM To: solr-us

RE: DIH nested entities don't work

2012-11-27 Thread Dyer, James
t; parameter was incorrectly renamed "cachePk". For 4.0.0, this was renamed back to "cacheKey". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: mroosendaal [mailto:mroosend...@yahoo.com] Sent: Tuesday, November 27, 2012 3

RE: DIH nested entities don't work

2012-12-05 Thread Dyer, James
features in Solr3 & especially Solr4 should make it easier and more efficient though. If you have more questions about faceting, I would start a new discussion thread about it. There are a lot of approaches to solving various problems so you may get a variety of answers. James Dyer E-Comme

RE: Spelling output solr 4

2012-12-07 Thread Dyer, James
I'm not sure what you mean. Can you paste in an example spellcheck response and explain how it differs between the older IndexBasedSpellChecker on 3.1 and the DirectSolrSpellChecker on 4.0 ? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message-

RE: Differentiate between correctly spelled term and mis-spelled term with no corrections

2012-12-07 Thread Dyer, James
http://wiki.apache.org/solr/TermsComponent , which maybe would provide you a viable workaround. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Nalini Kartha [mailto:nalinikar...@gmail.com] Sent: Thursday, December 06, 2012 2:44 PM To: solr-user@lucene.apach

RE: Differentiate between correctly spelled term and mis-spelled term with no corrections

2012-12-07 Thread Dyer, James
t=false" to #toNamedList so that the end user gets a "normal" response back, omitting terms for which there are no suggestions. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Nalini Kartha [mailto:nalinikar...@gmail.com] Sent: F

RE: Need help with delta import

2012-12-10 Thread Dyer, James
ing that your "deltaImportQuery" is only pulling back the ID, it couldn't possibly change the values for fields like "email" and "fname". Make sense? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: umajava [

RE: dataimport.properties not created/updated with solrcloud

2012-12-12 Thread Dyer, James
unk. This should be fixed before 4.1 is released (committing the fix is just waiting on a unit test). A patch is available at https://issues.apache.org/jira/browse/SOLR-4112 . This is only a concern if you are using a nightly snapshot or building Solr yourself from a recent svn checkout. Ja

RE: Differentiate between correctly spelled term and mis-spelled term with no corrections

2012-12-14 Thread Dyer, James
hat you want to do. You might find something else that works better. For whatever you come up with, by all means open a JIRA issue and attach your work as a patch and see where it goes from there. (subscribe to the dev list if you haven't already as that's where these type of discuss

RE: Need help with delta import

2012-12-14 Thread Dyer, James
. The wiki has since been changed. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: umajava [mailto:umaj...@gmail.com] Sent: Friday, December 14, 2012 11:27 AM To: solr-user@lucene.apache.org Subject: RE: Need help with delta import I tried full

RE: Need help with delta import

2012-12-14 Thread Dyer, James
safer to change to "dih." everywhere you use it. I had thought about removing the longer one for 4.0 and then figured most people wouldn't realize it had been depreciated (there's no warning logged, etc) and I didn't want to give people unnecessary problems. I guess ther

RE: Spell Check is not working properly

2012-12-17 Thread Dyer, James
ot;test". In other words, DirectSolrSpellChecker will never make this correction for you.) In practice, I don't think it is fruitful to try and correct anything that needs more than 2 or so edits. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message

RE: "order" question on solr multi value field

2012-12-18 Thread Dyer, James
guarantee that many many users depend on. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Tuesday, December 18, 2012 9:30 AM To: solr-user@lucene.apache.org Subject: Re: "order" questio

RE: "order" question on solr multi value field

2012-12-18 Thread Dyer, James
I suppose using this same logic you can not guarantee the tokens on a stored, single-value field would be stored in the order they arrive either, can you? A multi-valued field is the same as a single-valued field with the positions artifically incremented, so what's the difference? James

RE: "order" question on solr multi value field

2012-12-18 Thread Dyer, James
I agree with you the documentation should be more explicit. I just don't want to give new users the impression that stored fields won't return in the order they are added. This is the behavior and I think a lot of us rely on that today. James Dyer E-Commerce Systems Ingram Content

RE: "order" question on solr multi value field

2012-12-19 Thread Dyer, James
ially optimize this by always adding short metadata fields to a document first and then longer content fields at the end? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Tuesday, December 18, 2012 7:25

RE: Ensuring SpellChecker returns corrections which satisfy fq params for default OR query

2012-12-19 Thread Dyer, James
27;t the problem here (it is an irrelevant term not in the index). So while you get words suggested there are no collations. The individual words would be helpful, but you're not sure because they might all apply to items that do not match "fq=item:in_stock". Is this the p

RE: Ensuring SpellChecker returns corrections which satisfy fq params for default OR query

2012-12-19 Thread Dyer, James
sults=true James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Nalini Kartha [mailto:nalinikar...@gmail.com] Sent: Wednesday, December 19, 2012 2:06 PM To: solr-user@lucene.apache.org Subject: Re: Ensuring SpellChecker returns corrections which satis

RE: dataimport.properties not created/updated with solrcloud

2012-12-19 Thread Dyer, James
not sure which node(s) this gets saved to. You can see the code for this yourself at: http://svn.apache.org/repos/asf/lucene/dev/branches/branch_4x/solr/contrib/dataimporthandler/src/java/org/apache/solr/handler/dataimport/ZKPropertiesWriter.java James Dyer E-Commerce Systems Ingram Content

RE: Ensuring SpellChecker returns corrections which satisfy fq params for default OR query

2012-12-20 Thread Dyer, James
lateParam.mm=100%", it won't insist that the irrelevant terms (or a "corrected" irrelevant term) match anything. On the other hand, it won't assume the query is "Correctly Spelled" just because you got some hits from it (because mm=0 will just cause the misspel

RE: Can DataImportHandler ignore Missing Tags in XML?

2012-12-21 Thread Dyer, James
"catch-all" dynamic field to your schema.xml: ...all your fields go here... This tells it to allow any column name that isn't explicitly defined but to just ignore it. This overrides Solr's default behavior in throwing an exception in such cases. James Dyer E-Commerce Sys

RE: [DIH] Script Transformer: Is there a way to import js file?

2012-12-26 Thread Dyer, James
f it could support it though. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: zakaria benzidalmal [mailto:zakib...@gmail.com] Sent: Wednesday, December 26, 2012 11:00 AM To: solr-user@lucene.apache.org Subject: [DIH] Script Transformer: Is ther

RE: Converting fq params to Filter object

2012-12-27 Thread Dyer, James
t the expense of not getting exact hit-counts. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Nalini Kartha [mailto:nalinikar...@gmail.com] Sent: Thursday, December 27, 2012 1:09 PM To: solr-user@lucene.apache.org Subject: Re: Converting fq para

RE: Converting fq params to Filter object

2012-12-27 Thread Dyer, James
rs applied. DirectSpellChecker#suggestSimilar returns an array of SuggestWord instances that contain the individual words you would want to try. To optimize this, you can use the same approach as in SOLR-3240, implementing a Collector that only looks for 1 document then quits. James Dyer

RE: search features Endeca vs Solr

2013-01-04 Thread Dyer, James
re their sales promotions, etc so as to score promoted results above others, etc. To my knowledge, the closest thing Solr has is QueryElevationComponent, which might do the job for you but it is probably a lot less rich than what Endeca can offer. http://wiki.apache.org/solr/QueryElevationCompone

SolrCloud - shard distribution

2013-01-09 Thread James Thomas
the default value of 1 for 'maxShardsPerNode', I was surprised to see that Solr created two shards on instance #16. I expected that each Solr node (there are 6) would each be assigned one shard from the collection. Is this a bug or expected behavior? Thanks, James

RE: SolrCloud - shard distribution

2013-01-09 Thread James Thomas
n a distinct node. // TODO: add smarter options that look at the current number of cores per // node? // for now we just go random Thanks, James -Original Message- From: Mark Miller [mailto:markrmil...@gmail.com] Sent: Wednesday, January 09, 2013 11:30 AM T

RE: SolrCloud - shard distribution

2013-01-09 Thread James Thomas
Oops, small copy-paste error. Had my i's and j's backwards. Should be: --- slice1, rep2 (i=1,j=2) ==> chooses node[1] --- slice2, rep1 (i=2,j=1) ==> chooses node[1] -Original Message- From: James Thomas [mailto:jtho...@camstar.com] Sent: Wednesday, January 09, 2013

RE: Transformers and Nested entities - order of execution

2013-01-10 Thread Dyer, James
n have its child's lookup key generated by a transformer, but the child cannot even if a cache is being used. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Thursday, Jan

RE: OR query

2013-01-10 Thread Dyer, James
ian AND paisley) or label:(+ian +paisley) I think this is the case with edismax, which somewhat supports boolean queries but with caveats... You can also turn debugQuery on to figure out why things are not matching. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -O

RE: OR query

2013-01-10 Thread Dyer, James
27;re going to have to verify that some documents with 'name_long:"paisley, ian"' also contain that name in either "constituencies", "label", or "office". It could be that things aren't being indexed as you think. James Dyer E-Commer

RE: how to perform a delta-import when related table is updated

2013-01-11 Thread Dyer, James
uot; clauses on the parent query, it will double the number of queries that get issued to determine which documents to update. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: PeterKerk [mailto:vettepa...@hotmail.com] Sent: Friday, January 11, 2

RE: how to perform a delta-import when related table is updated

2013-01-11 Thread Dyer, James
Try adding the "pk" attribute to the parent entity in any of these 4 ways: mailto:vettepa...@hotmail.com] Sent: Friday, January 11, 2013 1:18 PM To: solr-user@lucene.apache.org Subject: RE: how to perform a delta-import when related table is updated Hi James, Ok, so

RE: DataImportHandlerException: Unable to execute query with OPTIM

2013-01-15 Thread Dyer, James
suspect your driver requires it to be >0 (or at least >-1). If you omit "batchSize" entirely, DIH sets it to 500 as a default. Also, setting it to -1 causes DIH to change this to Integer.MIN_VALUE. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Orig

RE: SOLR 4 getting stuck during restart

2013-01-21 Thread Dyer, James
could you open a bug report in JIRA with the "before" and "after" queries so we can try and fix this? https://issues.apache.org/jira/browse/SOLR James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: vijeshnair [mailto:vijeshkn...@gmai

RE: Problems with DataImportHandler in SOLR 1.4.0

2013-01-22 Thread Dyer, James
ur DIH configuration and see if you (or someone on this list) can figure out why DIH might go into a loop. Perhaps this is a bug that needs to be fixed still. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Weigel, Christian [mailto:christian.we

RE: Deletion from database

2013-01-24 Thread Dyer, James
apache.org/solr/DataImportHandlerFaq#fullimportdelta James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: hassancrowdc [mailto:hassancrowdc...@gmail.com] Sent: Thursday, January 24, 2013 12:19 PM To: solr-user@lucene.apache.org Subject: Re: Deletion from database

RE: SOLR 4 getting stuck during restart

2013-01-25 Thread Dyer, James
dictionary to build). Also, be sure you tried "spellcheck.collate=false" and not "spellcheck.collation=false". But do get a thread dump, then open a jira issue with whatever information you can provide and also the pertinent section of the thread dump. James Dyer E-Commerc

RE: Error in DIH after upgrading from 4.0 to 4.1

2013-01-25 Thread Dyer, James
dots: ex. "solrBceDSDriver" instead of "solr.bceDS.driver". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Boris Granveaud [mailto:bo...@granveaud.com] Sent: Friday, January 25, 2013 3:38 AM To: solr-user@lucene.apache.org S

RE: Variable expansion in DIH SimplePropertiesWriter's filename?

2013-01-30 Thread Dyer, James
This is a bug. Can you paste what you've said here into a new JIRA issue? https://issues.apache.org/jira/browse/SOLR James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Jonas Birgander [mailto:jonas.birgan...@prisjakt.nu] Sent: Wednesday, January 30, 2013 4:

RE: DIH and splitBy

2013-01-31 Thread Dyer, James
In your unit test, you have: "" + And also: runner.update("INSERT INTO test VALUES 1, 'foo,bar,baz'"); So you need to decide if you want to delimit with a pipe or a comma. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From:

RE: Solr 4.1.0 not using solrcore.properties ?

2013-02-13 Thread Dyer, James
The code that resolves variables in DIH was refactored extensively in 4.1.0. So if you've got a case where it does not resolve the variables properly, please give the details. We can open a JIRA issue and get this fixed. James Dyer Ingram Content Group (615) 213-4311 -Original Me

RE: suggest only from certain documents

2013-02-13 Thread Dyer, James
The key to get this working is to set "spellcheck.maxCollationTries" > 0. It will generate collations even if there is only 1 term. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Wednesday

RE: Solr 4.1.0 not using solrcore.properties ?

2013-02-14 Thread Dyer, James
Daniel, This bug has already been recorded and hopefully will be fixed in time for 4.2. See https://issues.apache.org/jira/browse/SOLR-4361 . James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Daniel Rijkhof [mailto:daniel.rijk...@gmail.com] Sent: Wednesday

RE: Implement price range filter: DataImportHandler started. Not Initialized. No commands can be run

2013-02-14 Thread Dyer, James
the problem and restart solr. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: PeterKerk [mailto:vettepa...@hotmail.com] Sent: Thursday, February 14, 2013 5:02 AM To: solr-user@lucene.apache.org Subject: Implement price range filter: DataImportHandler started. Not

RE: Implement price range filter: DataImportHandler started. Not Initialized. No commands can be run

2013-02-14 Thread Dyer, James
think the problem might be the escaped quotes and amperstands. Change it to... James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: PeterKerk [mailto:vettepa...@hotmail.com] Sent: Thursday, February 14, 2013 10:01 AM To: solr-user@lucene.apache.org Subject: RE

RE: If we Open Source our platform, would it be interesting to you?

2013-02-20 Thread Dyer, James
of sources and combine that into 1 flat-schema document? I could see your project as something we could use in Solr & Lucene, but with its appeal to more projects than just ours, possibly would not have such a problem attracting developers. James Dyer Ingram Content Group (615) 213-4311

RE: Get page number of searchresult of a pdf in solr

2013-03-01 Thread Dyer, James
Is there an easy (enough) way to do this, storing the page number as a payload on each term? James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Michael Della Bitta [mailto:michael.della.bi...@appinions.com] Sent: Thursday, February 28, 2013 3:33 PM To: solr-user

RE: can we configure spellcheck to be invoked after request processing?

2013-03-01 Thread Dyer, James
sults, the results do affect spellcheck. This is why you should put spellcheck in the "last-components" section of your request handler configuration. This ensures that the query is run before spellcheck. James Dyer Ingram Content Group (615) 213-4311 -Original Message

RE: solr-dih does multiple queries for sub-entities

2013-03-04 Thread Dyer, James
data from other entity types also. Also, it is possible to plug in cache implementations if the default in-memory cache does not scale for you. See https://issues.apache.org/jira/browse/SOLR-2382 . James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: harpax

RE: SOLR - Define fields in DIH configuration file dynamically

2013-03-18 Thread Dyer, James
/ Not sure if this is the answer to your question. If not, give more details. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: kobe.free.wo...@gmail.com [mailto:kobe.free.wo...@gmail.com] Sent: Monday, March 18, 2013 4:03 AM To: solr-user@lucene.apache.org S

RE: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-19 Thread Dyer, James
Can you try including in your request the "shards.qt" parameter? In your case, I think you should set it to "testhandler". See http://wiki.apache.org/solr/SpellCheckComponent?highlight=%28shards\.qt%29#Distributed_Search_Support for a brief discussion. James Dyer Ingra

RE: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-19 Thread Dyer, James
as we can. Based on DSCCT and it being so superficial, I really can't vouch too much for my spell check enhancements working as well with shards as they do with a single index. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Mark Miller [mailto:markrmil...

RE: strange behaviour of wordbreak spellchecker in solr cloud

2013-03-19 Thread Dyer, James
query url and output you get) vs the same query url and output in the distributed scenario. Then, without access to your actual index, it might be possible to come up with a failing unit test. With a failing unit test in hand, we have a good shot at getting a fix. James Dyer Ingram Content Group

RE: $deleteDocByQuery and $deleteDocByID

2012-02-01 Thread Dyer, James
/SOLR-2492 . James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: ejcirel [mailto:ejci...@yahoo.com] Sent: Wednesday, February 01, 2012 10:15 AM To: solr-user@lucene.apache.org Subject: $deleteDocByQuery and $deleteDocByID Does anyone have an exam

RE: spell checking and filtering in the same query

2012-02-09 Thread Dyer, James
kComponent does work with "fq". James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mark Swinson [mailto:mark.swin...@bbc.co.uk] Sent: Thursday, February 09, 2012 7:38 AM To: solr-user@lucene.apache.org Subject: spell checking and filterin

RE: regular expression in solrcore.config to be passed to dataConfig via DataImportHandler

2012-02-09 Thread Dyer, James
I wouldn't feel too bad about this. This is a pretty common gotcha and going forward it would be nice if we can make it easier to parameterize data-config.xml... James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Zajkowski, Rad

RE: spellcheck configuration not providing suggestions or corrections

2012-02-13 Thread Dyer, James
ount" of maybe 10 even if you only want 1 suggestion. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Monday, February 13, 2012 11:08 AM To: solr-user@lucene.apache.org Subject: spellcheck configuratio

RE: spellcheck configuration not providing suggestions or corrections

2012-02-13 Thread Dyer, James
the main field ("itemDesc" in your case) and just use "itemDescSpell" from which to build your dictionary. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Monday, February

RE: is it possible to run deltaimport command with out delta query?

2012-02-16 Thread Dyer, James
any queries to the database. It also can be easier to maintain just 1 set of queries for both full and delta imports. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Shawn Heisey [mailto:s...@elyograg.org] Sent: Thursday, February 16, 2012 10:

RE: Spelling Corrector Algorithm

2012-03-01 Thread Dyer, James
index along with any other query parameters (other keywords, filters, etc), letting you know which suggestions are going to truly return hits in context (and how many). 4. Try Jaro-Winkler (as mentioned above). Hope this helps. But in the end, especially with 1-word queries, I doubt even

RE: Need tokenization that finds part of stringvalue

2012-03-01 Thread Dyer, James
r/HowToContribute#Review.2BAC8-Improve_Existing_Patches I would appreiate it if you tried these out to provide feedback on the JIRA issues as to how it works for you and also how it can be improved. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Walter Underwood

RE: Using multiple DirectSolrSpellcheckers for a query

2012-03-07 Thread Dyer, James
eck=true {lotsa spellcheck params here} &spellcheck.q=docUnstemmed:(eun AND jump) ...now it won't both correct and stem. The corrections would need to match the raw keyword. Is this closer to what you want? One other note here...It looks like your "docUnstemmed" and "s

RE: How to stop processing of DataImportHandler in EventListener

2012-03-07 Thread Dyer, James
uff like this until we can clean up some of DIHs more fundamental shortcomings. (DIH is great for many use cases, but the code has suffered neglect and needs a facelift in my opinion) James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mikhail Khl

RE: DIH Delta index takes much time

2012-03-07 Thread Dyer, James
dex. (see http://wiki.apache.org/solr/UpdateCSV for detailed instructions how to do this). If the query runs just fine when run manually, AND if the CSV loader test is fast too, then maybe you've stumbled on a new DIH bug nobody has reported before? James Dyer E-Commerce Systems Ingra

RE: Solr DIH and $deleteDocById

2012-03-09 Thread Dyer, James
bably stumbled on a new bug related to SOLR-2492. In any case, the workaround (probably) is to manually issue a commit after doing your deletes. Or, combine your deletes with add/updates in the same DIH run and it should commit automatically as configured. James Dyer E-Commerce Systems

RE: DIH - FileListEntityProcessor reading from Multiple Disk Directories

2012-03-09 Thread Dyer, James
Did you try setting "baseDir" to the root directory and "recursive" to true ? (see http://wiki.apache.org/solr/DataImportHandler#FileListEntityProcessor for more information). James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 From: mike.rawl...@gxs.c

RE: Using multiple DirectSolrSpellcheckers for a query

2012-03-13 Thread Dyer, James
ature doesn't exist now but I don't think the development effort would be very big. A feature like this would also give users the ability to remove boost parameters as these are useless for spellchecking purposes and would make the check-queries slower. James Dyer E-Commerce Systems Ingr

RE: index size with replication

2012-03-14 Thread Dyer, James
uot;, then optimize is (probably) not going to do anything helpful. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Wednesday, March 14, 2012 4:25 PM To: solr-user@lucene.apache.org Subject: Re: i

RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread Dyer, James
Taking a quick look at the code, it seems this exception could have been thrown for four reasons: (see org.apache.solr.handler.dataimport.ScriptTransformer#initEngine) 1. Your JRE doesn't have class "javax.script.ScriptEngineManager" (pre 1.6, loaded here via reflection) 2. Your JRE doesn't

RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread Dyer, James
ast message). In case #2 or #3 it will tell you the "language" you specified isn't supported. In case #4, it will tell you the script itself is invalid. See https://issues.apache.org/jira/browse/SOLR-3260 . James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 ---

RE: possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-27 Thread Dyer, James
post it on JIRA? See http://wiki.apache.org/solr/HowToContribute for more information. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: tom [mailto:dev.tom.men...@gmx.net] Sent: Tuesday, March 27, 2012 2:31 AM To: solr-user@lucene.apache.org Sub

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread Dyer, James
your spellcheck configuration. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Sent: Tuesday, March 27, 2012 9:07 AM To: solr-user@lucene.apache.org Subject: preventing words from being indexed in spellcheck

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread Dyer, James
Assuming you're just using this field for spellcheck and not for queries, then it doesn't matter. But the correct way to do it is to have it in both places. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@h

RE: dataImportHandler: delta query fetching data, not just ids?

2012-03-28 Thread Dyer, James
n't consider the "command=full-import&clean=false" method a workaround but just a different way to tackle the same problem. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: janne mattila [mailto:jannepostilis...@gmail.com]

RE: DataImportHandler: backups prior to full-import

2012-03-28 Thread Dyer, James
I don't know of any effort out there to have DIH trigger a backup automatically. However, you can set the replication handler to automatically backup after each commit. This might solve your problem if you aren't committing frequently. James Dyer E-Commerce Systems Ingram Content

RE: DataImportHandler: backups prior to full-import

2012-03-28 Thread Dyer, James
poll the replicationHandler with /replicaton?command=details and scrape the response until timestamp_here changes to a new timestamp. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Artem Shnayder [mailto:artem@gmail.com] Sent: Wedn

RE: dataImportHandler: delta query fetching data, not just ids?

2012-03-29 Thread Dyer, James
r/DataImportHandler#Special_Commands . James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: janne mattila [mailto:jannepostilis...@gmail.com] Sent: Thursday, March 29, 2012 12:45 AM To: solr-user@lucene.apache.org Subject: Re: dataImportHandler: delta query fetc

RE: Maximum Open Cursors using JdbcDataSource and cacheImpl

2012-04-18 Thread Dyer, James
Keith, Can you supply your data-config.xml ? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Keith Naas [mailto:keithn...@dswinc.com] Sent: Wednesday, April 18, 2012 11:43 AM To: solr-user@lucene.apache.org Subject: Maximum Open Cursors using

RE: Performance problem with DIH in solr 3.3

2012-04-23 Thread Dyer, James
See this page for an alternate way to use DIH for Delta updates that does not generate n+1 Selects: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Pravin Agrawal

RE: JDBC import yields no data

2012-04-24 Thread Dyer, James
n your config) should say "indexing complete" and also the number of documents it imported. Also, if you have "commit=false" in your config, it won't issue a commit so you won't see the documents. If it fails, your servlet container's logs should have a stack trace

RE: JDBC import yields no data

2012-04-24 Thread Dyer, James
s its made, how many rows read & documents indexed. Keep refreshing the page until it is done. When it finishes, you should get either a Success or a Failure message. Is it saying success or failure? Also how many documents does it say it indexed? James Dyer E-Commerce Systems Ingram Cont

RE: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-07 Thread Dyer, James
.jar Please let me know if this solves your problems with DIH Caching, giving you the functionality you had with 3.5 and prior. Your feedback is greatly appreciatd. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: not interesting [mailto:dye

RE: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-08 Thread Dyer, James
Kellen, I appreciate your trying this out. Is there any way you can provide your data-config.xml file? I'd really like to get to the bottom of this. Thanks. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: not interesting [mailto:dy

RE: Indexing data from pdf

2012-05-11 Thread Dyer, James
It looks like maybe you do not have "apache-solr-dataimporthandler-extras.jar" in your classpath. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: anarchos78 [mailto:rigasathanasio...@hotmail.com] Sent: Friday, May 11, 2012 11:00 A

<    1   2   3   4   5   6   7   8   9   >