Re: Unknown query parser 'terms' with TermsComponent defined

2015-08-25 Thread P Williams
Thanks Hoss! It's obvious what the problem(s) are when you lay it all out that way. On Tue, Aug 25, 2015 at 12:14 PM, Chris Hostetter wrote: > > 1) The "terms" Query Parser (TermsQParser) has nothing to do with the > "TermsComponent" (the first is for querying many distinct terms, the > latter is
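
To make the distinction in the quoted reply concrete, here is a minimal Python sketch that builds request parameters for each feature. The field name `id`, the sample values, and both helper function names are assumptions for illustration only. The `{!terms f=...}` local-params form is only understood by Solr releases that ship the terms query parser; my understanding is that it arrived after 4.9, which would be consistent with the "Unknown query parser" error in this thread.

```python
from urllib.parse import urlencode


def terms_qparser_params(field, values):
    """Params for a filter query matching docs whose `field` holds any of `values`.

    Uses the terms query parser syntax: {!terms f=<field>}v1,v2,...
    """
    return {"q": "*:*", "fq": "{!terms f=%s}%s" % (field, ",".join(values))}


def terms_component_params(field, prefix):
    """Params for a TermsComponent request listing indexed terms in `field`
    that start with `prefix`. This is unrelated to the query parser above."""
    return {"terms": "true", "terms.fl": field, "terms.prefix": prefix}


if __name__ == "__main__":
    # Hypothetical field and values, shown URL-encoded as they would be sent.
    print(urlencode(terms_qparser_params("id", ["doc1", "doc2"])))
    print(urlencode(terms_component_params("id", "doc")))
```

The first set of parameters goes to an ordinary search handler; the second only has an effect on a handler that has the TermsComponent registered.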

Unknown query parser 'terms' with TermsComponent defined

2015-08-25 Thread P Williams
Hi, We've encountered a strange situation, and I'm hoping someone might be able to shed some light. We're using Solr 4.9 deployed in Tomcat 7. We build a query that has these params: 'params'=>{ 'fl'=>'id', 'sort'=>'system_create_dtsi asc', 'indent'=>'true', 'start'=>'0',

Re: Advice on highlighting

2014-09-12 Thread P Williams
Hi Craig, Have you seen SOLR-4722 (https://issues.apache.org/jira/browse/SOLR-4722)? This was my attempt at something similar. Regards, Tricia On Fri, Sep 12, 2014 at 2:23 PM, Craig Longman wrote: > In order to take our Solr usage to the next step, we really need to > improve its highlighting

How to sync lib directory in SolrCloud?

2014-07-31 Thread P Williams
Hi, I have an existing collection that I'm trying to add to a new SolrCloud. This collection has all the normal files in conf but also has a lib directory to support the filters schema.xml uses. wget https://github.com/projectblacklight/blacklight-jetty/archive/v4.9.0.zip unzip v4.9.0.zip I add

Re: Changing Cache Properties after Indexing

2014-01-17 Thread P Williams
You're both completely right. There isn't any issue with indexing with large cache settings. I ran the same indexing job five times, twice with large cache and twice with the default values. I threw out the first job because no matter if it's cached or uncached it runs ~2x slower. This must have

Changing Cache Properties after Indexing

2014-01-13 Thread P Williams
Hi, I've gone through steps for tuning my cache sizes and I'm very happy with the results of load testing. Unfortunately the cache settings for querying are not optimal for indexing - and in fact slow it down quite a bit. I've made the caches small by default for the indexing stage and then want

Re: DataImport Handler, writing a new EntityProcessor

2013-12-18 Thread P Williams
Hi Mathias, I'd recommend testing one thing at a time. See if you can get it to work for one image before you try a directory of images. Also try testing using the solr-testframework using your ide (I use Eclipse) to debug rather than your browser/print statements. Hopefully that will give you

Re: Using data-config.xml from DIH in SolrJ

2013-11-14 Thread P Williams
nd > uses it through SolrJ. You can use Tika directly in SolrJ if you > need to parse structured documents though, see: > http://searchhub.org/2012/02/14/indexing-with-solrj/ > > Yep, you're going to be kind of reinventing the wheel a bit I'm > afraid. > > Best,

Using data-config.xml from DIH in SolrJ

2013-11-13 Thread P Williams
Hi All, I'm building a utility (Java jar) to create SolrInputDocuments and send them to an HttpSolrServer using the SolrJ API. The intention is to find an efficient way to create documents from a large directory of files (where multiple files make one Solr document) and send them to a remote Solr in

Re: XPathEntityProcessor nested in TikaEntityProcessor query null exception

2013-09-30 Thread P Williams
tion$HttpInputStream cannot be cast > to java.io.Reader > > > > > > On 28. Sep 2013, at 1:39 AM, Andreas Owen wrote: > > > ok i see what your getting at but why doesn't the following work: > > > > > > > > > > i removed th

Re: XPathEntityProcessor nested in TikaEntityProcessor query null exception

2013-09-27 Thread P Williams
ndomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36) at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:358) at java.lang.Thread.run(Thread.java:722) On Fri, Sep 27, 2013 at 3:55 AM, Andreas Owen wrote: > i removed

Re: XPathEntityProcessor nested in TikaEntityProcessor query null exception

2013-09-26 Thread P Williams
Hi, Haven't tried this myself but maybe try leaving out the FieldReaderDataSource entirely. From my quick searching looks like it's tied to SQL. Did you try copying the http://wiki.apache.org/solr/TikaEntityProcessor Advanced Parsing example exactly? What happens when you leave out FieldReaderD

Re: DIH field defaults or re-assigning field values

2013-09-24 Thread P Williams
called if there are rows to transform and I needed to know when the default was appropriate to set in the root entity. Thanks for your suggestions Alex. Cheers, Tricia On Wed, Sep 18, 2013 at 1:19 PM, P Williams wrote: > Hi All, > > I'm using the DataImportHandler to import do

DIH field defaults or re-assigning field values

2013-09-18 Thread P Williams
Hi All, I'm using the DataImportHandler to import documents to my index. I assign one of my document's fields by using a sub-entity from the root to look for a value in a file. I've got this part working. If the value isn't in the file or the file doesn't exist I'd like the field to be assigned

Re: How to Manage RAM Usage at Heavy Indexing

2013-09-09 Thread P Williams
Hi, I've been seeing the same thing on CentOS with high physical memory use with low JVM-Memory use. I came to the conclusion that this was expected behaviour. Using top I noticed that my solr user's java process has Virtual memory allocated of about twice the size of the index, actual is within

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread P Williams
Hi Tony, Have you seen the TermVectorComponent? It will return the TermVectors for the documents in your result set (note that the rows parameter matters if you want results for the whole set, the default is 10). TermVectors also must be stored fo
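
As a rough illustration of the point about `rows`, the following Python sketch assembles parameters for a term-vector request. The helper name, the field name `text`, and the query are assumptions for illustration; the request would also need to go to a handler that has the TermVectorComponent configured (the stock example config exposes one, but your setup may differ).

```python
from urllib.parse import urlencode


def term_vector_params(query, rows, field):
    """Build params asking for term frequencies of `field` for each returned doc.

    `rows` is set explicitly because term vectors are only returned for the
    documents in the current page of results, and the default page size is 10.
    """
    return {
        "q": query,
        "rows": str(rows),
        "fl": "id",
        "tv": "true",      # switch the component on
        "tv.tf": "true",   # include per-document term frequency
        "tv.fl": field,    # restrict term vectors to this field
    }


if __name__ == "__main__":
    # Hypothetical query and field name.
    print(urlencode(term_vector_params("title:solr", 100, "text")))
```

Summing the returned frequencies across documents would still have to happen on the client side.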

Re: SolrEntityProcessor doesn't grok responseHeader tag in Ancient Solr 1.2 source

2013-04-23 Thread P Williams
> Erik > > > On Apr 23, 2013, at 13:41 , P Williams wrote: > > > Hi, > > > > I'd like to use the SolrEntityProcessor to partially migrate an old index > > to Solr 4.1. The source is pretty old (dated 2006-06-10 16:05:12Z)... > > mayb

SolrEntityProcessor doesn't grok responseHeader tag in Ancient Solr 1.2 source

2013-04-23 Thread P Williams
Hi, I'd like to use the SolrEntityProcessor to partially migrate an old index to Solr 4.1. The source is pretty old (dated 2006-06-10 16:05:12Z)... maybe Solr 1.2? My data-config.xml is based on the SolrEntityProcessor example a

Re: How do I recover the position and offset a highlight for solr (4.1/4.2)?

2013-04-16 Thread P Williams
Hi, It doesn't have the offset information, but check out my patch https://issues.apache.org/jira/browse/SOLR-4722 which outputs the position of each term that's been matched. I'm eager to get some feedback on this approach and any improvements that might be suggested. Cheers, Tricia On Wed, Ma

Re: Results Order When Performing Wildcard Query

2013-04-09 Thread P Williams
wrote: > On 4/9/2013 12:08 PM, P Williams wrote: > >> I wrote a test of my application which revealed a Solr oddity (I think). >> The test which I wrote on Windows 7 and makes use of the >> solr-test-framework<http://**lucene.apache.org/solr/4_1_0/** >> solr-te

Results Order When Performing Wildcard Query

2013-04-09 Thread P Williams
Hi, I wrote a test of my application which revealed a Solr oddity (I think). The test, which I wrote on Windows 7 and which makes use of the solr-test-framework, fails under Ubuntu 12.04 because the Solr results I expected for a wildcard

Re: Highlighting data stored outside of Solr

2012-12-17 Thread P Williams
Your problem seems really similar to "It should be possible to highlight external text" in JIRA. Tricia [https://issues.apache.org/jira/browse/SOLR-1397] On Tue, Dec 11, 2012 at 12:48 PM, Michael Ryan wrote: > Has anyone ever attempted to highli

Re: How does Solr know which relative paths to use?

2012-10-16 Thread P Williams
Hi Dotan, It seems that the examples now use Multiple Cores by default. If your test server is based on the stock example, you should see a solr.xml file in your CWD path which is how Solr knows about the relative paths. There should also be a README.txt fil

Re: Using

2012-10-16 Thread P Williams
ricia On Mon, Oct 15, 2012 at 11:19 AM, P Williams wrote: > Hi, > > Thanks for the suggestions. Didn't work for me :( > > I'm calling > conf="test->default"/> > > which depends on org.eclipse.jetty:jetty-server > which depends on org.eclip

Re: Using

2012-10-15 Thread P Williams
Apologies, there was a typo in my last message. org.eclipse.jetty.orbit:jettty-servlet should have been org.eclipse.jetty.orbit:javax.servlet On Mon, Oct 15, 2012 at 11:19 AM, P Williams wrote: > Hi, > > Thanks for the suggestions. Didn't work for me :( > > I

Re: Using

2012-10-15 Thread P Williams
.org/maven2/org/eclipse/jetty/orbit/javax.servlet/3.0.0.v201112011016/javax.servlet-3.0.0.v201112011016.jar hence my troubles. I'm an IVY newbie so maybe there is something I'm missing here? Is there another 'conf' value other than 'default' I can use? Thanks,

Re: Solr - Tika(?) memory leak

2012-01-16 Thread P Williams
Hi, I'm not sure which version of Solr/Tika you're using but I had a similar experience which turned out to be the result of a design change to PDFBox. https://issues.apache.org/jira/browse/SOLR-2886 Tricia On Sat, Jan 14, 2012 at 12:53 AM, Wayne W wrote: > Hi, > > we're using Solr running on

Re: avoid overwrite in DataImportHandler

2011-12-08 Thread P Williams
t's related to your problem at all. > > Best > Erick > > On Wed, Dec 7, 2011 at 3:27 PM, P Williams > wrote: > > Hi, > > > > I've wondered the same thing myself. I feel like the "clean" parameter > has > > something to do with it

Re: avoid overwrite in DataImportHandler

2011-12-07 Thread P Williams
Hi, I've wondered the same thing myself. I feel like the "clean" parameter has something to do with it but it doesn't work as I'd expect either. Thanks in advance to anyone who can answer this question. *clean* : (default 'true'). Tells whether to clean up the index before the indexing is start

Re: Stream still in memory after tika exception? Possible memoryleak?

2011-11-03 Thread P Williams
Hi All, I'm experiencing a similar problem to the others in the thread. I've recently upgraded from apache-solr-4.0-2011-06-14_08-33-23.war to apache-solr-4.0-2011-10-14_08-56-59.war and then apache-solr-4.0-2011-10-30_09-00-00.war to index ~5300 PDFs, of various sizes, using the TikaEntityProce

Re: DIH doesn't handle bound namespaces?

2011-11-03 Thread P Williams
Hi Gary, From http://wiki.apache.org/solr/DataImportHandler#Usage_with_XML.2BAC8-HTTP_Datasource *It does not support namespaces, but it can handle xmls with namespaces. When you provide the xpath, just drop the namespace and give the rest (eg if the tag is '' the mapping should just contain 's
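
The "drop the namespace" advice quoted above can be sketched as a small Python helper that strips the prefix from each step of a simple XPath before it goes into a DIH field mapping. The function name and the `oai:`/`dc:` prefixes are hypothetical examples, not taken from the thread, and the regular expression only handles plain slash-separated paths.

```python
import re


def strip_namespace_prefixes(xpath):
    """Remove a 'prefix:' that directly follows a '/' in a simple XPath.

    Attribute steps such as @xml:lang are left alone because the prefix
    there follows '@' rather than '/'.
    """
    return re.sub(r"(?<=/)[A-Za-z_][\w.-]*:", "", xpath)


if __name__ == "__main__":
    # Hypothetical namespaced path as it might appear in the source XML.
    print(strip_namespace_prefixes("/oai:record/dc:title"))
```

Paths that contain no prefix pass through unchanged, so the helper can be applied to every mapping without special-casing.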

JSON and DataImportHandler

2010-07-16 Thread P Williams
Hi All, Has anyone gotten the DataImportHandler to work with JSON as input? Is there an even easier alternative to DIH? Could you show me an example? Many thanks, Tricia