UpdateRequestProcessor - processCommit

2011-11-23 Thread Matthew Parker
TWIMC: I creating a custom UpdateRequestProcessor chain, where I need to commit records to a database once the import process has completed. I'm assuming the processCommit method is called for each UpdateRequestProcessor chain class when the records are being commited to the Lucene index. I'm de

Re: PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-24 Thread Matthew Parker
I would try something like the following:

Re: How can I index this?

2012-01-18 Thread Matthew Parker
I just started trying Apache ManifoldCF, which has a SharePoint connector that appears to integrate through Sharepoint's web services. Nutch also has a SharePoint connector, and it can publish documents into SOLR for indexing. On Wed, Jan 18, 2012 at 3:34 PM, ahammad wrote: > That would certain

Re: nutch in solr

2012-02-05 Thread Matthew Parker
Doesn't tomcat run on port 8080, and not port 8983? Or did you change the tomcat's default port to 8983? On Feb 5, 2012 5:17 AM, "alessio crisantemi" wrote: > Hi All, > I have some problems with integration of Nutch in Solr and Tomcat. > > I follo Nutch tutorial for integration and now, I can cra

Re: nutch in solr

2012-02-05 Thread Matthew Parker
8983. > .. > > 2012/2/5 Matthew Parker > > > Doesn't tomcat run on port 8080, and not port 8983? Or did you change the > > tomcat's default port to 8983? > > On Feb 5, 2012 5:17 AM, "alessio crisantemi" < > alessio.crisant...@gmail.com > >

TIKA Errors Importing MS Word Documents into SOLR Cloud

2012-02-26 Thread Matthew Parker
I tried to import some documents into SOLR Cloud using Apache Manifold. TIKA started throwing exceptions for various documents The exception reads like the following: org.apache.solr.common.SolrException at org.apache.solr.handler.extraction.ExtractionDocumentLoader.load( ExtractingDocumentLoade

Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
TWIMC: Environment = Apache SOLR rev-1236154 Apache Zookeeper 3.3.4 Windows 7 JDK 1.6.0_23.b05 I have built a SOLR Cloud instance with 4 nodes using the embeded Jetty servers. I created a 3 node zookeeper ensemble to manage the solr configuration data. All the instances run on one serve

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
t; > Are you using a custom index chain or anything out of the ordinary? > > > - Mark > > On Feb 27, 2012, at 12:26 PM, Matthew Parker wrote: > > > TWIMC: > > > > Environment > > = > > Apache SOLR rev-1236154 > > Apache Zookeeper 3.3.4

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
2012, at 2:22 PM, Matthew Parker wrote: > > > Thanks for your reply Mark. > > > > I believe the build was towards the begining of the month. The > > solr.spec.version is 4.0.0.2012.01.10.38.09 > > > > I cannot access the clusterstate.json contents. I clicked on i

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
;MACHINE1:8081_solr","base_url":" http://MACHINE1:8081/solr"}"; SHARD2 (ephemeral v=0)"{"core":"","node_name":"MACHINE1:8082_solr","base_url":" http://MACHINE1:8082/solr"}"; /OVERSEER_ELE

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
an individual node with distrib=false to check that. > > If you build is a month old, I'd honestly recommend you try upgrading as > well. > > - Mark > > On Feb 27, 2012, at 3:34 PM, Matthew Parker wrote: > > > Here is most of the cluster state: > > > &

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-28 Thread Matthew Parker
seems the windows from 8081 and 8083 move responding to the query but there are no results. if the queries run on 8081/8082 or 8081/8084 then results come back ok. The query is nothing more than: q=*:* Regards, Matt On Mon, Feb 27, 2012 at 9:26 PM, Matthew Parker < mpar...@apogeeintegration.

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
something we are missing here... > > Any info you can offer might help. > > - Mark > > On Feb 28, 2012, at 1:00 PM, Matthew Parker wrote: > > > Mark, > > > > I got the codebase from the 2/26/2012, and I got the same inconsistent > > results. > > >

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
I tried running SOLR Cloud with the default number of shards (i.e. 1), and I get the same results. On Wed, Feb 29, 2012 at 10:46 AM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > Mark, > > Nothing appears to be wrong in the logs. I wiped the indexes and imported

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
I also took out my requestHandler and used the standard /update/extract handler. Same result. On Wed, Feb 29, 2012 at 11:47 AM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > I tried running SOLR Cloud with the default number of shards (i.e. 1), and > I get the same results

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
Wed, Feb 29, 2012 at 12:25 PM, Sami Siren wrote: > On Wed, Feb 29, 2012 at 7:03 PM, Matthew Parker > wrote: > > I also took out my requestHandler and used the standard /update/extract > > handler. Same result. > > How did you install/start the system this time? The same

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
e instance with 2 shards works after waiting for the system to sync. Regards, Matt On Wed, Feb 29, 2012 at 12:03 PM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > I also took out my requestHandler and used the standard /update/extract > handler. Same result. > >

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
rsion_ field, there is something to track down here for > sure. > > On Wed, Feb 29, 2012 at 1:15 PM, Matthew Parker < > mpar...@apogeeintegration.com> wrote: > > > Mark/Sami > > > > I ran the system with 3 zookeeper nodes, 2 solr cloud nodes, and left > &

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
tions you are making to solrconfig? > > On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote: > > > Added it back in. I still get the same result. > > > > On Wed, Feb 29, 2012 at 10:09 PM, Mark Miller > wrote: > > Do you have a _version_ field in your schema? I actua

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
t 3:35 PM, Mark Miller wrote: > > > Any other customizations you are making to solrconfig? > > > > On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote: > > > >> Added it back in. I still get the same result. > >> > >> On Wed, Feb 29, 2012 at 10:09

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-02 Thread Matthew Parker
I've ensured the SOLR data subdirectories and files were completed cleaned out, but the issue still occurs. On Fri, Mar 2, 2012 at 9:06 AM, Erick Erickson wrote: > Matt: > > Just for paranoia's sake, when I was playing around with this (the > _version_ thing was one of my problems too) I removed

Re: XSLT Response Writer and content transformation

2012-03-05 Thread Matthew Parker
You can embed custom Java functions in XSLT: http://cafeconleche.org/books/xmljava/chapters/ch17s03.html On Mon, Mar 5, 2012 at 4:27 AM, darul wrote: > Hello, > > Using native XSLT Response Writer, we may need to alter content before > processing xml solr output as a RSS Feed. > > Example (tri

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-08 Thread Matthew Parker
All, I recreated the cluster on my machine at home (Windows 7, Java 1.6.0.23, apache-solr-4.0-2012-02-29_09-07-30) , sent some document through Manifold using its crawler, and it looks like it's replicating fine once the documents are committed. This must be related to my environment somehow. Tha

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-16 Thread Matthew Parker
I'm still having issues replicating in my work environment. Can anyone explain how the replication mechanism works? Is it communicating across ports or through zookeeper to manager the process? On Thu, Mar 8, 2012 at 10:57 PM, Matthew Parker < mpar...@apogeeintegration.com> wr

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
ost attribute if I want to > setup a cluster between my macbook and linux box. > > Each nodes talks to ZooKeeper to learn about the other nodes, including > their addresses. Recovery is then done node to node using the appropriate > addresses. > > > - Mark Miller > luc

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
The cluster is running on one machine. On Sun, Mar 18, 2012 at 2:07 PM, Mark Miller wrote: > From every node in your cluster you can hit http://MACHINE1:8084/solr in > your browser and get a response? > > On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: > > > My cloud i

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
Sun, Mar 18, 2012 at 2:07 PM, Mark Miller wrote: > From every node in your cluster you can hit http://MACHINE1:8084/solr in > your browser and get a response? > > On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: > > > My cloud instance finally tried to sync. It looks like it&

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
I had tried importing data from Manifold, and one document threw a Tika Exception. If I shut everything down and restart SOLR cloud, the system sync'd on startup. Could extraction errors be the issue? On Sun, Mar 18, 2012 at 2:50 PM, Matthew Parker < mpar...@apogeeintegration.com>

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
That idea was short lived. I excluded the document. The cluster isn't syncing even after shutting everything down and restarting. On Sun, Mar 18, 2012 at 2:58 PM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > I had tried importing data from Manifold, and one documen