TWIMC:
I creating a custom UpdateRequestProcessor chain, where I need to commit
records to a database once the import process has completed.
I'm assuming the processCommit method is called for each
UpdateRequestProcessor chain class when the records are being commited to
the Lucene index.
I'm de
I would try something like the following:
I just started trying Apache ManifoldCF, which has a SharePoint connector
that appears to integrate through Sharepoint's web services.
Nutch also has a SharePoint connector, and it can publish documents into
SOLR for indexing.
On Wed, Jan 18, 2012 at 3:34 PM, ahammad wrote:
> That would certain
Doesn't tomcat run on port 8080, and not port 8983? Or did you change the
tomcat's default port to 8983?
On Feb 5, 2012 5:17 AM, "alessio crisantemi"
wrote:
> Hi All,
> I have some problems with integration of Nutch in Solr and Tomcat.
>
> I follo Nutch tutorial for integration and now, I can cra
8983.
> ..
>
> 2012/2/5 Matthew Parker
>
> > Doesn't tomcat run on port 8080, and not port 8983? Or did you change the
> > tomcat's default port to 8983?
> > On Feb 5, 2012 5:17 AM, "alessio crisantemi" <
> alessio.crisant...@gmail.com
> >
I tried to import some documents into SOLR Cloud using Apache Manifold.
TIKA started throwing exceptions for various documents
The exception reads like the following:
org.apache.solr.common.SolrException
at org.apache.solr.handler.extraction.ExtractionDocumentLoader.load(
ExtractingDocumentLoade
TWIMC:
Environment
=
Apache SOLR rev-1236154
Apache Zookeeper 3.3.4
Windows 7
JDK 1.6.0_23.b05
I have built a SOLR Cloud instance with 4 nodes using the embeded Jetty
servers.
I created a 3 node zookeeper ensemble to manage the solr configuration data.
All the instances run on one serve
t;
> Are you using a custom index chain or anything out of the ordinary?
>
>
> - Mark
>
> On Feb 27, 2012, at 12:26 PM, Matthew Parker wrote:
>
> > TWIMC:
> >
> > Environment
> > =
> > Apache SOLR rev-1236154
> > Apache Zookeeper 3.3.4
2012, at 2:22 PM, Matthew Parker wrote:
>
> > Thanks for your reply Mark.
> >
> > I believe the build was towards the begining of the month. The
> > solr.spec.version is 4.0.0.2012.01.10.38.09
> >
> > I cannot access the clusterstate.json contents. I clicked on i
;MACHINE1:8081_solr","base_url":"
http://MACHINE1:8081/solr"}";
SHARD2 (ephemeral
v=0)"{"core":"","node_name":"MACHINE1:8082_solr","base_url":"
http://MACHINE1:8082/solr"}";
/OVERSEER_ELE
an individual node with distrib=false to check that.
>
> If you build is a month old, I'd honestly recommend you try upgrading as
> well.
>
> - Mark
>
> On Feb 27, 2012, at 3:34 PM, Matthew Parker wrote:
>
> > Here is most of the cluster state:
> >
> &
seems the windows from 8081 and 8083 move
responding to the query but there are no results.
if the queries run on 8081/8082 or 8081/8084 then results come back ok.
The query is nothing more than: q=*:*
Regards,
Matt
On Mon, Feb 27, 2012 at 9:26 PM, Matthew Parker <
mpar...@apogeeintegration.
something we are missing here...
>
> Any info you can offer might help.
>
> - Mark
>
> On Feb 28, 2012, at 1:00 PM, Matthew Parker wrote:
>
> > Mark,
> >
> > I got the codebase from the 2/26/2012, and I got the same inconsistent
> > results.
> >
>
I tried running SOLR Cloud with the default number of shards (i.e. 1), and
I get the same results.
On Wed, Feb 29, 2012 at 10:46 AM, Matthew Parker <
mpar...@apogeeintegration.com> wrote:
> Mark,
>
> Nothing appears to be wrong in the logs. I wiped the indexes and imported
I also took out my requestHandler and used the standard /update/extract
handler. Same result.
On Wed, Feb 29, 2012 at 11:47 AM, Matthew Parker <
mpar...@apogeeintegration.com> wrote:
> I tried running SOLR Cloud with the default number of shards (i.e. 1), and
> I get the same results
Wed, Feb 29, 2012 at 12:25 PM, Sami Siren wrote:
> On Wed, Feb 29, 2012 at 7:03 PM, Matthew Parker
> wrote:
> > I also took out my requestHandler and used the standard /update/extract
> > handler. Same result.
>
> How did you install/start the system this time? The same
e instance with 2 shards
works after waiting for the system to sync.
Regards,
Matt
On Wed, Feb 29, 2012 at 12:03 PM, Matthew Parker <
mpar...@apogeeintegration.com> wrote:
> I also took out my requestHandler and used the standard /update/extract
> handler. Same result.
>
>
rsion_ field, there is something to track down here for
> sure.
>
> On Wed, Feb 29, 2012 at 1:15 PM, Matthew Parker <
> mpar...@apogeeintegration.com> wrote:
>
> > Mark/Sami
> >
> > I ran the system with 3 zookeeper nodes, 2 solr cloud nodes, and left
> &
tions you are making to solrconfig?
>
> On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote:
>
> > Added it back in. I still get the same result.
> >
> > On Wed, Feb 29, 2012 at 10:09 PM, Mark Miller
> wrote:
> > Do you have a _version_ field in your schema? I actua
t 3:35 PM, Mark Miller wrote:
>
> > Any other customizations you are making to solrconfig?
> >
> > On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote:
> >
> >> Added it back in. I still get the same result.
> >>
> >> On Wed, Feb 29, 2012 at 10:09
I've ensured the SOLR data subdirectories and files were completed cleaned
out, but the issue still occurs.
On Fri, Mar 2, 2012 at 9:06 AM, Erick Erickson wrote:
> Matt:
>
> Just for paranoia's sake, when I was playing around with this (the
> _version_ thing was one of my problems too) I removed
You can embed custom Java functions in XSLT:
http://cafeconleche.org/books/xmljava/chapters/ch17s03.html
On Mon, Mar 5, 2012 at 4:27 AM, darul wrote:
> Hello,
>
> Using native XSLT Response Writer, we may need to alter content before
> processing xml solr output as a RSS Feed.
>
> Example (tri
All,
I recreated the cluster on my machine at home (Windows 7, Java 1.6.0.23,
apache-solr-4.0-2012-02-29_09-07-30) , sent some document through Manifold
using its crawler, and it looks like it's replicating fine once the
documents are committed.
This must be related to my environment somehow. Tha
I'm still having issues replicating in my work environment. Can anyone
explain how the replication mechanism works? Is it communicating across
ports or through zookeeper to manager the process?
On Thu, Mar 8, 2012 at 10:57 PM, Matthew Parker <
mpar...@apogeeintegration.com> wr
ost attribute if I want to
> setup a cluster between my macbook and linux box.
>
> Each nodes talks to ZooKeeper to learn about the other nodes, including
> their addresses. Recovery is then done node to node using the appropriate
> addresses.
>
>
> - Mark Miller
> luc
The cluster is running on one machine.
On Sun, Mar 18, 2012 at 2:07 PM, Mark Miller wrote:
> From every node in your cluster you can hit http://MACHINE1:8084/solr in
> your browser and get a response?
>
> On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote:
>
> > My cloud i
Sun, Mar 18, 2012 at 2:07 PM, Mark Miller wrote:
> From every node in your cluster you can hit http://MACHINE1:8084/solr in
> your browser and get a response?
>
> On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote:
>
> > My cloud instance finally tried to sync. It looks like it&
I had tried importing data from Manifold, and one document threw a Tika
Exception.
If I shut everything down and restart SOLR cloud, the system sync'd on
startup.
Could extraction errors be the issue?
On Sun, Mar 18, 2012 at 2:50 PM, Matthew Parker <
mpar...@apogeeintegration.com>
That idea was short lived. I excluded the document. The cluster isn't
syncing even after shutting everything down and restarting.
On Sun, Mar 18, 2012 at 2:58 PM, Matthew Parker <
mpar...@apogeeintegration.com> wrote:
> I had tried importing data from Manifold, and one documen
29 matches
Mail list logo