RE: What is proper way to re-init index?
The problem script, schema.xml file and exception are in the attached file. Posting file new/howtocare2.xml to http://localhost:8983/solr/update undefined field formorg.xmlpull.v1.XmlPullParserException: expected START_TAG or END_TAG not END_DOCUMENT (position: END_DOCUMENT seen ...\n\n \n... @15:1) at org.xmlpull.mxp1.MXParser.nextTag(MXParser.java:1083) at org.apache.solr.core.SolrCore.update(SolrCore.java:680) at org.apache.solr.servlet.SolrServlet.doPost(SolrServlet.java:81) at javax.servlet.http.HttpServlet.service(HttpServlet.java:767) at javax.servlet.http.HttpServlet.service(HttpServlet.java:860) at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:408) at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:350) at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:195) at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:164) at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:536) at org.mortbay.jetty.Server.handle(Server.java:309) at org.mortbay.jetty.Server.handle(Server.java:285) at org.mortbay.jetty.HttpConnection.doHandler(HttpConnection.java:363) at org.mortbay.jetty.HttpConnection.access$1600(HttpConnection.java:45) at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:625) at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:613) at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:195) at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:297) at org.mortbay.jetty.nio.SelectChannelConnector$HttpEndPoint.run(SelectChannelConnector.java:680) at org.mortbay.thread.BoundedThreadPool$PoolThread.run(BoundedThreadPool.java:412) - book How To Care For Aging Parents The book that answers the questions you hoped you'd never ask. A one-stop resource for all themedical, financial, housing, emotional, and practical issues. Morris, Virginia Workman Publishing 2004 2006-03-23 11:48 book:99 ref text
Re: What is proper way to re-init index?
Hi John, The error message "undefined field form" means Solr doesn't know about the "form" field. Did you copy your schema.xml to the example/solrconf directory and restart the app server? I tried your schema and doc, and didn't get the error you did. I got an error further down due to and invalid date format (see schema.xml comments for date type for the correct format). -Yonik (the weird stack trace showing an XML parsing error is due to Solr trying to continue on to the next document even though the first had an error... and it looks like this part of the code could use a little work).
Re: To cluster, or not to cluster...
Jason, I'm not following why you suggesting having multiple masters (and updating multiple masters). I can see an advantage and lots of disadvantages. What is the problem you're suggesting this could address? --cw
Re: Multiple updates possible?
Further, I'd discourage anyone from using NFS for their read/write index location. I don't know about Windows either, but I'd think there could be similar non-atomic issues with windows fileshares. That's even in the case where you have one RW and the others are reads. --cw On 3/24/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : In this case we are looking at having multiple tomcats to provide us > : with load balancing and failover. We are not looking at a master/slave > : index solution. We'll also be working on windows. > > I'm not very faimilar with windows, but if your goal is to have load > balanced servers for failover, then what is the advantage of running those > multiple servers on the same box (pointed at teh same index directory)? > ... if the box goes down, you're up a creek. > > what we do is have one master port that recieves all of the updates and > has a postCommit hook which makes snapshots. then we have many slave > ports (running on other machines) which pull the snapshots at regular > intervals, and are all accessible behind a load balancer. > > if one slave goes down -- no big deal, the load balancer stops using it. > > if the master goes down, the slaves happily keep serving queries, but new > updates can't be published untill we install a "master" configuration > (with thepostCommit hook) on one of the slaves, and change the DNS record > for the master to point at that slave -- at which point it because the > new master. > > I know the existing snapshotter/snappuller scripts in subversion don't > work on windows, but one of the items on the task list is to try and come > up with equivilent methods that can -- if you have any ideas on how that > can be achieved that would be great! > > > > > -Hoss > >
RE: What is proper way to re-init index?
So why wouldn't the "form" field be recognized? It's in the schema (immediately after "ref"). In the meantime I'll play around with date. The comments say it should be: "2006-03-27T12:36:00Z". Unless I hear otherwise I'll assume that's correct. -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Monday, March 27, 2006 10:17 AM To: solr-user@lucene.apache.org Subject: Re: What is proper way to re-init index? Hi John, The error message "undefined field form" means Solr doesn't know about the "form" field. Did you copy your schema.xml to the example/solrconf directory and restart the app server? I tried your schema and doc, and didn't get the error you did. I got an error further down due to and invalid date format (see schema.xml comments for date type for the correct format). -Yonik (the weird stack trace showing an XML parsing error is due to Solr trying to continue on to the next document even though the first had an error... and it looks like this part of the code could use a little work).
Re: What is proper way to re-init index?
On 3/27/06, John Mohr <[EMAIL PROTECTED]> wrote: > So why wouldn't the "form" field be recognized? Your schema is correct, I tried it myself and it worked. My best guess is that Solr isn't actually reading your schema.xml, but the example one. Are you sure solrconf/schema.xml is your schema, and that you restarted the server so it would be re-read? -Yonik > -Original Message- > From: Yonik Seeley [mailto:[EMAIL PROTECTED] > Sent: Monday, March 27, 2006 10:17 AM > To: solr-user@lucene.apache.org > Subject: Re: What is proper way to re-init index? > > > Hi John, > > The error message "undefined field form" means Solr doesn't know about > the "form" field. > Did you copy your schema.xml to the example/solrconf directory and > restart the app server? > > I tried your schema and doc, and didn't get the error you did. I got > an error further down due to and invalid date format (see schema.xml > comments for date type for the correct format). > > -Yonik > > (the weird stack trace showing an XML parsing error is due to Solr > trying to continue on to the next document even though the first had > an error... and it looks like this part of the code could use a little > work). >
Re: What is proper way to re-init index?
The easiest way to verify that you are using your schema is to go to the admin page and check out the schema name... it should say "Solr Admin (trial1)" at the top of the page. You can also click on the schema link to see the contents of the schema file and verify that it's the most recent and contains the form field. -Yonik On 3/27/06, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On 3/27/06, John Mohr <[EMAIL PROTECTED]> wrote: > > So why wouldn't the "form" field be recognized? > > Your schema is correct, I tried it myself and it worked. > My best guess is that Solr isn't actually reading your schema.xml, but > the example one. > Are you sure solrconf/schema.xml is your schema, and that you > restarted the server so it would be re-read? > > -Yonik > > > > -Original Message- > > From: Yonik Seeley [mailto:[EMAIL PROTECTED] > > Sent: Monday, March 27, 2006 10:17 AM > > To: solr-user@lucene.apache.org > > Subject: Re: What is proper way to re-init index? > > > > > > Hi John, > > > > The error message "undefined field form" means Solr doesn't know about > > the "form" field. > > Did you copy your schema.xml to the example/solrconf directory and > > restart the app server? > > > > I tried your schema and doc, and didn't get the error you did. I got > > an error further down due to and invalid date format (see schema.xml > > comments for date type for the correct format). > > > > -Yonik > > > > (the weird stack trace showing an XML parsing error is due to Solr > > trying to continue on to the next document even though the first had > > an error... and it looks like this part of the code could use a little > > work). > >
RE: What is proper way to re-init index?
I was about to respond saying yes, I know it is correct, but it turns out that while I had changed the solrconf directory, I hadn't restarted Jetty that picks it up. After doing so, it worked fine. Thanks. -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Monday, March 27, 2006 12:42 PM To: solr-user@lucene.apache.org Subject: Re: What is proper way to re-init index? On 3/27/06, John Mohr <[EMAIL PROTECTED]> wrote: > So why wouldn't the "form" field be recognized? Your schema is correct, I tried it myself and it worked. My best guess is that Solr isn't actually reading your schema.xml, but the example one. Are you sure solrconf/schema.xml is your schema, and that you restarted the server so it would be re-read? -Yonik > -Original Message- > From: Yonik Seeley [mailto:[EMAIL PROTECTED] > Sent: Monday, March 27, 2006 10:17 AM > To: solr-user@lucene.apache.org > Subject: Re: What is proper way to re-init index? > > > Hi John, > > The error message "undefined field form" means Solr doesn't know about > the "form" field. > Did you copy your schema.xml to the example/solrconf directory and > restart the app server? > > I tried your schema and doc, and didn't get the error you did. I got > an error further down due to and invalid date format (see schema.xml > comments for date type for the correct format). > > -Yonik > > (the weird stack trace showing an XML parsing error is due to Solr > trying to continue on to the next document even though the first had > an error... and it looks like this part of the code could use a little > work). >
Re: To cluster, or not to cluster...
Hi Clay, The question was: "Hi Jason, Would that not mean if one of the master indexes went down then a subset of data would be offline? Rob." I think if you wanted to run redundant masters (for some reason unknown at this time) this might be one solution. Possibly if you had multiple data centers. - Original Message From: Clay Webster <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org; jason rutherglen <[EMAIL PROTECTED]> Sent: Monday, March 27, 2006 8:23:30 AM Subject: Re: To cluster, or not to cluster... Jason, I'm not following why you suggesting having multiple masters (and updating multiple masters). I can see an advantage and lots of disadvantages. What is the problem you're suggesting this could address? --cw
Rsync
I was thinking, would it not be possible to avoid using rsync and record a list of all new segment files added (from within Lucene), and simply use HTTP to sync down the newest ones? Perhaps only using rsync after an optimize? Seems like if I understand Lucene correctly only new files are created?