Hi, I just saw there is a schema-solr4.xml and a schema.xml in the nutch conf directory. But with both schemas I get the same errors when starting up solr. Heres the stacktrace:
Nov 8, 2012 3:32:14 PM org.apache.solr.core.SolrConfig <init> INFO: Loaded SolrConfig: solrconfig.xml Nov 8, 2012 3:32:14 PM org.apache.solr.schema.IndexSchema readSchema INFO: Reading Solr Schema Nov 8, 2012 3:32:14 PM org.apache.solr.schema.IndexSchema readSchema INFO: Schema name=nutch Nov 8, 2012 3:32:14 PM org.apache.solr.core.CoreContainer create SEVERE: Unable to create core: collection1 org.apache.solr.common.SolrException: Schema Parsing Failed: multiple points at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571) at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:846) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356) at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308) at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107) at org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754) at org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258) at org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221) at org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699) at org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36) at org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183) at org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491) at org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138) at org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142) at org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53) at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604) at org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535) at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398) at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552) at org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63) at org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53) at org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91) at org.eclipse.jetty.server.Server.doStart(Server.java:263) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215) at java.security.AccessController.doPrivileged(Native Method) at org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.eclipse.jetty.start.Main.invokeMain(Main.java:457) at org.eclipse.jetty.start.Main.start(Main.java:602) at org.eclipse.jetty.start.Main.main(Main.java:82) Caused by: java.lang.NumberFormatException: multiple points at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1082) at java.lang.Float.parseFloat(Float.java:422) at org.apache.solr.core.Config.getFloat(Config.java:284) at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:358) ... 45 more Nov 8, 2012 3:32:14 PM org.apache.solr.common.SolrException log SEVERE: null:org.apache.solr.common.SolrException: Schema Parsing Failed: multiple points at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571) at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:846) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356) at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308) at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107) at org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754) at org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258) at org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221) at org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699) at org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36) at org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183) at org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491) at org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138) at org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142) at org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53) at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604) at org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535) at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398) at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552) at org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63) at org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53) at org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91) at org.eclipse.jetty.server.Server.doStart(Server.java:263) at org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) at org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215) at java.security.AccessController.doPrivileged(Native Method) at org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.eclipse.jetty.start.Main.invokeMain(Main.java:457) at org.eclipse.jetty.start.Main.start(Main.java:602) at org.eclipse.jetty.start.Main.main(Main.java:82) Caused by: java.lang.NumberFormatException: multiple points at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1082) at java.lang.Float.parseFloat(Float.java:422) at org.apache.solr.core.Config.getFloat(Config.java:284) at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:358) ... 45 more Regards Antony 2012/11/8 Markus Jelsma <markus.jel...@openindex.io> > Hi - it fixes it here. Please post the full stack trace. > > -----Original message----- > > From:Antony Steiner <ant.stei...@gmail.com> > > Sent: Thu 08-Nov-2012 15:16 > > To: solr-user@lucene.apache.org > > Subject: Re: Apache Nutch 1.5.1 + Apache Solr 4.0 > > > > Hi, > > > > Thank you for your sugestion. Nope, it didn't change anything. Should I > > post the full stacktrace? > > > > Regards > > Antony > > > > > > 2012/11/8 Markus Jelsma <markus.jel...@openindex.io> > > > > > Hi, > > > > > > Your Nutch schema likely points to the old EnglishPorterFilter that > > > doesn't exist anymore. You can change that occurance to > > > PorterStemFilterFactory, that should fix the issue. > > > > > > -----Original message----- > > > > From:Antony Steiner <ant.stei...@gmail.com> > > > > Sent: Thu 08-Nov-2012 14:05 > > > > To: solr-user@lucene.apache.org > > > > Subject: Apache Nutch 1.5.1 + Apache Solr 4.0 > > > > > > > > Hello my name is Antony and I'm new to apache nutch and solr. > > > > > > > > I want to crawl my website and therefore I downloaded nutch to do > this. > > > > This works fine. But no I would like to integrate nutch with solr. Im > > > > running this on my unix system. > > > > Im trying to follow this tutorial: > > > > http://wiki.apache.org/nutch/NutchTutorial > > > > But it wont for me. Running Solr without nutch is no problem. I can > post > > > > documents to solr with post.jar. But what I want to do is post my > nutch > > > > crawl to solr. > > > > Now if I copy the schema.xml from nutch to > > > > apache-solr-4.0.0/example/solr/collection1/conf directory aned > restart > > > solr > > > > (java -jar start.jar), I get compiling errors but Solr will start. > (Is > > > this > > > > the correct directory to copy my schema?) > > > > > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.schema.IndexSchema readSchema > > > > INFO: Schema name=nutch > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.core.CoreContainer create > > > > SEVERE: Unable to create core: collection1 > > > > org.apache.solr.common.SolrException: Schema Parsing Failed: multiple > > > points > > > > at > > > > org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571) > > > > at > > > org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113) > > > > ... > > > > > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.common.SolrException log > > > > SEVERE: null:org.apache.solr.common.SolrException: Schema Parsing > Failed: > > > > multiple points > > > > at > > > > org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571) > > > > at > > > org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113) > > > > at > > > org.apache.solr.core.CoreContainer.create(CoreContainer.java:846) > > > > ... > > > > > > > > Now if I don't copy the schema and push my nutch crawl to solr I get > > > > following error: > > > > > > > > SolrIndexer: starting at 2012-11-08 10:49:02 > > > > Indexing 5 documents > > > > java.io.IOException: Job failed! > > > > SolrDeleteDuplicates: starting at 2012-11-08 10:49:47 > > > > SolrDeleteDuplicates: Solr url: http://photon:8983/solr/ > > > > > > > > And this is taken from the logging: > > > > org.apache.solr.common.SolrException: ERROR: [doc= > > > > http://e-docs/infrastructure/cpuload_monitor.html] unknown field > 'host' > > > > > > > > What should I do or what am I missing? > > > > > > > > I hope you can help me > > > > Best Regards > > > > Antony > > > > > > > > > >