I tried updating the solr instance I'm testing DIH with, adding the the dataimport and slf4j jar files to solr.
When I start solr, I get the following error. Is there something else which needs to be installed for the nightly build version of DIH to work in solr release 1.3? Thanks, Tim java.lang.NoClassDefFoundError: org/apache/solr/update/RollbackUpdateCommand at org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:95) at org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:311) at org.apache.solr.core.SolrCore.<init>(SolrCore.java:480) at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:119) at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:69) at com.caucho.server.dispatch.FilterManager.createFilter(FilterManager.java:134) at com.caucho.server.dispatch.FilterManager.init(FilterManager.java:87) at com.caucho.server.webapp.Application.start(Application.java:1655) at com.caucho.server.deploy.DeployController.startImpl(DeployController.java:621) at com.caucho.server.deploy.StartAutoRedeployAutoStrategy.startOnInit(StartAutoRedeployAutoStrategy.java:72) at com.caucho.server.deploy.DeployController.startOnInit(DeployController.java:509) at com.caucho.server.deploy.DeployContainer.start(DeployContainer.java:153) at com.caucho.server.webapp.ApplicationContainer.start(ApplicationContainer.java:670) at com.caucho.server.host.Host.start(Host.java:420) at com.caucho.server.deploy.DeployController.startImpl(DeployController.java:621) at com.caucho.server.deploy.StartAutoRedeployAutoStrategy.startOnInit(StartAutoRedeployAutoStrategy.java:72) at com.caucho.server.deploy.DeployController.startOnInit(DeployController.java:509) at com.caucho.server.deploy.DeployContainer.start(DeployContainer.java:153) at com.caucho.server.host.HostContainer.start(HostContainer.java:504) at com.caucho.server.resin.ServletServer.start(ServletServer.java:971) at com.caucho.server.deploy.DeployController.startImpl(DeployController.java:621) at com.caucho.server.deploy.AbstractDeployControllerStrategy.start(AbstractDeployControllerStrategy.java:56) at com.caucho.server.deploy.DeployController.start(DeployController.java:517) at com.caucho.server.resin.ResinServer.start(ResinServer.java:551) at com.caucho.server.resin.Resin.init(Resin.java) at com.caucho.server.resin.Resin.main(Resin.java:625) Caused by: java.lang.ClassNotFoundException: org.apache.solr.update.RollbackUpdateCommand at com.caucho.loader.DynamicClassLoader.findClass(DynamicClassLoader.java:1130) at com.caucho.loader.DynamicClassLoader.loadClass(DynamicClassLoader.java:1072) at com.caucho.loader.DynamicClassLoader.loadClass(DynamicClassLoader.java:1021) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320) ... 26 more On Thu, Mar 5, 2009 at 9:10 AM, Garafola Timothy <timgaraf...@gmail.com> wrote: > yes, the dataimport.properties file is present in the conf directory > from previous imports. I'll try the trunk version as you suggested to > see if the problem persists. > > Thanks, > Tim > > On Wed, Mar 4, 2009 at 7:54 PM, Noble Paul നോബിള് नोब्ळ् > <noble.p...@gmail.com> wrote: >> the dataimport.properties is created only after one successful import >> .so it is available only from second import onwards. probably you can >> create one manually and put it in the conf dir. >> >> On Thu, Mar 5, 2009 at 12:52 AM, Garafola Timothy <timgaraf...@gmail.com> >> wrote: >>> Thanks, >>> >>> I set up a another test instance of solr and ran a full import within >>> the DIH Development Console. I examined the query and found that >>> last_index_time is not getting set in the query. Yet the value does >>> get updated after a full import completes (outside of the development >>> console). Is there some place that I need to set the path to the >>> dataimport.properties file? >>> >>> On Tue, Mar 3, 2009 at 8:03 PM, Noble Paul നോബിള് नोब्ळ् >>> <noble.p...@gmail.com> wrote: >>>> I do not see anything wrong with this .It should have worked . Can you >>>> check that dataimport.properties is created (by DIH) in the conf >>>> directory? . check the content? >>>> >>>> >>>> are you sure that the query >>>> >>>> select DId from 2_Doc where ModifiedDate > >>>> '${dataimporter.last_index_time}' >>>> >>>> works with a date format yyyy-MM-dd HH:mm:ss . This is the format >>>> which DIH sends the date in . If the format is wrong you may need to >>>> format it using a dateformat function. >>>> >>>> see here >>>> >>>> http://wiki.apache.org/solr/DataImportHandler#head-5675e913396a42eb7c6c5d3c894ada5dadbb62d7 >>>> >>>> >>>> The trunk DIH can work with Solr1.3 (you may need to put the DIH jar >>>> and slf4j). Can >>>> - Show quoted text - >>>> On Wed, Mar 4, 2009 at 3:53 AM, Garafola Timothy <timgaraf...@gmail.com> >>>> wrote: >>>>> I'm using solr 1.3 and am trying to get a delta-import with the DIH. >>>>> Recently the wiki, http://wiki.apache.org/solr/DataImportHandler, was >>>>> updated explaining that delta import is a 1.4 feature now but it was >>>>> still possible get a delta using the full import example here, >>>>> http://wiki.apache.org/solr/DataImportHandlerFaq#fullimportdelta. I >>>>> tried this but each time I run DIH, it reimports all rows and updates. >>>>> >>>>> Below is my data-config.xml. I set rootEntity to false and issued >>>>> command=full-import&clean=false&optimize=false through DIH. Am I >>>>> doing something wrong here or is the DataImportHandlerFaq incorrect? >>>>> >>>>> <dataConfig> >>>>> <dataSource driver="com.mysql.jdbc.Driver" >>>>> url="jdbc:mysql://pencil-somewhere.com:22222/SomeDB" user="someUser" >>>>> password="somePassword"/> >>>>> <document name=""> >>>>> <entity name = "item" rootEntity="false" >>>>> query = "select DId from 2_Doc where >>>>> ModifiedDate > '${dataimporter.last_index_time}' >>>>> and DocType != 'Research Articles'"> >>>>> <entity name="feature" pk="DId" >>>>> transformer="RegexTransformer" >>>>> query = "SELECT d.DId, d.SiteId, >>>>> d.DocTitle, d.DocURL, d.DocDesc, >>>>> d.DocType, d.Tags, d.Source, >>>>> d.Last90DaysRFIsPercent, >>>>> d.ModifiedDate, d.DocGuid, >>>>> d.Author, >>>>> i.Industry FROM 2_Doc d LEFT >>>>> OUTER JOIN tmp_DocIndustry i >>>>> ON (d.DocId=i.DocId AND >>>>> d.SiteId=i.SiteId) where d.DocType != 'Research articles' >>>>> and d.DId = '${item.DId}' and >>>>> d.ModifiedDate > '${dataimporter.last_index_time}'"> >>>>> <field column = "DId" name ="did"/> >>>>> <field column = "SiteId" name ="SiteId"/> >>>>> <field column = "DocId" name ="DocId"/> >>>>> <field column = "DocTitle" name >>>>> ="DocTitle"/> >>>>> <field column = "DocURL" name ="DocURL"/> >>>>> <field column = "DocDesc" name ="DocDesc" >>>>> /> >>>>> <field column = "Snippet" >>>>> regex="^(.{0,800})\b.*$" sourceColName="DocDesc"/> >>>>> <field column = "DocType" name >>>>> ="DocType"/> >>>>> <field column = "Tags" name ="Tags" >>>>> splitBy=";" sourceColName="Tags"/> >>>>> <field column = "Source" name ="Source"/> >>>>> <field column = >>>>> "Last90DaysRFIsPercent" name ="Last90DaysRFIsPercent"/> >>>>> <field column = "ModifiedDate" name >>>>> ="ModifiedDate"/> >>>>> <field column = "DocGuid" name >>>>> ="DocGuid"/> >>>>> <field column = "Author" name ="Author"/> >>>>> <field column = "Industry" name >>>>> ="Industry" sourceColName="Industry"/> >>>>> </entity> >>>>> </entity> >>>>> </document> >>>>> </dataConfig> >>>>> >>>>> Thanks, >>>>> -Tim >>>>> >>>> >>>> >>>> >>>> -- >>>> --Noble Paul >>>> >>> >>> >>> >>> -- >>> -Tim >>> >> >> >> >> -- >> --Noble Paul >> > > > > -- > -Tim > -- -Tim