Thanks for the tip. This seems to work ;) But now i ran into another problem - im trying to use the "threads" parameter in my entitys in order to speed up the index creation. as soon as i use the threads parameter (e.g. threads="2") i get the following errors in my log:
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to execute query: SELECT name AS section_name, shortname as section_shortname FROM cat where id='739' at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72) at org.apache.solr.handler.dataimport.JdbcDataSource $ResultSetIterator.<init>(JdbcDataSource.java:251) at org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:208) at org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:39) at org.apache.solr.handler.dataimport.SqlEntityProcessor.initQuery(SqlEntityProcessor.java:58) at org.apache.solr.handler.dataimport.SqlEntityProcessor.nextRow(SqlEntityProcessor.java:71) at org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper.nextRow(ThreadedEntityProcessorWrapper.java:84) at org.apache.solr.handler.dataimport.DocBuilder $EntityRunner.runAThread(DocBuilder.java:438) at org.apache.solr.handler.dataimport.DocBuilder $EntityRunner.run(DocBuilder.java:391) at org.apache.solr.handler.dataimport.DocBuilder $EntityRunner.runAThread(DocBuilder.java:458) at org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.access $000(DocBuilder.java:345) at org.apache.solr.handler.dataimport.DocBuilder$EntityRunner $1.run(DocBuilder.java:398) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor $Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:636) Caused by: java.sql.SQLException: Streaming result set com.mysql.jdbc.rowdatadyna...@15b57613 is still active. No statements may be issued when any streaming result sets are open and in use on a given connection. Ensure that you have called .close() on any active streaming result sets before attempting more queries. at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:930) at com.mysql.jdbc.MysqlIO.checkForOutstandingStreamingData(MysqlIO.java:2694) at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:1868) at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2109) at com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2642) at com.mysql.jdbc.StatementImpl.executeSimpleNonQuery(StatementImpl.java:1544) at com.mysql.jdbc.StatementImpl.execute(StatementImpl.java:672) at com.mysql.jdbc.StatementImpl.execute(StatementImpl.java:625) at org.apache.solr.handler.dataimport.JdbcDataSource $ResultSetIterator.<init>(JdbcDataSource.java:244) ... 13 more Is the threads parameter supposed to work with jdbc/mysql? As far as i understand the error message, solr/jdbc-mysql tries to use the same mysql connection for multiple statements, which does not work. Have i misunderstood the usage of the threads parameter? Thanks again, Moritz Am Donnerstag, den 18.11.2010, 07:11 -0700 schrieb Shawn Heisey: > I did a quick grep through the directory listing of the Solr 3.1 source, > the only part of your analysis chain that came up empty was > HTMLStripWhitespaceTokenizerFactory. I think you'll have to replace it > with something like this: > > <charFilter class="solr.HTMLStripCharFilterFactory"/> > <tokenizer class="solr.WhitespaceTokenizerFactory"/> > > Also, the ISOLatin1AccentFilterFactory is deprecated, replaced with > ASCIIFoldingFilterFactory. It's still around, but could be removed at > any time. > > Shawn > > > On 11/18/2010 4:56 AM, Moritz Krinke wrote: > > Hello, > > > > i have a running solr 1.3 installation and would like to migrate it to > > solr 3 in order to get speed improvements by using the multiple threads > > for indexing. > > > > When starting SOLR 3, i get the following error message: > > SEVERE: org.apache.solr.common.SolrException: Unknown fieldtype 'textfc' > > specified on field descr > > > > I'm using the exact same schema.xml as with solr 1.3. > > In the schema.xml, the fieldTyper "textfc" is specified as follows: > > > > <fieldType name="textfc" class="solr.TextField" > > positionIncrementGap="100"> > > <analyzer type="index"> > > <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/> > > <!--<tokenizer class="solr.WhitespaceTokenizerFactory"/> --> > > <filter class="solr.ISOLatin1AccentFilterFactory"/> > > <!-- > > <filter class="solr.StopFilterFactory" > > ignoreCase="true" > > words="stopwords.txt" > > enablePositionIncrements="true" > > /> > > --> > > <filter class="solr.WordDelimiterFilterFactory" > > generateWordParts="1" generateNumberParts="1" catenateWords="1" > > catenateNumbers="1" catenateAll="0" splitOnCaseChange="0"/> > > <filter class="solr.LowerCaseFilterFactory"/> > > <filter class="solr.TrimFilterFactory" /> > > <filter class="solr.EnglishPorterFilterFactory" > > protected="protwords.txt"/> > > <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> > > </analyzer> > > > > <analyzer type="query"> > > <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/> > > <!--<tokenizer class="solr.WhitespaceTokenizerFactory"/> --> > > <filter class="solr.ISOLatin1AccentFilterFactory"/> > > <filter class="solr.SynonymFilterFactory" > > synonyms="synonyms.txt" ignoreCase="true" expand="true"/> > > <!--<filter class="solr.StopFilterFactory" ignoreCase="true" > > words="stopwords.txt"/> --> > > <filter class="solr.WordDelimiterFilterFactory" > > generateWordParts="1" generateNumberParts="1" catenateWords="0" > > catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/> > > <filter class="solr.LowerCaseFilterFactory"/> > > <filter class="solr.TrimFilterFactory" /> > > <filter class="solr.EnglishPorterFilterFactory" > > protected="protwords.txt"/> > > <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> > > </analyzer> > > > > </fieldType> > > > > > > Any ideas why this does not work? > > > > Thanks a lot, > > Moritz > > > > >