Hi,

This is my first post here, so please excuse me if it is off-topic.

At the moment I'm using Solr 1.4.1 with the SOLR-236 patch
(https://issues.apache.org/jira/browse/SOLR-236) applied to support field
collapsing.

One of the mandatory fields of the indexed documents is generated from the
*.doc/*.docx/*.pdf files uploaded by users, so Solr Cell is also heavily used
in the project to parse those documents and store their plain text content.
Unfortunately, it can't parse all the documents correctly, but in most cases
it works well enough.

Recently I learned
(http://stackoverflow.com/questions/6369214/solr-cell-extractingrequesthandler-cannot-parse-some-doc-files/)
that the Solr Cell I'm using is old, so by switching to an up-to-date version
I could get more documents parsed correctly. As I am using
apache-solr-cell-1.4.1.jar in my lib folder, the first thing I tried was to
replace it with apache-solr-cell-3.2.jar from the latest distribution without
changing anything else (e.g. the war file). After the Solr instance was
restarted, it worked (I managed to fetch the content of the parsed document),
but it crashed after a number of requests.

I then decided that in order to use the *-3.2 libraries properly I need to use
the 3.2 core war file as well. But as I need the collapsing functionality, I
need to build a custom patched version of it, as I did before with 1.4.1.

--
So my first question is whether my assumption here is right: is it possible to
upgrade Solr Cell / Tika to the latest version while still using the 1.4.1
Solr core? If that's possible, my following questions can be skipped.
--

The problem I am facing is that I can't build the 3.2 war file. When I get the
source from
http://svn.apache.org/repos/asf/lucene/solr/tags/1.4.1/release-1.4.1, the
build options include a "dist-war" target which builds the core war file and a
set of standard libraries. So building the 1.4.1 core is simple.

For 3.2, I can't see a similar build option. First, there is no release-3.2
folder, so I tried to check out http://svn.apache.org/repos/asf/lucene/dev/trunk
assuming it is the latest stable release (I might be wrong there). However,
there is no "dist-war" build option there, and building that branch only
produces various jar files with no war file at all.

--
So my second question is what exactly I am doing wrong: am I checking out the
wrong branch (and which is the correct one, then?), or am I building it
incorrectly (maybe I need to modify build.xml somehow)?
--

Many thanks in advance. Feel free to ask for more details if that would help -
I am a total noob in Java programming, so very likely I've missed something
here.

--
Yuriy Akopov

                                          
