Hi, I am trying to read the content of .msg files using Tika and index them in Solr, but I am having problems with OfficeParser. I keep getting a java.lang.NoClassDefFoundError for OfficeParser, even though both tika-core and tika-parsers are included in the build path.
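To narrow down which class is actually missing at runtime, a small classpath probe along these lines might help. This is only a minimal sketch: the class name ClasspathProbe and the helper isLoadable are made up for illustration, and the two candidate class names are assumptions about what tika-parsers and Apache POI provide (org.apache.tika.parser.microsoft.OfficeParser and org.apache.poi.poifs.filesystem.POIFSFileSystem). It uses nothing beyond the JDK, so it can be run with the same classpath as the failing program.

```java
public class ClasspathProbe {

    // Returns true if the named class can be located and loaded by this class's loader.
    static boolean isLoadable(String className) {
        try {
            // initialize=false: load the class without running its static initializers
            Class.forName(className, false, ClasspathProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // LinkageError covers NoClassDefFoundError raised while loading the class
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed class names; adjust to whatever the stack trace actually reports.
        String[] candidates = {
            "org.apache.tika.parser.microsoft.OfficeParser",
            "org.apache.poi.poifs.filesystem.POIFSFileSystem"
        };
        for (String name : candidates) {
            System.out.println(name + " -> " + (isLoadable(name) ? "found" : "MISSING"));
        }
    }
}
```

If OfficeParser itself is reported as found but the POI class is missing, the problem is probably a transitive dependency that is absent from the runtime classpath, not tika-parsers itself. That is a guess, though, and the full stack trace would say more.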
I am using Java with the following code:

    import java.io.File;
    import java.io.IOException;
    import java.io.InputStream;
    import org.apache.tika.exception.TikaException;
    import org.apache.tika.io.TikaInputStream;
    import org.apache.tika.metadata.Metadata;
    import org.apache.tika.parser.ParseContext;
    import org.apache.tika.parser.Parser;
    import org.apache.tika.parser.microsoft.OfficeParser;
    import org.apache.tika.sax.BodyContentHandler;
    import org.xml.sax.SAXException;

    public static void main(final String[] args) throws IOException, SAXException, TikaException {
        // pathtofile stands for the path to the .msg file
        processDocument(pathtofile);
    }

    private static void processDocument(String pathfilename) throws IOException, SAXException, TikaException {
        File file = new File(pathfilename);
        Metadata meta = new Metadata();
        BodyContentHandler handler = new BodyContentHandler();
        Parser parser = new OfficeParser();
        ParseContext context = new ParseContext();
        // try-with-resources closes the stream even if parsing fails
        try (InputStream input = TikaInputStream.get(file)) {
            parser.parse(input, handler, meta, context);
        }
        String doccontent = handler.toString();
        System.out.println(doccontent);
        System.out.println(meta);
    }

In the build path I have the following dependencies:

[attached image: image001.png]

Any help is appreciated. Thanks in advance.

Best regards,
Martin Hansen