>lein try clj-tika "1.2.0"
user=> (use 'tika)
user=> (def res (parse "https://www.oasis-open.org/committees/download.php/25054/07-08-22-MetaData-Examples.odt"))
#'user/res
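
Continuing from that session, the result can be taken apart with ordinary map functions. This is only a sketch: `sample-res` is a hypothetical stand-in for the value bound to `res`, so that the snippet runs without clj-tika or network access, and its metadata keys and values are invented for illustration rather than taken from real Tika output.

```clojure
;; Hypothetical stand-in for the map returned by (parse ...).
;; The metadata keys and values below are invented for illustration only.
(def sample-res
  {:text "Extracted document text"
   :content-type "application/vnd.oasis.opendocument.text"
   :title "Metadata examples"})

;; The extracted text is stored under the :text key.
(def text (:text sample-res))

;; Everything except :text is treated as document metadata.
(def metadata (dissoc sample-res :text))

(println text)
(println (sort (keys metadata)))
```

With a real `res`, replacing `sample-res` by `res` should work the same way, assuming the map has the shape described in this message.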

res is a map consisting of:
- :text -> the extracted text
- all other fields -> metadata from the document

On Wed, Jun 4, 2014 at 2:33 PM, Bastien <[email protected]> wrote:
> Hi Alex,
>
> Alex Ott <[email protected]> writes:
>
> > Pantomime right now doesn't support text extraction, but you can
> > take https://github.com/alexott/clj-tika (although it is outdated) - it
> > uses Apache Tika for text extraction
>
> thanks -- I stumbled upon clj-tika but didn't understand how to use
> it. Would you have a minimal example? The README is pretty terse.
>
> Thanks in advance,
>
> --
> Bastien
>
> --
> You received this message because you are subscribed to the Google
> Groups "Clojure" group.
> To post to this group, send email to [email protected]
> Note that posts from new members are moderated - please be patient with
> your first post.
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/clojure?hl=en
> ---
> You received this message because you are subscribed to the Google Groups
> "Clojure" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/d/optout.
>

--
With best wishes, Alex Ott
http://alexott.net/
Twitter: alexott_en (English), alexott (Russian)
Skype: alex.ott
