I am looking to add a new field whose value is extracted from the field text:

<field column="links" regex="......" sourceColName="text" />

For example, the field links should contain all the links extracted from
the field text of each file.
I defined a regex for links in tika.config.xml, but when the indexing
process finishes I get just one value, even though the field links is
defined as multiValued="true" in schema.xml. I also noticed that the
update/extract handler gets all the links automatically (multiple values).
What do I have to do to get all the links present in each file with the
Data Import Handler?
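
A minimal sketch of the kind of setup described above, assuming the common
FileListEntityProcessor / TikaEntityProcessor nesting with RegexTransformer.
The entity names, baseDir, file pattern and the link regex are hypothetical
placeholders for illustration, not the actual values from tika.config.xml:

```xml
<dataConfig>
  <dataSource type="BinFileDataSource" name="bin" />
  <document>
    <!-- Outer entity: lists the files to index (baseDir and fileName are hypothetical) -->
    <entity name="files" processor="FileListEntityProcessor"
            baseDir="/path/to/docs" fileName=".*\.pdf"
            rootEntity="false" dataSource="null">
      <!-- Inner entity: Tika extracts the body into the "text" column -->
      <entity name="doc" processor="TikaEntityProcessor"
              url="${files.fileAbsolutePath}" format="text"
              dataSource="bin" transformer="RegexTransformer">
        <field column="text" name="text" />
        <!-- Hypothetical link regex; this is the field that ends up with a single value -->
        <field column="links" regex="(https?://\S+)" sourceColName="text" />
      </entity>
    </entity>
  </document>
</dataConfig>
```

The matching schema.xml entry would then look something like
<field name="links" type="string" indexed="true" stored="true" multiValued="true" />,
where the type and the indexed/stored settings are again assumptions.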
