It appears that this is simpler than I thought: in Solr 4.4, at least,
there is a data source class named "FieldStreamDataSource" that I can use
directly with the TikaEntityProcessor. Given a blob column named DOCIMAGE,
I can use the following Tika entity:
...
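For anyone finding this thread later: the exact entity is elided above, so
what follows is only a rough sketch of how such a setup might look, not the
configuration actually used. The JDBC settings, the DOCS table, the ID column,
the entity names, and the "content" target field are all made-up placeholders;
only the DOCIMAGE column, FieldStreamDataSource, and TikaEntityProcessor come
from this thread. The url attribute is included because TikaEntityProcessor
documents it as required; as far as I can tell its value is ignored when the
stream comes from FieldStreamDataSource, but treat that as an assumption.

```xml
<dataConfig>
  <!-- Hypothetical JDBC source; driver, url, user and password are placeholders -->
  <dataSource name="db" driver="your.jdbc.Driver" url="jdbc:yourdb://host/dbname"
              user="solr" password="secret"/>
  <!-- Exposes a binary column of the parent row as an InputStream -->
  <dataSource name="fieldStream" type="FieldStreamDataSource"/>
  <document>
    <!-- Hypothetical parent entity; table and ID column are assumptions -->
    <entity name="doc" dataSource="db" query="SELECT ID, DOCIMAGE FROM DOCS">
      <field column="ID" name="id"/>
      <!-- dataField points at the blob column of the parent entity -->
      <entity name="tika" processor="TikaEntityProcessor"
              dataSource="fieldStream" dataField="doc.DOCIMAGE"
              url="unused" format="text">
        <!-- "text" is the extracted body; "content" is an assumed schema field -->
        <field column="text" name="content"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

The point of the nested entity is that the blob never has to be converted by
a transformer: the inner data source reads the column directly from the
current parent row and hands it to Tika as a stream.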
There's no BlobTransformer in DataImportHandler. You'll have to write one.
Also, you'd probably need to write a FieldInputStreamDataSource instead of
FieldReaderDataSource.
On Tue, Jul 30, 2013 at 12:30 PM, Raymond Wiker wrote:
> I have a case where I want to documents and metadata content from