: However my raw files are stored on some remote storage devices. I am able to : get an InputStream object for the file to be indexed. To me it may seem : awkward to have the file temporarily stored locally. Is there a way of : directly passing the InputStream in (e.g. constructing ContentStream using : the InputStream)?
Sure, go write ahead. ContentStream is a really simple abstraction designed to make it easy to add some common pieces of information to either an InputStream or a Reader. Take a look at ContentStreamBase as a starting point for creating your own subclass that can point to whatever InputStream you want. -Hoss