Hello,

I am trying to find a way to index some documents, all located in a
directory in HDFS.

Since HDFS has a REST API (WebHDFS), I was trying to use the
DataImportHandler (DIH) with URLDataSource as the data source type to
index the documents.
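
To make the idea concrete, here is a rough sketch of the WebHDFS URLs such a
setup would have to fetch. The NameNode host, port, and directory are made-up
placeholders, and the helper names are hypothetical. It only builds URLs and
parses a listing response; it does not talk to HDFS or Solr.

```python
import json
from urllib.parse import quote

# Placeholder NameNode address; host and port are assumptions.
WEBHDFS_BASE = "http://namenode.example.com:50070/webhdfs/v1"


def webhdfs_url(path, op):
    """Build a WebHDFS REST URL for an absolute HDFS path and operation."""
    return f"{WEBHDFS_BASE}{quote(path)}?op={op}"


def file_urls(directory, liststatus_json):
    """Turn a LISTSTATUS response body into one OPEN URL per regular file."""
    statuses = json.loads(liststatus_json)["FileStatuses"]["FileStatus"]
    return [
        webhdfs_url(f"{directory.rstrip('/')}/{s['pathSuffix']}", "OPEN")
        for s in statuses
        if s["type"] == "FILE"  # skip subdirectories
    ]


if __name__ == "__main__":
    # Hand-written sample in the shape LISTSTATUS returns (not real output).
    sample = json.dumps({"FileStatuses": {"FileStatus": [
        {"pathSuffix": "a.txt", "type": "FILE"},
        {"pathSuffix": "sub", "type": "DIRECTORY"},
    ]}})
    print(webhdfs_url("/data/docs", "LISTSTATUS"))
    for url in file_urls("/data/docs", sample):
        print(url)
```

Each OPEN URL would be a candidate for the url that URLDataSource reads from.
Note that OPEN normally answers with a redirect to a DataNode, so whatever
fetches it has to follow redirects.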

Is this approach wrong? If so, is there a canonical way to index
documents stored in HDFS?
-- 
Sincerely,
*Rishabh Patel*
