Hello, I am trying to find a way to index some documents, all located in a directory in HDFS.
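For context, files in HDFS can be listed and read over the WebHDFS REST API. Below is a minimal sketch of the URLs involved. The host, port, directory, and file name are all hypothetical placeholders, not values from my actual cluster, and the port in particular depends on the Hadoop version and configuration.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op):
    """Build a WebHDFS REST URL for an HDFS path and an operation name."""
    # WebHDFS serves the filesystem under the /webhdfs/v1 prefix;
    # the operation is passed as the "op" query parameter.
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{urlencode({'op': op})}"


# Hypothetical placeholders: substitute the values for your own cluster.
NAMENODE_HOST = "namenode.example.com"
NAMENODE_PORT = 50070
DOCS_DIR = "/user/solr/docs"

# LISTSTATUS lists the entries of a directory as JSON.
list_url = webhdfs_url(NAMENODE_HOST, NAMENODE_PORT, DOCS_DIR, "LISTSTATUS")

# OPEN reads the content of a single file (doc1.txt is a made-up name).
open_url = webhdfs_url(NAMENODE_HOST, NAMENODE_PORT, DOCS_DIR + "/doc1.txt", "OPEN")

print(list_url)
print(open_url)
```

A URL-based data source would presumably need one such OPEN URL per file, with the file names taken from the LISTSTATUS response.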
Since HDFS has a REST API, I tried using the DataImportHandler (DIH) with URLDataSource as the data source type to index the documents. Is this approach wrong? If so, is there a canonical way to index documents stored in HDFS?

--
Sincerely,
*Rishabh Patel*