GitHub user avamingli added a comment to the discussion: Extend the gpfdist 
tool to support SFTP/HDFS protocols for high-performance multi-source data 
ingestion

Hi, interesting!

As far as I know, gpfdist is a protocol for external tables, and the gpfdist 
tool is an implementation of that protocol. Other tools, like GPSS (Greenplum 
Streaming Server), also implement the gpfdist protocol for external tables. The 
data format is specified by the format option when defining the external table.

```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('gpfdist://9727/d.dat') 
format 'csv' (DELIMITER '|');
```
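
For comparison, a gpfdist location is usually written with both a host and a port. Here is a minimal sketch of the same table in that form; the host name `etlhost` and the table name `ext1_full` are placeholders I made up for illustration, not something from the original proposal:

```sql
-- Hypothetical host 'etlhost'; assumes a gpfdist process is serving
-- the directory that contains d.dat on port 9727.
CREATE EXTERNAL TABLE ext1_full (d varchar(20))
LOCATION ('gpfdist://etlhost:9727/d.dat')  -- protocol://host:port/path
FORMAT 'CSV' (DELIMITER '|');              -- data format, independent of the protocol
```

In this form the transport (the URI scheme in `LOCATION`) and the data layout (`FORMAT`) are clearly two separate settings.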

Are you planning to support additional protocols (e.g., SFTP) for external 
tables:
```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('SFTP://9727/d.dat')
```
or provide more format options?
```sql
CREATE EXTERNAL TABLE ext1 (d varchar(20)) location ('gpfdist://9727/d.dat') 
format 'SFTP';
```



GitHub link: 
https://github.com/apache/cloudberry/discussions/1205#discussioncomment-13636225

----
This is an automatically sent email for dev@cloudberry.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@cloudberry.apache.org


