DieHertz commented on issue #1229:
URL: 
https://github.com/apache/iceberg-python/issues/1229#issuecomment-2414915918

   Indeed it is good enough for I/O-bound tasks, but in my understanding this 
part is CPU-bound.
   
   I think so because I'm observing close to 100% CPU usage when inside 
`plan_files`/`inspect.files`.
   Also I have profiled how long it takes to perform `snapshot.manifests(io)` 
and `manifest.fetch_manifest_entry_io)`, and the amount spent actually 
downloading the `.avro` files seems negligible.
   Most of the time is spent processing the manifests record-by-record and 
converting each record to a dict


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to