Re: [PR] Set Glue Table Information when creating/updating tables [iceberg-python]

via GitHub Mon, 22 Jan 2024 01:51:34 -0800


nicor88 commented on PR #288:
URL: https://github.com/apache/iceberg-python/pull/288#issuecomment-1903624202


   @mgmarino @HonahX - I was testing this, and after the change I can confirm that 
I can query the table in Athena (I'm still digging into why the 
table is not droppable in Athena). However, I'm seeing some weird behavior.
   If I run 
   ```
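   import pandas as pd
   import pyarrow as pa

   # `t` is assumed to be the Iceberg table loaded earlier (not shown here)
   data = [
       {"x": "Alice"},
       {"x": "Bob"}
   ]
   df = pd.DataFrame(data)
   
   to_append = pa.Table.from_pandas(df)
   
   t.append(to_append)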
   ```
   I expect the final table to contain multiple records with 
x=Alice, but that's not the case.
   Then I run:
   ```
   data = [
       {"x": "Alice v1"},
       {"x": "Bob v1"}
   ]
   df = pd.DataFrame(data)
   
   to_append = pa.Table.from_pandas(df)
   
   t.append(to_append)
   ```
   I was expecting overwrite behaviour, but I still get the first 
snapshot.
   Checking the data folder, it seems that the Parquet files are written, along 
with new metadata files, but the Glue catalog still points to the old snapshot, 
and that's definitely not right. Am I missing something here? 
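
One way to confirm the symptom described above is to compare the `metadata_location` table parameter that Glue stores for an Iceberg table against the newest metadata file in the table's metadata folder. The following is only a minimal sketch: the helper names, the bucket, and the file names are hypothetical, and the response dict is a hand-written stand-in shaped like what boto3's `glue.get_table(DatabaseName=..., Name=...)` returns, not real output.

```python
from typing import Optional


def glue_metadata_location(get_table_response: dict) -> Optional[str]:
    """Return the Iceberg metadata pointer stored in a Glue GetTable response."""
    params = get_table_response.get("Table", {}).get("Parameters", {})
    return params.get("metadata_location")


def catalog_is_stale(get_table_response: dict, latest_metadata_file: str) -> bool:
    """True when Glue points at a different metadata file than the latest one."""
    return glue_metadata_location(get_table_response) != latest_metadata_file


# Hypothetical response, shaped like boto3's glue.get_table(...) result.
response = {
    "Table": {
        "Name": "example_table",
        "Parameters": {
            "table_type": "ICEBERG",
            "metadata_location": "s3://example-bucket/tbl/metadata/00000-aaa.metadata.json",
        },
    }
}

# Hypothetical newest metadata file found in the table's metadata folder.
latest = "s3://example-bucket/tbl/metadata/00001-bbb.metadata.json"

print(catalog_is_stale(response, latest))
```

If the check reports a stale pointer while newer metadata files exist, that would be consistent with the commit writing files but not updating the catalog entry.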
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


