fix: add iceberg_type column for SqlCatalog by rchowell · Pull Request #3263 · apache/iceberg-python

rchowell · 2026-04-21T19:45:06Z

Rationale for this change

Pyiceberg currently does not include the iceberg_type column in the iceberg_tables table for SQL-based catalogs. This caused an error when reading a SQL-based Catalog from iceberg-rust, which expected this column. I will also update iceberg-rust to be more defensive like this PR.

The fix here is done exactly how iceberg-java handles this. We do an idempotent ALTER TABLE iceberg_tables ADD COLUMN iceberg_type after ensuring iceberg_tables exists. We explicitly use the try/catch to make it idempotent because older sqlite versions do not support IF NOT EXISTS for column creation. For newly created tables, the addition to the sqlalchemy table will create the column. For existing tables, this backwards-compatible schema update will hit.

Are these changes tested?

Unit Testing

Unit test for the migration case
Unit test for new table creation and idempotency

End-to-end Testing

# Case 1. New Catalog

# Catalog definition
$ cat ~/.pyiceberg.yaml | yq .catalog.tmp
type: sql
uri: sqlite:////tmp/pyiceberg.db

# Trigger initialization of catalog tables
$ pyiceberg --catalog tmp list

# Verify Schema
sqlite> .schema iceberg_tables
CREATE TABLE iceberg_tables (
        catalog_name VARCHAR(255) NOT NULL, 
        table_namespace VARCHAR(255) NOT NULL, 
        table_name VARCHAR(255) NOT NULL, 
        metadata_location VARCHAR(1000), 
        previous_metadata_location VARCHAR(1000), 
 ---->  iceberg_type VARCHAR(5), 
        PRIMARY KEY (catalog_name, table_namespace, table_name)
);

# Case 2. Existing Catalog (before fix)

# This catalog was created without the fix
sqlite> .schema iceberg_tables
CREATE TABLE iceberg_tables (
        catalog_name VARCHAR(255) NOT NULL, 
        table_namespace VARCHAR(255) NOT NULL, 
        table_name VARCHAR(255) NOT NULL, 
        metadata_location VARCHAR(1000), 
        previous_metadata_location VARCHAR(1000), 
        PRIMARY KEY (catalog_name, table_namespace, table_name)
);

# Fix is now applied
# ...

# Trigger update by loading this catalog with these changes in pyiceberg
sqlite> .schema iceberg_tables
CREATE TABLE iceberg_tables (
        catalog_name VARCHAR(255) NOT NULL, 
        table_namespace VARCHAR(255) NOT NULL, 
        table_name VARCHAR(255) NOT NULL, 
        metadata_location VARCHAR(1000), 
        previous_metadata_location VARCHAR(1000),
 ---->  iceberg_type VARCHAR(5),
        PRIMARY KEY (catalog_name, table_namespace, table_name)
);

Are there any user-facing changes?

No

geruh

Thanks for raising this @rchowell! This is a matter of completing the implementation of our SqlCatalog. Here you are updating the table to have the column and handle the migration logic.

We will definitely need a follow up here to filter against this column for table operations. Looks like the java v1 implementation runs something like WHERE (iceberg_type = 'TABLE' OR iceberg_type IS NULL) Otherwise, a view can bleed into table operations.

rchowell · 2026-04-22T23:44:17Z

@geruh nice catch, added with a little unit test. Thanks.

geruh

LGTM! I'll create the fast follow issue.

rchowell · 2026-05-08T18:07:33Z

Thanks @geruh - one thing I would like to flag is that we shouldn't auto-migrate as pointed out in this PR. apache/iceberg-rust#2380 -- I am ok with not-merging this as-is and waiting for me to update the non-auto migrate. I can address #3337 easily as well, just like the iceberg-rust PR.

fix: add iceberg_type column for SqlCatalog

1338bbc

geruh reviewed Apr 21, 2026

View reviewed changes

Comment thread tests/catalog/test_sql.py Outdated

filter on iceberg_type column

d048c5a

rchowell requested a review from geruh April 24, 2026 22:44

This was referenced Apr 28, 2026

feat: support V0 iceberg_tables schema for SqlCatalog apache/iceberg-rust#2380

Open

column "iceberg_type" does not exist apache/iceberg-rust#2068

Open

geruh approved these changes May 7, 2026

View reviewed changes

geruh mentioned this pull request May 7, 2026

SqlCatalog table operations should filter on iceberg_type #3337

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add iceberg_type column for SqlCatalog#3263

fix: add iceberg_type column for SqlCatalog#3263
rchowell wants to merge 2 commits intoapache:mainfrom
rchowell:iceberg-type

rchowell commented Apr 21, 2026

Uh oh!

geruh left a comment

Uh oh!

Uh oh!

rchowell commented Apr 22, 2026

Uh oh!

geruh left a comment

Uh oh!

rchowell commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rchowell commented Apr 21, 2026

Rationale for this change

Are these changes tested?

Are there any user-facing changes?

Uh oh!

geruh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rchowell commented Apr 22, 2026

Uh oh!

geruh left a comment

Choose a reason for hiding this comment

Uh oh!

rchowell commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants