Iceberg

Apache Iceberg is an open table format for large analytic datasets. Supermetal writes Parquet data files directly to Iceberg tables using REST, Glue, or S3 Tables catalogs with S3 or GCS storage.


Spec Versions

Iceberg V1 introduced schema evolution, hidden partitioning, and snapshot isolation. V2 added row-level deletes through position and equality delete files. V3 brings deletion vectors (replacing positional deletes), row-level lineage tracking, the variant type for semi-structured data, and geospatial types.

Supermetal defaults to V3 for new tables. Use V2 if your query engine doesn't support V3 yet.


Write Modes

Merge on Read (default)

SELECT * returns the current state of your data. When rows are updated or deleted, Supermetal writes equality delete files that mask older versions at query time. Duplicate primary keys are resolved using positional deletes (V2) or deletion vectors (V3). No data is rewritten at ingest.

Delete modes: Soft delete (default) preserves deleted rows, queryable via WHERE _sm_deleted = true. Hard delete removes rows completely from query results.

Requires V2 or V3 and a query engine that supports equality deletes (Spark 3.x+, Trino, Dremio, Snowflake, StarRocks).

Append

All changes are appended as new rows. Inserts, updates, and deletes each produce a new data row with metadata columns _sm_deleted and _sm_version. To query current state, filter with WHERE _sm_deleted = false and deduplicate by primary key using _sm_version.

Works with any Iceberg version and any query engine.

Comparison

Merge on ReadAppend
Iceberg versionV2, V3V1, V2, V3
Query engine supportRequires equality delete supportAny engine
Query complexitySELECT * returns current stateRequires dedup logic
Read performanceEngine applies deletes at read timeEngine scans all versions

Compaction

File creation rate is controlled by flush interval (default: 10 seconds). Run periodic compaction/maintenance using your query engine (Spark, Trino) or a table management service to optimize read performance.


Prerequisites

You need a catalog (REST, Glue, or S3 Tables), storage credentials for where data files will be written (S3 or GCS), and a target namespace for table creation.


Setup

Catalog

Configure the Iceberg catalog where table metadata is stored.

FieldDescription
URICatalog endpoint (e.g., https://catalog.example.com)
WarehouseStorage location identifier
AuthenticationOAuth2, Bearer, Basic, or SigV4

Authentication methods:

MethodUse Case
OAuth2Production environments with token endpoint, client ID/secret
BearerService accounts, CI/CD with static token
BasicDevelopment, JDBC catalogs with username/password
SigV4AWS services requiring request signing (region, service)
FieldDescription
WarehouseS3 location (e.g., s3://my-bucket/warehouse)
RegionAWS region
Catalog IDAWS account ID (optional)
CredentialsAccess key and secret
FieldDescription
Table Bucket ARNS3 Tables bucket ARN
RegionAWS region
CredentialsAccess key and secret

Target Namespace

Tables are created under this namespace. For multi-level namespaces, use comma-separated values: my_database, my_schema creates tables under my_database.my_schema.

Storage Credentials

Credentials for writing Parquet data files to cloud storage.

FieldDescription
Access Key IDAWS access key
Secret Access KeyAWS secret key
RegionAWS region (e.g., us-east-1)
EndpointCustom endpoint for S3-compatible storage
Path Style AccessEnable for MinIO and similar
FieldDescription
Credentials JSONService account key (base64-encoded)
Project IDGCP project identifier

Write Options

Control how data is written to Iceberg tables. See Write Modes for details on Merge on Read vs Append.

FieldDefaultDescription
Spec VersionV3Iceberg table format version
Write ModeMerge on ReadHow updates and deletes are handled
Delete ModeSoftFor Merge on Read: Soft preserves audit trail, Hard removes rows
Truncate Table if existsOffRemove existing data before snapshot sync
Metadata CompressionGzipCompression for Iceberg metadata files
Flush Interval10000 msCommit frequency

Parquet Settings

Configure the Parquet file format. Defaults work well for most workloads.

FieldDefaultDescription
CompressionZstdZstd, Snappy, Gzip, Lz4Raw, Brotli, or Uncompressed
Compression Level3Zstd (1-22), Gzip (0-9), or Brotli (0-11)
Target File Size512 MBFiles roll when exceeding this size
Parquet VersionV1V1 for compatibility, V2 for better encoding

Partitioning

Define partition specs per table to physically lay out data on disk by transform values (day, hash bucket, etc.). Query engines use these to prune files and accelerate scans.

Transforms

TransformCompatible source types
identityany primitive
year, month, daydate, timestamp, timestamptz
hourtimestamp, timestamptz
bucket(N)int, long, decimal, date, time, timestamp(tz), string, uuid, fixed, binary
truncate(W)int, long, decimal, string, binary

bucket(N) distributes rows across N hash buckets, useful for high-cardinality columns where you want even distribution. truncate(W) rounds integers, prefixes strings, or shortens binary to width W.

Iceberg rejects redundant transforms on the same column (for example both day(ts) and month(ts)).

Supermetal's metadata columns _sm_version (long) and _sm_deleted (boolean) can be used as partition sources.


Variant Type (V3)

Semi-structured source types such as Postgres JSONB, MySQL JSON, and MongoDB documents are automatically mapped to the Iceberg variant type on V3 tables. Variant encodes nested JSON natively in Parquet's binary variant format, giving query engines columnar access to individual fields without JSON parsing.


Truncate Table if exists

This option is off by default. Enable it to atomically remove all existing data before the initial snapshot sync, preventing duplicate rows when recreating a connector.

The previous data remains accessible via Iceberg time travel, so you can roll back if the sync fails.


Snapshot Metadata

Each commit writes properties to the Iceberg snapshot summary for debugging and audit:

  • sm.connector_id, sm.run_id - identify which sync produced the snapshot
  • sm.source.commit_ts - source commit timestamp (CDC only)
  • sm.truncated_from_snapshot - previous snapshot ID (truncate only)

Query via SELECT * FROM table$snapshots.


Table Name Mapping

Supermetal preserves source table names on the target when possible, sanitizing only when the catalog rejects them as written.

REST compatible catalogs accept most characters and pass names through; whitespace is replaced with _ since storage paths can't contain it. Glue and S3 Tables require lowercase identifiers of letters, digits, and underscores, so Supermetal lowercases the name, replaces -, ., and whitespace with _, and drops other special characters.

When the first character isn't valid for the catalog (a leading digit on REST, a leading underscore on Glue or S3 Tables), Supermetal prepends t_ rather than stripping the character to avoid conflicts.

A source name with no characters valid for the catalog maps to t_ followed by the hex bytes of the original.

Reserved metadata names

Iceberg reserves identifiers like entries, files, history, manifests, partitions, snapshots, refs, and their all_ and _delete_files variants for metadata tables. A real table sharing one of these names collides with Iceberg's metadata table redirect and breaks reads on most catalogs. Supermetal suffixes such names with _tbl, so files becomes files_tbl. The full list is in MetadataTableType.

Examples

SourceREST compatibleGlue / S3 Tables
usersusersusers
OrderItemsOrderItemsorderitems
order-itemsorder-itemsorder_items
My OrdersMy_Ordersmy_orders
_temp_tempt__temp
2024_logst_2024_logs2024_logs
files (reserved metadata)files_tblfiles_tbl
売上t_売上t_e5a3b2e4b88a

Limitations

  • Schema evolution: Data type promotion is not yet supported.
  • Partition spec evolution: Changing partitioning on an existing table requires recreating it.

Data Types

Source types are converted to Iceberg-compatible types. Types without native Iceberg support are stored as strings.

Arrow TypeIceberg TypeNotes
Booleanboolean
Int8, Int16, Int32intWidened to 32-bit
UInt8, UInt16intWidened to 32-bit
Int64long
UInt32longWidened to 64-bit
UInt64decimal(20,0)Exceeds long range
Float16, Float32float
Float64double
Decimal128(p,s)decimal(p,s)
Decimal256(p,s)stringExceeds decimal128 range
Date32, Date64date
Time32, Time64timeConverted to microseconds
Timestamp(s/ms/us, tz)timestamptzConverted to microseconds, UTC
Timestamp(s/ms/us, None)timestampConverted to microseconds
Timestamp(ns, *)longNanoseconds not supported
Utf8, LargeUtf8, Utf8Viewstring
Binary, LargeBinary, BinaryViewbinary
FixedSizeBinary(n)fixed(n)
List<T>, LargeList<T>list<T>
Map<K,V>map<K,V>
Structstruct
Duration, Interval, Union, Nullstring
Arrow TypeIceberg TypeNotes
Booleanboolean
Int8, Int16, Int32intWidened to 32-bit
UInt8, UInt16intWidened to 32-bit
Int64long
UInt32longWidened to 64-bit
UInt64decimal(20,0)Exceeds long range
Float16, Float32float
Float64double
Decimal128(p,s)decimal(p,s)
Decimal256(p,s)stringExceeds decimal128 range
Date32, Date64date
Time32, Time64timeConverted to microseconds
Timestamp(s/ms/us, tz)timestamptzConverted to microseconds, UTC
Timestamp(s/ms/us, None)timestampConverted to microseconds
Timestamp(ns, *)longQuery engines lack nanosecond support
Utf8, LargeUtf8, Utf8Viewstring
Binary, LargeBinary, BinaryViewbinary
FixedSizeBinary(n)fixed(n)
List<T>, LargeList<T>list<T>
Map<K,V>map<K,V>
Structstruct
Utf8 with arrow.json extensionvariant
Duration, Interval, Union, Nullstring

Apache Iceberg is a trademark of the Apache Software Foundation. No endorsement by the Apache Software Foundation is implied by the use of this mark.

Changelog

Last updated on

On this page