NoneWriteDistributionSortedDataframeSpec: + See https://iceberg.apache.org/docs/1.6.0/spark-writes/#writing-distribution-modes Using write.distribution-mode - should create the appropriate number of Iceberg files + Given a table that is created with: CREATE TABLE polaris.my_namespace.NoneWriteDistributionSortedDataframeSpec ( id int, label String, partitionKey long, date Date, timestamp Timestamp ) USING iceberg TBLPROPERTIES ( 'format-version' = '2', 'write.distribution-mode' = 'none' ) PARTITIONED BY (partitionKey); + And the data is sorted on the partitionKey column + And it has 20 rows over 5 data files when writing with 4 executor threads + When we add another 20 rows of the same data that is logically distributed over 5 partitions + And the data is sorted on the partitionKey column + Then there are now 5 more data files + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +