Skip to content

On-cluster requests to Iceberg table don't prune files. #772

@ianton-ru

Description

@ianton-ru

Describe the bug
Request from single host loads only required data

Image

┌─s3_events───────────────────────────────────────┐
 1. │ ('IcebergMetadataFilesCacheHits',46)            │
 2. │ ('IcebergMetadataFilesCacheMisses',30)          │
 3. │ ('IcebergMetadataFilesCacheWeightLost',1181784) │
 4. │ ('S3Clients',1)                                 │
 5. │ ('S3GetObject',30)                              │
 6. │ ('S3HeadObject',30)                             │
 7. │ ('S3ListObjects',6)                             │
 8. │ ('S3ReadMicroseconds',1916304)                  │
 9. │ ('S3ReadRequestsCount',69)                      │
10. │ ('S3WriteMicroseconds',358)                     │
11. │ ('S3WriteRequestsCount',1)                      │
    └─────────────────────────────────────────────────┘

when request with swarm cluster loads much more

Image

┌─s3_events─────────────────────────────────────┐
 1. │ ('IcebergMetadataFilesCacheHits',92)          │
 2. │ ('IcebergMetadataFilesCacheMisses',60)        │
 3. │ ('ParquetFetchWaitTimeMicroseconds',36863709) │
 4. │ ('ParquetMetaDataCacheHits',8786)             │
 5. │ ('ParquetMetaDataCacheMisses',1034)           │
 6. │ ('S3Clients',2)                               │
 7. │ ('S3GetObject',8507)                          │
 8. │ ('S3HeadObject',9880)                         │
 9. │ ('S3ListObjects',12)                          │
10. │ ('S3ReadMicroseconds',436913251)              │
11. │ ('S3ReadRequestsCount',18405)                 │
12. │ ('S3WriteMicroseconds',1143)                  │
13. │ ('S3WriteRequestsCount',2)                    │
    └───────────────────────────────────────────────┘

Expected behavior
Cluster request loads the same size of data as local request.

Key information
Provide relevant runtime details.

  • Project Antalya Build Version 25.2.2.27772.altinityantalya

Additional context
Add any other context about the problem here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions