feat: decouple partition location from executor metadata by sandugood · Pull Request #1853 · apache/datafusion-ballista

sandugood · 2026-06-09T21:42:22Z

Which issue does this PR close?

Closes #1851.

Rationale for this change

In the current implementation there is a problem - in the PartitionLocation that is used in each shuffle operation (for each partition in the previous stage) there was executor_metadata field (which is of ExecutorMetadata type) filled with unnecessary info, because it did not add up any information that could be used by ShuffleReaderExec to extract partition (or i.e resume the execution from a failed stage)

What changes are included in this PR?

ExecutorMetadata was decoupled from PartitionLocation and now it is used as a separate struct for:

registering executor
in heartbeats
REST API info fetch

PartitionLocation is now exposed only to the executor_id, host and port, which is sufficient for fetching needed partitions by the ShuffleReaderExec
This way we can save up a lot of space and data-transfer during each shuffle operation (potentially removing possibility of Scheduler's OOM errors and improving speed of queries with lots of partitions)

Are there any user-facing changes?

Yes.
In the REST API interface ExecutorResponse doesnt contain nested struct with Executor's hardware and OS info. It was flattened.

Potential follow-up:

check the TUI REST API integration

milenkovicm

thanks for contribution @sandugood

i think this is good first step, but we need to go a bit further and deduplicate executor connectivity information, is there a need to have same peace of information in thousents of places.

when we serialise PartitionLocation can we serialise it in two strucutres, vector of partition locations and a hash map of executor id -> executor metadata. partition location could reference executor with executor id (which we can store as bytes)

For struct PartitionLocation, can we make share executor_meta behind Arc (pub executor_meta: Arc<ExecutorMetadata>)

#[derive(Debug, Clone)]
pub struct PartitionLocation {
    /// The source partition ID from the map stage.
    pub map_partition_id: usize,
    /// The partition identifier.
    pub partition_id: PartitionId,
    /// Metadata about the executor hosting this partition.
    pub executor_meta: Arc<ExecutorMetadata>,
    /// Statistics about the partition data.
    pub partition_stats: PartitionStats,
    /// shuffle file id
    pub file_id: Option<u64>,
    /// whether this partition uses sort shuffle
    pub is_sort_shuffle: bool,
}

so basically when we do ser/de we can deduplicate executor meta.

Also, I'm not sure if flattening ExecutorSpecification and the other structure makes more sense than making it optional

This pr will need some effort, but it will help a lot cases where there is many partitions
thanks a lot

sandugood · 2026-06-10T12:01:36Z

Thank you for your review and ideas @milenkovicm
Going to tackle it

…ixed comments

…decouple

sandugood · 2026-06-16T10:27:26Z

Refactored the code:

In the .proto spec now we are sending the PartitionLocation along with a map of ExecutorMetadata (used by ShuffleReaderPartition). So we can now transfer a Vec<> of locations and access the executor's metadata via its id.
On the native side - added ExecutorMetadata in the PartitionLocation behind an Arc<> to avoid heavy copying.
Made parts of the .proto spec in the ExecutorMetadata optional

sandugood changed the title ~~feat: decouple partition's location from executor's metadata~~ feat: decouple partition location from executor metadata Jun 9, 2026

milenkovicm reviewed Jun 10, 2026

View reviewed changes

Comment thread ballista/core/proto/ballista.proto Outdated

Comment thread ballista/core/proto/ballista.proto

sandugood marked this pull request as draft June 10, 2026 12:01

sandugood changed the title ~~feat: decouple partition location from executor metadata~~ [WIP] feat: decouple partition location from executor metadata Jun 10, 2026

sandugood added 2 commits June 16, 2026 12:46

Changed logic to deduplicate data into separate .proto declaration. F…

fc3cd1f

…ixed comments

Merge remote-tracking branch 'upstream/main' into feat/executor-info-…

7fc0cea

…decouple

sandugood force-pushed the feat/executor-info-decouple branch from e5ac776 to 7fc0cea Compare June 16, 2026 09:54

sandugood added 2 commits June 16, 2026 12:59

Gracefully handle optional fields in ExecutorMetadata

43a9048

Fixed lint

63d76d4

sandugood marked this pull request as ready for review June 16, 2026 10:27

sandugood changed the title ~~[WIP] feat: decouple partition location from executor metadata~~ feat: decouple partition location from executor metadata Jun 16, 2026

Fixed naming in tests

00afec3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: decouple partition location from executor metadata#1853

feat: decouple partition location from executor metadata#1853
sandugood wants to merge 5 commits into
apache:mainfrom
sandugood:feat/executor-info-decouple

sandugood commented Jun 9, 2026 •

edited

Loading

Uh oh!

milenkovicm left a comment

Uh oh!

Uh oh!

Uh oh!

sandugood commented Jun 10, 2026

Uh oh!

sandugood commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sandugood commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

Potential follow-up:

Uh oh!

milenkovicm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sandugood commented Jun 10, 2026

Uh oh!

sandugood commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sandugood commented Jun 9, 2026 •

edited

Loading