Task Constellation — Overview

Example of a Task Constellation illustrating both sequential and parallel dependencies

🌌 Introduction

The Task Constellation is the central abstraction in Galaxy that captures the concurrent and asynchronous structure of distributed task execution. It provides a formal, directed acyclic graph (DAG) representation of complex workflows, enabling consistent scheduling, fault-tolerant orchestration, and runtime dynamism across heterogeneous devices.

At its core, a Task Constellation decomposes complex user requests into interdependent subtasks connected through explicit dependency edges. This formalism not only enables correct distributed execution but also supports runtime adaptation—allowing new tasks or dependencies to be introduced as the workflow evolves.

For information on how Task Constellations are orchestrated and scheduled, see the Constellation Orchestrator documentation. To understand how agents interact with constellations, refer to the Constellation Agent guide.

🎯 Core Components

The Task Constellation framework consists of four primary components:

Component	Purpose	Key Features
TaskStar	Atomic execution unit	Self-contained task with description, device assignment, execution state, dependencies
TaskStarLine	Dependency relationship	Directed edge with conditional logic, success-only, completion-only, or unconditional execution
TaskConstellation	DAG orchestrator	Complete workflow graph with validation, scheduling, and dynamic modification
ConstellationEditor	Interactive editor	Command pattern-based interface with undo/redo for safe constellation manipulation

📐 Formal Model

Mathematical Foundation

A Task Constellation \(\mathcal{C}\) is formally defined as a directed acyclic graph (DAG):

\[ \mathcal{C} = (\mathcal{T}, \mathcal{E}) \]

where:

- \(\mathcal{T}\) is the set of all TaskStars (task nodes)
- \(\mathcal{E}\) is the set of TaskStarLines (dependency edges)

TaskStar Representation

Each TaskStar \(t_i \in \mathcal{T}\) encapsulates a complete task specification:

\[ t_i = (\text{name}_i, \text{description}_i, \text{target\_device\_id}_i, \text{tips}_i, \text{status}_i, \text{dependencies}_i) \]

Components:

- name: Short name for the task
- description: Natural-language specification sent to the device agent
- target_device_id: ID of the device agent responsible for execution
- tips: List of guidance hints to help the device agent complete the task
- status: Current execution state (pending, running, completed, failed, cancelled, waiting_dependency)
- dependencies: Set of prerequisite task IDs that must complete first

TaskStarLine Representation

Each TaskStarLine \(e_{i \rightarrow j} \in \mathcal{E}\) represents a dependency from task \(t_i\) to task \(t_j\).

Dependency Types:

Type	Behavior
Unconditional	\(t_j\) always waits for \(t_i\) to complete
Success-only	\(t_j\) proceeds only if \(t_i\) succeeds
Completion-only	\(t_j\) proceeds when \(t_i\) completes (regardless of success/failure)
Conditional	\(t_j\) proceeds based on a user-defined or runtime condition
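
The gating behavior in the table can be sketched as a small predicate. This is an illustrative sketch, not Galaxy's implementation: `DepType` and `is_satisfied` are hypothetical names, and the status strings are taken from the TaskStar status list above. Because the table describes Unconditional and Completion-only in nearly identical terms, the sketch assumes both are satisfied once the upstream task has either completed or failed; the real evaluation may differ.

```python
from enum import Enum
from typing import Callable, Optional


class DepType(Enum):
    """Hypothetical enum mirroring the four dependency types in the table."""
    UNCONDITIONAL = "unconditional"
    SUCCESS_ONLY = "success_only"
    COMPLETION_ONLY = "completion_only"
    CONDITIONAL = "conditional"


def is_satisfied(
    dep_type: DepType,
    upstream_status: str,
    condition: Optional[Callable[[str], bool]] = None,
) -> bool:
    """Return True if the downstream task may proceed past this edge."""
    finished = upstream_status in ("completed", "failed")
    if dep_type is DepType.SUCCESS_ONLY:
        return upstream_status == "completed"
    if dep_type is DepType.CONDITIONAL:
        # Assumption: a conditional edge without a predicate never fires.
        return finished and condition is not None and condition(upstream_status)
    # Assumption: UNCONDITIONAL and COMPLETION_ONLY both accept any finished upstream.
    return finished


def run_on_failure(status: str) -> bool:
    """Example predicate: proceed only when the upstream task failed."""
    return status == "failed"


print(is_satisfied(DepType.SUCCESS_ONLY, "failed"))                 # False
print(is_satisfied(DepType.COMPLETION_ONLY, "failed"))              # True
print(is_satisfied(DepType.CONDITIONAL, "failed", run_on_failure))  # True
print(is_satisfied(DepType.UNCONDITIONAL, "running"))               # False
```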

✨ Key Advantages

1. Explicit Task Ordering

Task dependencies are explicitly captured in the DAG structure, ensuring correctness across distributed execution without ambiguity.

2. Natural Parallelism

The DAG topology naturally exposes parallelizable tasks, enabling efficient concurrent execution across heterogeneous devices.

3. Runtime Dynamism

Unlike static DAG schedulers, Task Constellations are mutable objects. Tasks and dependency edges can be:

- Added: Introduce new subtasks or diagnostic tasks
- Removed: Prune completed or redundant nodes
- Modified: Rewire dependencies, update conditions, change device assignments

This enables adaptive execution without restarting the entire workflow.

4. Formal Guarantees

The DAG representation provides formal properties:

- Acyclicity: No circular dependencies
- Causal consistency: Execution respects logical ordering
- Safe concurrency: Parallel execution without race conditions

🔄 Lifecycle States

The Task Constellation progresses through several states during its lifecycle:

stateDiagram-v2
    [*] --> CREATED: Initialize
    CREATED --> READY: Add tasks & dependencies
    READY --> EXECUTING: Start execution
    EXECUTING --> EXECUTING: Tasks running
    EXECUTING --> COMPLETED: All tasks succeed
    EXECUTING --> FAILED: All tasks fail
    EXECUTING --> PARTIALLY_FAILED: Some succeed, some fail
    COMPLETED --> [*]
    FAILED --> [*]
    PARTIALLY_FAILED --> [*]

State	Description
CREATED	Constellation initialized, no tasks added
READY	Tasks and dependencies configured, ready to execute
EXECUTING	At least one task is running or completed
COMPLETED	All tasks completed successfully
FAILED	All tasks failed
PARTIALLY_FAILED	Some tasks succeeded, some failed
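
One way to read the state table is as a function from the individual task statuses to a constellation state. The sketch below is a hedged illustration of that reading: `derive_state` is a hypothetical helper, not part of Galaxy. It assumes that a constellation with any running or finished task, but with unfinished tasks remaining, counts as EXECUTING, and it does not model cancelled tasks because the table does not cover them.

```python
from typing import Iterable


def derive_state(task_statuses: Iterable[str]) -> str:
    """Map per-task statuses to a constellation state, following the table above."""
    statuses = list(task_statuses)
    if not statuses:
        return "CREATED"  # initialized, no tasks added

    succeeded = sum(s == "completed" for s in statuses)
    failed = sum(s == "failed" for s in statuses)
    running = sum(s == "running" for s in statuses)

    if succeeded + failed == len(statuses):
        # Every task has finished: classify by the mix of outcomes.
        if failed == 0:
            return "COMPLETED"
        if succeeded == 0:
            return "FAILED"
        return "PARTIALLY_FAILED"
    if running or succeeded or failed:
        # Assumption: any progress with work still outstanding means EXECUTING.
        return "EXECUTING"
    return "READY"  # tasks configured, nothing started yet


print(derive_state([]))                                 # CREATED
print(derive_state(["pending", "waiting_dependency"]))  # READY
print(derive_state(["completed", "running"]))           # EXECUTING
print(derive_state(["completed", "failed"]))            # PARTIALLY_FAILED
```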

📊 DAG Metrics

Parallelism Analysis

The Task Constellation provides several metrics to analyze workflow parallelism:

Critical Path Length (\(L\))

The longest serial dependency chain in the constellation:

\[ L = \max_{p \in \text{paths}} |p| \]

where \(|p|\) is the length of path \(p\) from any root to any leaf node.

Total Work (\(W\))

Sum of all task execution durations:

\[ W = \sum_{t_i \in \mathcal{T}} \text{duration}(t_i) \]

Parallelism Ratio (\(P\))

Measure of achievable parallelism:

\[ P = \frac{W}{L} \]

\(P = 1\): Completely serial execution
\(P > 1\): Parallel execution possible
Higher \(P\) indicates more parallelism

Maximum Width

Maximum number of tasks that can execute concurrently:

\[ \text{MaxWidth} = \max_{\text{level}} |\text{tasks at level}| \]

Calculation Modes

The constellation supports two calculation modes:

Node Count Mode: Uses task counts when execution is incomplete
Actual Time Mode: Uses real execution durations when all tasks are terminal
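
To make the definitions concrete, the following sketch computes the four metrics in node count mode, where every task counts as one unit of work. It is an assumption-laden illustration rather than Galaxy's code: `dag_metrics` is a hypothetical function, the critical path length is counted in tasks (not edges), and a task's level is taken to be the number of tasks on the longest path ending at it.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, List, Tuple


def dag_metrics(nodes: Iterable[str], edges: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Compute L, W, P and MaxWidth in node count mode (each task has duration 1)."""
    node_list = list(nodes)
    successors: Dict[str, List[str]] = defaultdict(list)
    indegree = {n: 0 for n in node_list}
    for src, dst in edges:
        successors[src].append(dst)
        indegree[dst] += 1

    # level[n] = number of tasks on the longest path ending at n (roots are level 1).
    level = {n: 1 for n in node_list}
    queue = deque(n for n in node_list if indegree[n] == 0)
    visited = 0
    while queue:
        node = queue.popleft()
        visited += 1
        for nxt in successors[node]:
            level[nxt] = max(level[nxt], level[node] + 1)
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                queue.append(nxt)
    if visited != len(node_list):
        raise ValueError("graph contains a cycle")

    critical_path_length = max(level.values(), default=0)  # L
    total_work = len(node_list)                             # W
    width: Dict[int, int] = defaultdict(int)
    for lvl in level.values():
        width[lvl] += 1
    return {
        "critical_path_length": critical_path_length,
        "total_work": total_work,
        "parallelism_ratio": total_work / critical_path_length if critical_path_length else 0.0,
        "max_width": max(width.values(), default=0),
    }


# Fan-out example: A feeds three independent tasks B, C and D.
fan_out = dag_metrics(["A", "B", "C", "D"], [("A", "B"), ("A", "C"), ("A", "D")])
print(fan_out)
# {'critical_path_length': 2, 'total_work': 4, 'parallelism_ratio': 2.0, 'max_width': 3}
```

In actual time mode the same traversal would accumulate measured durations instead of unit weights, so \(W\) and \(L\) become sums of execution times.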

🛠️ Core Operations

DAG Construction

from galaxy.constellation import TaskConstellation, TaskStar, TaskStarLine

# Create constellation
constellation = TaskConstellation(name="my_workflow")

# Add tasks
task_a = TaskStar(name="task_a", description="Checkout code on laptop")
task_b = TaskStar(name="task_b", description="Build on GPU server")
task_c = TaskStar(name="task_c", description="Deploy to staging")

constellation.add_task(task_a)
constellation.add_task(task_b)
constellation.add_task(task_c)

# Add dependencies
dep_ab = TaskStarLine.create_success_only(
    from_task_id=task_a.task_id,
    to_task_id=task_b.task_id,
    description="Build depends on successful checkout"
)

dep_bc = TaskStarLine.create_unconditional(
    from_task_id=task_b.task_id,
    to_task_id=task_c.task_id,
    description="Deploy after build"
)

constellation.add_dependency(dep_ab)
constellation.add_dependency(dep_bc)

DAG Validation

# Validate structure
is_valid, errors = constellation.validate_dag()
if not is_valid:
    print(f"Validation errors: {errors}")

# Check for cycles
has_cycles = constellation.has_cycle()

# Get topological order
order = constellation.get_topological_order()
print(f"Execution order: {order}")

Parallelism Analysis

# Get parallelism metrics
metrics = constellation.get_parallelism_metrics()

print(f"Critical Path Length: {metrics['critical_path_length']}")
print(f"Total Work: {metrics['total_work']}")
print(f"Parallelism Ratio: {metrics['parallelism_ratio']}")
print(f"Critical Path: {metrics['critical_path_tasks']}")

# Get maximum width
max_width = constellation.get_max_width()
print(f"Maximum concurrent tasks: {max_width}")

🔧 Dynamic Modification

Safe Editing with ConstellationEditor

from galaxy.constellation.editor import ConstellationEditor

# Create editor with undo/redo support
editor = ConstellationEditor(constellation)

# Add a new diagnostic task
diagnostic_task = editor.create_and_add_task(
    task_id="diag_1",
    description="Check server health",
    name="Server Health Check"
)

# Add conditional dependency
editor.create_and_add_dependency(
    from_task_id=task_b.task_id,
    to_task_id=diagnostic_task.task_id,
    dependency_type="CONDITIONAL",
    condition_description="Run diagnostic if build fails"
)

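# Undo the most recent edit if needed
# (something_wrong is a placeholder for your own check)
something_wrong = False
if something_wrong:
    editor.undo()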

# Get modifiable components
modifiable_tasks = constellation.get_modifiable_tasks()
modifiable_deps = constellation.get_modifiable_dependencies()

Modification Safety

Tasks and dependencies can be modified only while the affected tasks are in PENDING or WAITING_DEPENDENCY status. To preserve execution consistency, running or completed tasks cannot be modified.
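
The rule amounts to a status check before any edit is applied. The snippet below is a minimal sketch of that check using plain dictionaries; `MODIFIABLE_STATUSES` and `is_modifiable` are hypothetical names, and Galaxy's own `get_modifiable_tasks()` may apply additional criteria.

```python
# Statuses in which a task may still be edited, per the rule above.
MODIFIABLE_STATUSES = {"pending", "waiting_dependency"}


def is_modifiable(status: str) -> bool:
    """Return True if a task in this status may still be edited."""
    return status.lower() in MODIFIABLE_STATUSES


# Hypothetical snapshot of task statuses during execution.
statuses = {
    "task_a": "completed",
    "task_b": "running",
    "task_c": "pending",
    "diag_1": "waiting_dependency",
}
modifiable = [task_id for task_id, status in statuses.items() if is_modifiable(status)]
print(modifiable)  # ['task_c', 'diag_1']
```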

📈 Example Workflows

Sequential Workflow

graph LR
    A[Task A] --> B[Task B]
    B --> C[Task C]

Parallelism Ratio: 1.0 (completely serial)
Maximum Width: 1

Parallel Workflow

graph LR
    A[Task A] --> B[Task B]
    A --> C[Task C]
    B --> D[Task D]
    C --> D

Parallelism Ratio: ~1.33 in node count mode (W = 4 tasks, L = 3 along A → B → D; B and C can run in parallel)
Maximum Width: 2

Complex Workflow

graph LR
    A[Task A] --> B[Task B]
    A --> C[Task C]
    B --> D[Task D]
    C --> E[Task E]
    D --> F[Task F]
    E --> F

Parallelism Ratio: 1.5 in node count mode (W = 6 tasks, L = 4 along A → B → D → F)
Maximum Width: 2 (B and C can run concurrently after A completes, as can D and E)

🎨 Visualization

The Task Constellation provides multiple visualization modes for monitoring and debugging:

Overview Mode

High-level constellation structure with task counts and state

Topology Mode

DAG graph showing task relationships and dependencies

Details Mode

Detailed task information including execution times and status

Execution Mode

Real-time execution flow with progress tracking

# Display constellation
constellation.display_dag(mode="overview")  # or "topology", "details", "execution"

For interactive web-based visualization, check out the Galaxy WebUI.

📚 Component Documentation

Explore detailed documentation for each component:

TaskStar — Atomic execution units representing individual tasks in the constellation
TaskStarLine — Dependency relationships connecting tasks with conditional logic
TaskConstellation — Complete DAG orchestrator managing workflow execution and coordination
ConstellationEditor — Interactive editor with command pattern and undo/redo capabilities

Related Documentation

Constellation Orchestrator — Learn how constellations are scheduled and executed across devices
Constellation Agent — Understand how agents plan and manage constellation lifecycles
Evaluation & Metrics — Monitor constellation performance and analyze execution patterns

🔬 Research Background

The Task Constellation model is grounded in formal DAG theory and distributed systems research. Key properties include:

Acyclicity guarantees through Kahn's algorithm for topological sorting
Topological ordering for consistent execution
Critical path analysis for performance optimization
Dynamic graph evolution without compromising consistency
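
As an illustration of the first two properties, Kahn's algorithm can be sketched in a few lines: it repeatedly emits tasks with no unmet prerequisites, and if some tasks are never emitted the graph must contain a cycle. The function below is a generic textbook version on plain task IDs, with the hypothetical name `topological_order`; it is not a copy of Galaxy's `has_cycle()` or `get_topological_order()`.

```python
from collections import deque
from typing import Dict, Iterable, List, Optional, Tuple


def topological_order(
    nodes: Iterable[str], edges: Iterable[Tuple[str, str]]
) -> Optional[List[str]]:
    """Kahn's algorithm: return a topological order, or None if a cycle exists."""
    node_list = list(nodes)
    successors: Dict[str, List[str]] = {n: [] for n in node_list}
    indegree = {n: 0 for n in node_list}
    for src, dst in edges:
        successors[src].append(dst)
        indegree[dst] += 1

    # Start with tasks that have no prerequisites.
    ready = deque(n for n in node_list if indegree[n] == 0)
    order: List[str] = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for nxt in successors[node]:
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                ready.append(nxt)

    # Tasks left with unmet prerequisites can only be part of a cycle.
    return order if len(order) == len(node_list) else None


diamond = [("A", "B"), ("A", "C"), ("B", "D"), ("C", "D")]
print(topological_order("ABCD", diamond))                  # ['A', 'B', 'C', 'D']
print(topological_order("AB", [("A", "B"), ("B", "A")]))   # None
```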

For more on Galaxy's architecture and design principles, see the Galaxy Overview.

💡 Best Practices

Designing Effective Constellations

Keep tasks atomic: Each TaskStar should represent a single, well-defined operation
Minimize dependencies: Reduce unnecessary dependencies to maximize parallelism
Use appropriate dependency types: Choose conditional dependencies for error handling
Validate early: Run validate_dag() before execution
Monitor metrics: Track parallelism ratio to optimize workflow design

Common Patterns:

Fan-out: One task spawns multiple independent parallel tasks
Fan-in: Multiple parallel tasks converge to a single task
Pipeline: Sequential stages with parallel tasks within each stage
Conditional branching: Use conditional dependencies for error handling paths

🚀 Next Steps

Learn about TaskStar — Atomic task execution units
Explore TaskStarLine — Dependency relationships
Master TaskConstellation — DAG orchestration
Try ConstellationEditor — Interactive editing
