Heterogeneous Robot Exploration with CBS Task Allocation

Overview

This course project extends a ground-robot exploration framework to a heterogeneous UAV–UGV team.

We introduce a region-level Conflict-Based Search (CBS) allocator that dynamically reassigns subregions based on real-time information gain and robot status. Each robot plans globally with PRM–TSP to visit assigned regions efficiently and locally with A* for collision-free motion.

CMU 16891 Multi-robot Planning and Coordination - Course Project, Feb.–Apr. 2025


System at a Glance

  • Shared global map + IG grid computed from each robot’s occupancy map.
  • CBS task planner partitions the space into subregions and assigns them to robots with soft/hard constraints to avoid duplicate coverage.
  • Per-robot planners: PRM–TSP for visiting sequence across assigned regions; A* for local paths.
  • Dual-mode execution: robots explore independently; CBS triggers replans when regions finish or a robot stalls, enabling adaptive collaboration.

Pipeline
Exploration → IG grid → CBS Region Assignment → Per-robot PRM–TSP → Local A* → Map/Status update → (Re)plan as needed.

system overview

Figure 1 – System Pipeline


Methods

CBS Region Allocator

The CBS allocator divides the environment into fixed-size subregions and assigns them to robots based on global/local information gain, supporting soft/hard constraints for stalled or blocked areas.

It receives the IG grid from the exploration module, determines subregion assignments, and sends results back to each robot.

CBS Planner
Figure 2 – Interaction between Exploration and CBS Planner
CBS Region Assignment
Figure 3 – Subregion Region Assignment Example

Planning Modules

  • Global Planning (PRM–TSP) builds a roadmap and finds an optimal visiting sequence among assigned subregions.
  • Local Planning (A*) generates collision-free motion within each subregion using the ESDF map; viewpoints are ranked by gain–cost ratio.
  • Mapping merges each robot’s local occupancy grid into a global representation for shared decision-making.

Simulation Setup

We created a 40×40 m office-like Isaac Sim environment with walls, furniture, and narrow corridors to test UAV-only, UGV-only, and cooperative UAV–UGV exploration.
Regions that are only accessible to UAVs are highlighted for testing heterogeneous task division.

labeled scene
Figure 4 – Environment Labeling: UAV-accessible (pink) and General Regions (green)
simulation environment
Figure 5 – Isaac Sim Environment Setup for UAV–UGV Experiments

Results

collaborative exploration

Figure 6 – Collaborative UAV–UGV Exploration with Dynamic CBS Region Assignment

Mapping Completeness & Efficiency

ScenarioPath Length (m)Exploration Time (s)Mapping Completeness
UGV-only373.88283.99Incomplete
UAV-only1003.88895.99Complete
UAV–UGV (CBS)670.29533.61Complete

UGV-only exploration UAV-only exploration

Figure 7 – Comparison of UGV-only (left) and UAV-only (right) Exploration Runs

Takeaways
The collaborative CBS exploration achieves full coverage with ~33% shorter total path and ~40% lower total time compared to UAV-only runs, while avoiding redundant overlap via region exclusivity.
The UGV covers lower corridors and tight areas, while the UAV handles high or aerial-only subregions, achieving efficient complementary exploration.


My Contribution

  • Extended the ground-robot exploration stack to a UAV–UGV framework in Isaac Sim.
  • Implemented the CBS allocator with soft/hard constraints and IG-driven dynamic reassignment.
  • Integrated the PRM–TSP global planner and A* local planner for each robot.
  • Created simulated environments and visualized results for quantitative comparison.

Future work: deploy on real robots, extend to more agents, and integrate learning-based viewpoint sampling for adaptive frontier discovery.