I work on Embodied AI and Cognitive Robotics, building AI systems that connect perception, language, and decision-making in real-world agents. My research develops robust, scalable approaches to intelligent scene understanding with vision-language models (VLMs) and to decentralized multi-agent learning and control, enabling reliable action in complex environments.
Python
PyTorch
Habitat-Sim
ROS 2
A framework that aligns remote sensing imagery with ground-level visual priors to improve robotic search efficiency using test-time adaptation.
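As a rough illustration of the test-time adaptation step, the sketch below minimizes prediction entropy on unlabeled test imagery, in the spirit of TENT; treating the alignment model as a generic classifier and adapting only normalization parameters are assumptions for illustration, not this project's exact recipe.

```python
import torch.nn.functional as F

def tta_step(model, batch, optimizer):
    """One test-time adaptation step on an unlabeled batch: minimize the
    entropy of the model's predictions. Only the parameters handed to
    `optimizer` (e.g. normalization layers) get updated."""
    logits = model(batch)                     # (B, num_classes)
    probs = F.softmax(logits, dim=-1)
    entropy = -(probs * probs.clamp_min(1e-8).log()).sum(dim=-1).mean()
    optimizer.zero_grad()
    entropy.backward()
    optimizer.step()
    return entropy.item()
```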
Research on generating 3D maps of complex indoor environments that violate the Manhattan-world assumption, using only a sparse LiDAR sensor.
This project develops a high-fidelity framework for embodied object navigation by leveraging incremental 3D Scene Graphs and foundation Vision-Language Models (VLMs). By moving beyond flat occupancy maps, the system builds a hierarchical semantic representation of the world that captures objects, rooms, and their relationships. It uses Knowledge-Augmented Generation (KAG) to predict structural and semantic properties of unobserved regions, enabling robots to perform complex, cross-modal search missions, specified by category, natural-language description, or visual exemplar, in completely unknown environments.
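A minimal sketch of such a hierarchy, with hypothetical `ObjectNode`/`RoomNode`/`SceneGraph` names; the VLM-derived room prior is stubbed as a plain dictionary rather than the system's actual knowledge source.

```python
from dataclasses import dataclass, field

@dataclass
class ObjectNode:
    label: str                        # e.g. "mug"
    position: tuple                   # (x, y, z) in the world frame
    room_id: int | None = None        # parent room, once known

@dataclass
class RoomNode:
    room_id: int
    category: str                     # e.g. "kitchen"
    objects: list = field(default_factory=list)
    neighbors: set = field(default_factory=set)   # adjacent room ids

class SceneGraph:
    """Object/room hierarchy grown incrementally from observations."""
    def __init__(self):
        self.rooms: dict[int, RoomNode] = {}

    def add_observation(self, obj: ObjectNode, room: RoomNode):
        node = self.rooms.setdefault(room.room_id, room)
        obj.room_id = node.room_id
        node.objects.append(obj)

    def candidate_rooms(self, target: str, prior: dict) -> list:
        # Rank rooms by a semantic prior, e.g. a VLM's estimate of
        # P(target object | room category), highest first.
        return sorted(self.rooms.values(),
                      key=lambda r: prior.get((target, r.category), 0.0),
                      reverse=True)
```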
This project explores how foundation Vision-Language Models (VLMs) and Large Language Models (LLMs) can reshape the cognitive architecture of intelligent navigation. We investigate a framework that pairs the zero-shot reasoning of internet-scale foundation models with a structured spatial-semantic memory. This enables embodied agents to perform complex, language-driven semantic search tasks, such as finding specific objects described in everyday natural language, by reasoning over environmental uncertainty and past observations in completely novel environments.
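A hedged sketch of the decision loop this suggests: `query_llm` stands in for any LLM backend, and the prompt format and memory layout are illustrative assumptions, not the framework's actual interface.

```python
def choose_next_frontier(goal: str, memory: list[dict], frontiers: list[dict],
                         query_llm) -> dict:
    """Ask an LLM to rank unexplored frontiers given past observations."""
    observed = "; ".join(f"{m['label']} in {m['room']}" for m in memory)
    options = "\n".join(f"{i}: near {f['context']}" for i, f in enumerate(frontiers))
    prompt = (
        f"Task: find '{goal}'.\n"
        f"Seen so far: {observed or 'nothing yet'}.\n"
        f"Unexplored frontiers:\n{options}\n"
        "Answer with the index of the most promising frontier."
    )
    reply = query_llm(prompt)                 # e.g. "0"
    return frontiers[int(reply.strip().split()[0])]

# Usage with a stub LLM that always picks frontier 0:
frontier = choose_next_frontier(
    "coffee mug",
    memory=[{"label": "sink", "room": "kitchen"}],
    frontiers=[{"context": "kitchen counter"}, {"context": "hallway"}],
    query_llm=lambda p: "0",
)
```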
This project develops a decentralized, learning-based framework for visibility-based pursuit-evasion in challenging outdoor environments. We focus on enabling teams of mobile agents to systematically clear contaminated spaces and capture adversarial evaders within high-density urban terrains. By integrating multi-agent reinforcement learning with advanced spatial reasoning, the system addresses the critical challenges of building-induced occlusions and limited sensor ranges, allowing for real-time coordinated maneuvers without the need for a central controller.
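The sketch below illustrates the decentralized structure on a toy grid world: each pursuer acts only on its own line-of-sight test and teammate messages. The greedy chase rule stands in for the learned policy, and the message format is hypothetical.

```python
import numpy as np

def visible(grid: np.ndarray, a: tuple, b: tuple) -> bool:
    """Line-of-sight check: sample cells between a and b, fail on buildings."""
    (r0, c0), (r1, c1) = a, b
    n = max(abs(r1 - r0), abs(c1 - c0))
    for t in np.linspace(0.0, 1.0, n + 1):
        r, c = int(round(r0 + t * (r1 - r0))), int(round(c0 + t * (c1 - c0)))
        if grid[r, c] == 1 and (r, c) not in (a, b):    # 1 = building cell
            return False
    return True

def local_step(grid: np.ndarray, me: tuple, evader: tuple, msgs: list) -> tuple:
    """One decision by one pursuer, using only local sensing and messages."""
    if visible(grid, me, evader):
        target = evader                        # direct pursuit
    elif msgs:
        target = msgs[-1]["evader_seen"]       # teammate's last reported sighting
    else:
        return me                              # no information: hold position
    step = (me[0] + int(np.sign(target[0] - me[0])),
            me[1] + int(np.sign(target[1] - me[1])))
    return step if grid[step] == 0 else me     # never step into a building
```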
This project introduces MARVEL (Multi-Agent Reinforcement Learning for Constrained Field-of-View Multi-Robot Exploration in Large-Scale Environments), a framework for high-performance, decentralized multi-robot coordination. By leveraging Graph Attention mechanisms, MARVEL enables robot teams to reason about teammate intent and spatial dependencies under restricted sensing constraints. Our approach uses information-theoretic action pruning to optimize coverage and mission efficiency, facilitating complex collaborative maneuvers in completely unknown topographies without a central controller.
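A compact PyTorch sketch of the two ingredients: single-head graph attention over robot/frontier nodes, and a top-k information-gain filter as one simple instance of action pruning. Dimensions and the pruning rule are illustrative, not MARVEL's published architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GraphAttention(nn.Module):
    """Single-head attention over a robot/frontier graph."""
    def __init__(self, dim: int):
        super().__init__()
        self.q, self.k, self.v = (nn.Linear(dim, dim),
                                  nn.Linear(dim, dim),
                                  nn.Linear(dim, dim))

    def forward(self, x: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # x: (N, dim) node features; adj: (N, N) adjacency with self-loops.
        scores = self.q(x) @ self.k(x).T / x.shape[-1] ** 0.5
        scores = scores.masked_fill(adj == 0, float("-inf"))
        return F.softmax(scores, dim=-1) @ self.v(x)   # attended node features

def prune_actions(candidates: list, info_gain: list, keep: int = 4) -> list:
    """Keep only the top-k candidate viewpoints by expected information
    gain (e.g. predicted reduction in map entropy)."""
    ranked = sorted(zip(info_gain, candidates), key=lambda p: p[0], reverse=True)
    return [c for _, c in ranked[:keep]]
```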
This project introduces STAR (Swarm Technology for Aerial Robotics), a modular, open-source infrastructure designed to bridge the gap between simulation and high-fidelity physical deployments. STAR integrates decentralized task allocation with robust vision-based landmark localization to manage fleets of nano-quadrotors (e.g., Crazyflies) in cluttered environments. The framework provides a high-throughput ROS 2-based communication layer and a hardware-in-the-loop (HIL) sim-to-real pipeline, enabling researchers to validate complex multi-agent algorithms, reactive obstacle avoidance, and swarm behaviors on physical robotic collectives.
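To make the communication layer concrete, here is a hedged sketch of an auction-style bidder each quadrotor might run. It uses the standard rclpy API, but the topic names, JSON message layout, and distance-based bid are assumptions, not STAR's actual allocation protocol.

```python
import json
import rclpy
from rclpy.node import Node
from std_msgs.msg import String

class BidderNode(Node):
    """Each robot bids on broadcast tasks; no central controller assigns them."""
    def __init__(self, robot_id: str, position: list):
        super().__init__(f"bidder_{robot_id}")
        self.robot_id, self.position = robot_id, position
        self.pub = self.create_publisher(String, "/swarm/bids", 10)
        self.create_subscription(String, "/swarm/tasks", self.on_task, 10)

    def on_task(self, msg: String):
        task = json.loads(msg.data)            # e.g. {"id": 3, "goal": [x, y, z]}
        # Bid = negative distance to the goal, so closer robots bid higher.
        cost = sum((p - g) ** 2 for p, g in zip(self.position, task["goal"])) ** 0.5
        bid = {"task": task["id"], "robot": self.robot_id, "bid": -cost}
        self.pub.publish(String(data=json.dumps(bid)))

def main():
    rclpy.init()
    rclpy.spin(BidderNode("cf1", position=[0.0, 0.0, 0.5]))

if __name__ == "__main__":
    main()
```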
This project develops a decentralized framework for context-aware navigation, enabling embodied agents to perform complex path-finding and search tasks in unknown environments without prior maps. By leveraging Graph Attention Networks to encode environmental context, the framework allows robots to reason about the global structure of a space from local observations. This enables a suite of navigation capabilities—from zero-shot exploration to adaptive prior-based path-finding—that outperform traditional geometric planners in both computational efficiency and success rate.
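A small sketch of the zero-shot exploration loop such a planner supports; `context_score` is a crude novelty heuristic standing in for the trained graph-attention policy, and the frontier format is hypothetical.

```python
import math

def travel_cost(a, b):
    return math.dist(a, b)

def context_score(frontier: dict, visited: list) -> float:
    # Stub for the learned score: prefer large frontiers far from
    # already-visited space (a crude novelty heuristic).
    novelty = min(travel_cost(frontier["pos"], v) for v in visited)
    return frontier["size"] + novelty

def next_waypoint(robot_pos, frontiers, visited, w: float = 0.5):
    """Pick the frontier that best trades off learned context against
    travel cost; with no map prior this degrades to zero-shot exploration."""
    return max(frontiers,
               key=lambda f: context_score(f, visited) - w * travel_cost(robot_pos, f["pos"]))

wp = next_waypoint(
    robot_pos=(0.0, 0.0),
    frontiers=[{"pos": (3.0, 1.0), "size": 2.5}, {"pos": (-1.0, 4.0), "size": 1.0}],
    visited=[(0.0, 0.0), (1.0, 1.0)],
)
```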