[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
-
Updated
Apr 30, 2026
[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
This repository provides the source code for the paper Reinforcement Learning for Active Perception in Autonomous Navigation.
A Python library for Robotic Information Gathering
Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268
Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment
The code release of "Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning" paper, ICAART 2021
An active vision system on the PR2 humanoid robot to dynamically detect objects via the head and arm cameras
Learning Underwater Active Perception in Simulation
Official code release for our paper 'APPLE: Toward General Active Perception via Reinforcement Learning'.
🤖 Discover the latest research papers on ArXiv tailored to your interests using AI-powered analysis and smart search features.
The webpage for Matcha-agent project.
Minimal Python robotics examples for planning, manipulation, active perception, and embodied AI. No ROS, GPU, or heavy simulator.
Resource-aware multimodal scene understanding with view selection for efficient captioning and QA.
NeRF-SLAM-oriented neural mapping lab for camera rays, sigma-density learning, opacity/depth rendering, and robotics perception experiments.
Small program that let's you sample from the MNIST handwritten digits dataset, but with random rotation, scaling and translation transformations applied.
Active perception system for target localization on a mobile robot— maintains a Bayesian belief map over object location, plans viewpoints by mutual information maximization, and executes via behaviour tree in ROS 2.
GitHub profile for Muhammed Elyamani: structure-aware planning, active sensing, mobile manipulation, SLAM, and robotics control.
Public organization repo for line-scan-aware active scanning and coverage planning with a mobile manipulator.
Public research-reading and architecture scaffold for active perception, coverage planning, mobile manipulation, estimation, and control.
Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation (InfoMagNav)
Add a description, image, and links to the active-perception topic page so that developers can more easily learn about it.
To associate your repository with the active-perception topic, visit your repo's landing page and select "manage topics."