High-impact open-source AI research
SPOC is a vision-language agent that follows language instructions to navigate, explore, and manipulate objects in real and virtual environments using only RGB input.
Learn MorePoliFormer is an RGB-based indoor navigation agent that supports long-term memory and reasoning for tasks like object navigation, tracking, and open-vocabulary exploration.
Learn MoreUnified-IO 2 is a single unified transformer that processes images, text, audio, and more, enabling seamless multimodal understanding and generation through a unified architecture.
Learn MoreAll-in-one diffusion model for text-to-image generation, conditional generation, image understanding, ID customzation and multi-view generation.
Learn MoreSatlas is an AI-powered platform that provides quarterly global updates on marine infrastructure, renewable energy sites, and tree cover using satellite imagery.
Learn MoreGalileo is a family of generalist transformer-based satellite imagery models for remote sensing tasks like crop mapping, marine debree monitoring, and flood detection.
Learn More