High-impact open-source AI research
Molmo 2 is a family of state-of-the-art open video-language models.
SPOC is a vision-language agent that follows language instructions to navigate, explore, and manipulate objects in real and virtual environments using only RGB input.
PoliFormer is an RGB-based indoor navigation agent that supports long-term memory and reasoning for tasks like object navigation, tracking, and open-vocabulary exploration.
Molmo is a family of open state-of-the-art multimodal AI models that can understand images and point to what they perceive, enabling rich interactions with physical and virtual worlds.
An all-in-one diffusion model for text-to-image generation, conditional generation, image understanding, ID customization, and multi-view generation.
OlmoEarth is a spatio-temporal, multimodal foundation model for Earth observation, achieving state-of-the-art performance and powering an end-to-end platform for non-profits and NGOs.
Satlas is an AI-powered platform that provides quarterly global updates on marine infrastructure, renewable energy sites, and tree cover using satellite imagery.