Blog

Announcement
June 8, 2026
Photon is now free
Photon 1.3.0 makes Moondream faster across NVIDIA, Mac, and Windows, runs finetunes on far more hardware, and fixes an accuracy issue on older GPUs. Running Moondream locally is now completely free.
Engineering
June 4, 2026
Popping the GPU Bubble
Photon, Moondream's inference engine, achieves near-realtime VLM inference (~33 ms on NVIDIA B200). This post is a peek into how it delivers up to 35% higher decode throughput by optimizing how the GPU works.

Announcement
May 1, 2026
Photon 1.2.0: Faster Inference, Now on Mac, Windows, Blackwell, and Jetson Thor
Photon 1.2.0 brings native inference to Apple Silicon and Windows, adds NVIDIA Blackwell and Jetson Thor support, and ships meaningful speed gains across existing GPUs.

Announcement
April 20, 2026
Lens: Moondream's Finetune Service
Solve the last-mile problem with Lens, our fine-tuning product that makes VLMs production-ready.

Announcement
March 25, 2026
Photon: Real-Time VLM Is Here
Photon brings real-time Moondream inference to production vision AI, from edge devices to H100-class servers.

Model Release
March 10, 2026
Moondream Segmenting Update: Better Masks, Better Benchmarks, 40% Faster
Moondream Cloud segmenting now delivers stronger benchmark scores, improved mask quality, and 40% faster inference.

Announcement
December 19, 2025
We added Moondream 3 Preview support to Moondream Station
Mac users can now run Moondream 3 Preview in Station with MLX-native, quantized performance.

Announcement
October 17, 2025
Announcing Moondream Cloud
Fast, cheap, smart. Pick three.

Release
September 23, 2025
Moondream Station 2: Simpler, more features, and Windows!
Moondream Station 2 is a one-line installer for running Moondream locally.