Lens, train Moondreamto master logo detection.anything.
Production-ready vision AI in minutes. We handle the infrastructure so you can focus on what matters.
Perfection in 3 steps
What is fine-tuning?
Moondream starts as a general model trained on broad, public information. Fine-tuning makes it great at one specific task by teaching it the products, documents, categories, or internal information that matter to your business.
Who is this for?
This is for teams putting vision AI into production. If you already know the task and need the model to master that job, fine-tuning is how you get there. It is built for teams that need frontier performance at real-time speed.
Example fine-tunes

Player with Ball Detection
Detect the player holding the basketball in NBA broadcast footage. After fine-tuning, F1 jumps from 0.28 to 0.79 and false positives drop from 61 to 2.
F128.3%→78.8%

State Farm Logo Detection
Detect State Farm logos in NBA broadcast frames. The fine-tuned model reaches perfect F1 of 1.0 with zero false positives and zero missed logos, up from 0.38 on the base model.
F138.5%→100%

GeoGuessr Countries
Predict the country from a single street-view image. SFT fine-tuning takes accuracy from 28.6% to 71.1%, beating GPT-5.4 at 69.8%. Trained on 25 images per country.
Accuracy28.6%→71.1%

Rock Paper Scissors
Classify hand gestures as rock, paper, or scissors. RL fine-tuning with only 5 training examples per class reaches 98.8% accuracy from a 54.8% baseline. GPT-5.4 gets 100%.
Accuracy54.8%→98.8%

Glaucoma Detection
Classify retinal images by glaucoma stage (normal, early, advanced). RL fine-tuning takes accuracy from 17.6% to 69.2%, more than double GPT-5.4 at 33.2%.
Accuracy17.6%→69.2%

Video Action Description
Given a 3x3 grid of frames taken from a video, describe the action. SFT fine-tuning raises description accuracy from 54% to 74%.
Accuracy0.5413→0.7391
Frequently asked questions
Ready to take Moondream to production?
Need help? We'll build it for you.
We can help define the task, prepare the data, run training, validate results, and hand off a model your team can use.