Moondream logo

Guide AI agents
with Moondream

Build agents that can see, understand, and interact with any interface. Using Moondream's pointing and detection skills, your agents can click buttons, fill forms, and automate workflows.

Agents

Everything you need for computer use automation

Moondream's pointing vision AI gives your agents the precise visual understanding needed to interact with any interface, automate workflows, and perform comprehensive testing.

Precision Pointing

Agents can accurately identify and point to specific UI elements, buttons, and interactive components on any screen or interface.

Interface Understanding

Advanced visual comprehension of user interfaces, layouts, and design patterns across web, desktop, and mobile applications.

Workflow Automation

Automate complex multi-step processes by teaching agents to navigate through applications and perform tasks autonomously.

Automated Testing

Build comprehensive testing agents that can interact with your applications, validate functionality, and catch bugs automatically.

Real-time Interaction

Process visual interface changes instantly and respond with appropriate actions, enabling seamless real-time automation.

Visual Navigation

Navigate through complex interfaces by understanding visual cues, contextual information, and spatial relationships between elements.

Get Started

Get Running in Minutes.

Moondream is open source and you can install and run it anywhere, for free. You can have it running on your computer or in our cloud in a matter of minutes.

Run It Yourself
  • Moondream Station is free
  • Works with our Python and Node clients
  • Works offline, fully under your control
  • CPU or GPU compatible
Moondream Station
Run in the Cloud
  • Spin up instantly—no downloads or DevOps
  • $5 in free monthly credits, no card required
  • Predictable pay-as-you-go pricing
  • 2 RPS on free tier, scales to 10 RPS or more with paid credits
Moondream Cloud
Pricing

Estimate your costs

Adjust the toggles for each workload to see how Moondream Cloud pricing scales for your use case.

Video Processing
Moondream processes video by analyzing one frame at a time.
1
126
Estimated cost
Based on the sliders you choose
$
0123456789
.
0123456789
0123456789
/ hour

60 inference requests / hour

Input tokens / hour48K
Output tokens / hour3K

Estimated pricing for planning purposes only. Review Moondream Cloud pricing at moondream.ai/pricing.