Moondream CloudHIPAA compliant · SOC 2 soon

Hosted vision inference, ready when production is.

Moondream Cloud gives you the same production-ready VLM through a managed API. Start with free monthly credits, ship without infrastructure work, and keep a path to local or private deployments when your stack evolves.

$export MOONDREAM_API_KEY=md_...
Managed APIAutoscaled inference
python
import moondream as md

model = md.vl(api_key="$MOONDREAM_API_KEY")
image = open("frame.jpg", "rb")

answer = model.query(image, "Is the line blocked?")
print(answer)
Hosted

Production Moondream without server work.

Usage based

Start free, then pay as traffic grows.

Compliant

HIPAA compliant. SOC 2 soon.

$5

Cloud

in free credits added monthly

$0.06

Cloud

per 1K images for cloud inference

1 API

Cloud

from prototype to hosted production

01
Ship faster

One managed endpoint for captions, queries, detection, and more.

Cloud is the fastest way to put Moondream behind a real product. Use it for prototypes, production services, or burst capacity while you decide where each workload should live long term.

python
# Swap in Cloud when you need hosted inference
from moondream import vl

model = vl(runtime="cloud", api_key=MD_KEY)
result = model.detect(image, "damaged package")
print(result.objects)
Use the same SDK

Keep the Moondream API surface your team already built against. Local tests, Lens fine-tunes, Photon deployments, and cloud inference share the same mental model.

Skip the serving layer

No GPU provisioning, autoscaling work, or queue plumbing. Send images to the managed endpoint and let Moondream handle the production runtime.

Move when you need to

Start hosted, then shift workloads to Photon, partner clouds, or private infrastructure when compliance, latency, or cost calls for it.

Built for production paths

Cloud now. Photon or private deployments later.

Moondream Cloud is part of the same platform story as the open models, Lens, and Photon. You can validate in the hosted API, fine-tune for your data, and keep the option to run closer to your users or data when it matters.

02
Trust and compliance

Bring Moondream into sensitive production workflows.

Moondream Cloud is HIPAA compliant today for teams working with regulated healthcare data, and SOC 2 is coming soon as our formal audit program completes.

Security posture
  • HIPAA compliant for healthcare and life-science teams handling sensitive workflows.
  • SOC 2 is in progress, with the same security program backing our managed production stack.
  • Enterprise support is available when you need deeper reviews, procurement, or deployment guidance.
Enterprise help

Need a review before you deploy?

We can help your team work through security questionnaires, architecture reviews, and the right deployment shape for regulated or high-volume environments.

Talk to us
Start hosted

Get a key, send an image, and keep moving.

Moondream Cloud is the quickest way to run production VLM inference while keeping the deployment flexibility the main platform is built around.