Xiaomi MiMo

MiMo Speech Models

The English MiMo homepage highlights MiMo-V2.5-TTS Series with the positioning 'Give your agent a voice.' The blog list also includes MiMo-V2.5-ASR and MiMo-V2-TTS. Treat this as the speech-specific MiMo track until separate model cards expose deeper API and pricing details.

Partially availableFull English UILimited APIUnknownTrusted

Quick answers

At a glance

Overview
Xiaomi MiMo's English-listed speech model line covering MiMo-V2.5-ASR, MiMo-V2.5-TTS Series and MiMo-V2-TTS.
Best fit
Teams watching Xiaomi's speech stack for ASR, TTS and voice-agent experiments.
Trust
2/2 sources verified, recently checked · 2026-05-17
Coverage
100/100 · backfill: pricing

Editorial verdict

Best for

Teams watching Xiaomi's speech stack for ASR, TTS and voice-agent experiments.

Avoid if

Avoid choosing it for production voice workloads until API limits, language coverage and licensing are verified.

Why it matters

MiMo now has enough English-facing speech signals to deserve a separate audio profile.

Pricing

Speech-model pricing not publicly visible on the English homepage; verify inside MiMo API Platform

Payment

API Platform billing, AI Studio, Open-source model access where available

Commercial use

Commercial use should follow the current product, API, model license and billing terms.

Privacy

Review prompt, file, media upload, retention and training-use terms before sensitive workloads.

Use-case fit

Voice agents

Medium

TTS Series is positioned around giving agents a voice.

Speech recognition research

Medium

The blog list describes MiMo-V2.5-ASR as open-source speech recognition.

Global user checklist

RegistrationPartialStart from MiMo Web Demo or API Platform and verify whether speech models are available to the account.
English UIConfirmedSpeech releases are listed on the English homepage/blog.
API and docsPartialPublic English pages identify the models, but detailed API docs still need platform access.
Commercial useUnknownCheck hosted and open-source speech-model licenses separately.
Coverage · 100/100 · backfill: pricing

Model names, quotas, release status, regional access and commercial terms can change quickly; recheck official sources before procurement or production use.

Pros

  • - English site explicitly lists ASR and TTS releases
  • - Fits agent voice and speech-recognition workflows

Cons

  • - Detailed speech API parameters and pricing need platform verification
  • - Homepage/blog list is lighter than a full developer documentation set

Decision paths

minimax-audio

qwen-audio

zhipu-glm-audio

Sources

MiMo English website

official · en · verified 2026-05-17

Lists MiMo-V2.5-TTS Series and build-with-MiMo access paths.

MiMo English blog

official · en · verified 2026-05-17

Lists MiMo-V2.5-ASR, V2.5-TTS Series and V2-TTS release entries.

Reviews