Alibaba Cloud
Wan / HappyHorse Video
Qwen Cloud's marketplace and docs show video generation alongside other model categories, with HappyHorse-T2V/I2V in the marketplace and Wan text-to-video/image-to-video models in the model-selection docs.
Quick answers
At a glance
- Overview
- Qwen Cloud's video-generation model line, including Wan video models and HappyHorse text-to-video/image-to-video entries.
- Best fit
- Developers who want Qwen Cloud API access to Chinese video generation models.
- Trust
- 2/2 sources verified, recently checked · 2026-05-17
- Coverage
- 100/100
Editorial verdict
Best for
Developers who want Qwen Cloud API access to Chinese video generation models.
Avoid if
Avoid it as the initial no-code creator path; compare Kling or Hailuo for that workflow.
Why it matters
Qwen Cloud now exposes video generation in its English marketplace and docs, so Qwen should not be represented only as an LLM.
Pricing
Free tier and pay-as-you-go video API billing vary by model
Payment
Qwen Cloud billing, Pay-as-you-go API billing
Commercial use
Commercial use should follow the current product, API, model license and billing terms.
Privacy
Review prompt, file, media upload, retention and training-use terms before sensitive workloads.
Use-case fit
Text-to-video API
StrongUse Wan or HappyHorse text-to-video models for API-driven video generation.
Image-to-video API
StrongUse image-to-video models when source imagery needs animation.
Global user checklist
Model names, quotas, release status, regional access and commercial terms can change quickly; recheck official sources before procurement or production use.
Pros
- - Official marketplace includes video-generation entries
- - Docs separate text-to-video and image-to-video workflows
Cons
- - Creator web workflow is less established than dedicated video tools like Kling or Hailuo
Decision paths
kling-ai
hailuo-ai
zhipu-cogvideo
Sources
official · en · verified 2026-05-17
Lists HappyHorse text-to-video and image-to-video models.
docs · en · verified 2026-05-17
Documents text-to-video generation workflow.