/minimaxi prefix, and every request body, response field, and error code stays fully compatible with the official MiniMax API — so you can migrate existing integrations without changing a single payload shape.
Field details, value ranges, and constraints follow the official MiniMax API reference. This page covers Fhddos-specific routing, billing, and ready-to-run examples.
Authentication & routing
Set your Fhddos token as a Bearer credential and prefix every path with/minimaxi:
Capabilities at a glance
Video Generation
Create text-to-video, first-frame, first-last-frame, and subject-reference video tasks with
MiniMax-Hailuo-02.Text-to-Speech (TTS)
Synthesize audio synchronously via
t2a_v2 or submit long jobs asynchronously via t2a_async_v2.Voice Cloning
Clone a voice in three steps — upload audio, optionally upload a prompt sample, then call
voice_clone.File Management
Retrieve file metadata by ID and download raw file content from the MiniMax files API.
Billing
Video generation uses a pay-on-success async billing model:- When you create a task, Fhddos records it but does not immediately charge your account.
- A background poller checks each task’s final state (success or failure).
- Only successful tasks are billed according to the price schedule.
- Failed tasks are never charged.
Video generation
Submit video generation tasks toPOST /minimaxi/v1/video_generation. All tasks are asynchronous — creation returns a task_id you then poll to retrieve the finished video.
Scenario 1 — Text-to-video
task_id is the raw MiniMax upstream task ID — you can use it directly with the official MiniMax query endpoint. platform_id (video_<ULID>) is Fhddos’s internal tracking ID. Both IDs are accepted by the query endpoint.Scenario 2 — First-frame image drive
Animate from a single starting image by supplyingfirst_frame_image:
Scenario 3 — First-frame + last-frame
Generate the transition between two keyframes by providing bothfirst_frame_image and last_frame_image:
Scenario 4 — Subject reference
Pass reference images viasubject_reference to anchor a character or object across the generated video:
Query a video task
Poll task status with thetask_id returned at creation:
Text-to-speech (TTS)
Fhddos exposes both synchronous and asynchronous TTS endpoints. Choose sync for short, latency-sensitive requests and async for longer texts.- Sync — t2a_v2
- Async — t2a_async_v2
Send text and receive the audio file directly in the response:
Voice aliases
To maintain compatibility with OpenAI-style voice names, Fhddos automatically maps the following aliases to the corresponding MiniMaxvoice_id values:
| Alias | MiniMax voice_id |
|---|---|
alloy | female-chengshu |
echo | male-qn-qingse |
fable | male-qn-jingying |
onyx | presenter_male |
nova | presenter_female |
shimmer | audiobook_female_1 |
emotion if you don’t supply one explicitly. You can override any alias through channel custom parameters:
Key TTS parameters
| Parameter | Values | Notes |
|---|---|---|
output_format | hex (default), url | Non-streaming only; URL validity follows upstream policy |
emotion | happy, sad, angry, fearful, disgusted, surprised, calm, fluent, whisper | Not all model + voice combinations support every emotion |
sound_effects | spacious_echo, auditorium_echo, lofi_telephone, robotic | Applies an acoustic effect to the output |
Audio return mode
Control how the audio payload is returned via theaudio_mode channel parameter:
| Mode | Behaviour |
|---|---|
json (default) | Response body is JSON; data.audio holds the hex string or URL |
hex | When output_format=hex, returns a raw audio stream; when output_format=url, still returns JSON |
Voice cloning
Clone any voice in three steps usingPOST /minimaxi/v1/voice_clone. The interface is fully compatible with the official MiniMax /v1/voice_clone endpoint.
Step 1 — Upload the source audio
file.file_id— pass this toPOST /minimaxi/v1/voice_clonefile.filename— the original file name
Step 2 — (Optional) Upload a prompt sample
Upload a short reference clip to improve similarity and stability:Step 3 — Clone the voice
Request fields
| Field | Required | Description |
|---|---|---|
file_id | ✅ | ID of the audio to clone (from purpose=voice_clone upload) |
voice_id | ✅ | Target cloned voice ID; must follow official naming rules |
clone_prompt.prompt_audio | ❌ | File ID of a short reference sample (from purpose=prompt_audio upload) |
clone_prompt.prompt_text | ❌ | Transcript of the reference sample |
text | ❌ | Preview text to synthesize (max 1 000 characters) |
model | ✅ when text set | TTS model for preview synthesis (e.g. speech-2.8-hd) |
language_boost | ❌ | Minor-language / dialect enhancement (Chinese, English, auto, …) |
need_noise_reduction | ❌ | Enable noise reduction on the source audio |
need_volume_normalization | ❌ | Normalise volume of the source audio |
aigc_watermark | ❌ | Append an AIGC watermark rhythm to the preview audio |
Response fields
| Field | Description |
|---|---|
demo_audio | Preview audio URL (returned when text + model are provided) |
input_sensitive | Populated when the source audio triggers a content policy; includes type |
base_resp.status_code | 0 = success |
base_resp.status_msg | Human-readable status description |
