This endpoint layers AI-generated vocals on top of an existing instrumental. Given a prompt (e.g., lyrical concept or musical mood) and optional audio, it produces vocal output harmonized with the provided track.
uploadUrl specifies the audio file URL to be processed.
prompt defines lyric content and singing style.
model specifies the AI model version for generation.
style and negativeTags are used to control music and vocal style.
negativeTags are used to exclude unwanted elements.
Callback stages: text (text generation), first (first track completed), complete (all completed).
All APIs require authentication via Bearer Token.
Get API Key:
Usage: Add the following header to every request: Authorization: Bearer YOUR_API_KEY
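As a minimal sketch of the header described above, the following Python builds the Authorization value from an API key. The helper name build_auth_headers is hypothetical and not part of the API; only the "Authorization: Bearer YOUR_API_KEY" format comes from this document.

```python
def build_auth_headers(api_key: str) -> dict[str, str]:
    """Return request headers carrying the Bearer token.

    build_auth_headers is an illustrative helper, not an API function.
    """
    if not api_key or not api_key.strip():
        raise ValueError("API key must be a non-empty string")
    # Header format taken from the usage line: Authorization: Bearer <key>
    return {"Authorization": f"Bearer {api_key.strip()}"}


headers = build_auth_headers("YOUR_API_KEY")
print(headers["Authorization"])
```

The returned dictionary can be passed to whichever HTTP client is in use.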
Prompt for generating audio. Usually text describing the audio content; it guides both what the vocals sing and their style.
"A calm and relaxing piano track."
Music title. Displayed in the player interface and used in the file name.
"Relaxing Piano"
Excluded music styles. Used to avoid including specific styles or elements in the generated music.
"heavy metal, strong drum beats"
Music style, such as jazz, electronic, or classical.
"Jazz"
URL of the uploaded audio file. Specifies the source audio file location for adding vocals.
"https://example.com/music.mp3"
URL that receives vocal generation task completion updates. This parameter is required for all vocal generation requests.
Callback stages: text (text generation), first (first track completed), complete (all completed).
"https://example.com/callback"
The AI model version to use for generation.
V5: Superior musical expression, faster generation.
V4_5PLUS: V4.5+ offers richer sound and new ways to create.
Available options: V4_5PLUS, V5
"V4_5PLUS"
Vocal gender preference. Optional. 'm' for male, 'f' for female. In practice, this parameter only increases the probability of the requested voice; it does not guarantee a male or female voice.
Available options: m, f
"m"
Adherence strength to specified style. Optional. Range 0–1, up to 2 decimal places.
0 <= x <= 1
Must be a multiple of 0.01
0.61
Controls experimental/creative deviation level. Optional. Range 0–1, up to 2 decimal places.
0 <= x <= 1
Must be a multiple of 0.01
0.72
Relative weight of audio elements. Optional. Range 0–1, up to 2 decimal places.
0 <= x <= 1
Must be a multiple of 0.01
0.65
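The constraints above can be checked on the client before a request is sent. The sketch below is illustrative only: is_valid_weight and build_payload are hypothetical helper names, and the payload includes only the fields this document names explicitly (uploadUrl, prompt, model, style, negativeTags). The callback URL is required by the API, but its field name is not given here, so it is deliberately left out rather than guessed.

```python
from decimal import Decimal, InvalidOperation

# Model options listed in this document.
ALLOWED_MODELS = {"V4_5PLUS", "V5"}


def is_valid_weight(value) -> bool:
    """True if value lies in [0, 1] and is a multiple of 0.01."""
    try:
        # Go through str() so 0.61 becomes Decimal("0.61"), not a binary float.
        d = Decimal(str(value))
    except InvalidOperation:
        return False
    if not d.is_finite() or d < 0 or d > 1:
        return False
    # A multiple of 0.01 is unchanged when rounded to two decimal places.
    return d == d.quantize(Decimal("0.01"))


def build_payload(upload_url, prompt, model="V4_5PLUS",
                  style=None, negative_tags=None) -> dict:
    """Assemble a partial request body from the fields named in this document."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    payload = {"uploadUrl": upload_url, "prompt": prompt, "model": model}
    if style is not None:
        payload["style"] = style
    if negative_tags is not None:
        payload["negativeTags"] = negative_tags
    return payload


print(is_valid_weight(0.61), is_valid_weight(0.615), is_valid_weight(1.2))
print(build_payload("https://example.com/music.mp3",
                    "A calm and relaxing piano track.",
                    style="Jazz",
                    negative_tags="heavy metal, strong drum beats"))
```

Using Decimal rather than float arithmetic avoids rounding surprises when testing for multiples of 0.01.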
Request successful
Response status code
Available options: 200, 401, 402, 404, 409, 422, 429, 451, 455, 500
Error message when code != 200
"success"
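A hedged sketch of how a client might act on the status code described above. It assumes the response body is JSON with a numeric code field (the name appears in "code != 200") and a message field called msg; the msg name, the ApiError class, and check_response are assumptions for illustration, not confirmed by this document.

```python
# Status codes listed in this document.
DOCUMENTED_CODES = {200, 401, 402, 404, 409, 422, 429, 451, 455, 500}


class ApiError(Exception):
    """Raised when the response code is anything other than 200."""

    def __init__(self, code, message):
        super().__init__(f"API error {code}: {message}")
        self.code = code
        self.message = message


def check_response(body: dict) -> dict:
    """Return the body on success, otherwise raise ApiError."""
    code = body.get("code")
    if code == 200:
        return body
    # "msg" is an assumed field name for the error message.
    message = body.get("msg", "")
    if code not in DOCUMENTED_CODES:
        message = f"undocumented status code; {message}".rstrip("; ")
    raise ApiError(code, message)


ok = check_response({"code": 200, "msg": "success"})
print(ok["msg"])
```

Raising a dedicated exception keeps the success path free of status checks and lets callers decide per code whether to retry or abort.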