Agent assistants
Let a Discord, Telegram, browser, or support agent speak when text is too flat.
ForgeMesh Voice turns text into speech for agents, apps, videos, books, narration, support flows, and creative tools. Pay per call. No subscription. No API key. Starts at a tenth of a cent.
Use it as the default voice for an assistant, narration for a video workflow, character lines in a game, book drafts, status updates, social clips, and audio alerts.
Let a Discord, Telegram, browser, or support agent speak when text is too flat.
Generate short narration, hooks, voiceovers, and alternate takes without a subscription seat.
Turn scripts, explainers, walkthroughs, and product demos into clean spoken audio.
Draft audiobook chapters, character readings, bedtime stories, and serial narration.
Add generated lines, alerts, quest text, and system narration using persona voices.
Turn critical changes into immediate audio alerts that a human can notice fast.
Expressions are the controls that make speech feel faster, calmer, clearer, more premium, or more urgent. Use them when the voice needs intent, not just audio.
0.7-2.0x. Slow narration down for stories, or push agent updates a little faster. Good starting point for Assistant: 1.12x.
steps 1-100. Use 8 for base-voice clips, 16 for polished speech, and 24 for premium narration.
Assistant, Storyteller, Narrator, Announcer, Urgent, Velvet, Sage, Spark, Anchor, and Echo require Custom. They are not included in Base, Pro, Batch, or OpenAI-shaped routes.
Base is 10 base voices only. Pro adds speed and quality for base voices. Custom unlocks persona voices plus speed and quality.
Start with Assistant, set speed around 1.12, and use 16 quality steps. It keeps status updates clear, warm, and quick without sounding rushed.
POST https://tts.forgemesh.io/v1/tts/custom
content-type: application/json
{
"text": "I found a new task that needs your review.",
"voice": "Assistant",
"speed": 1.12,
"steps": 16
}
Pick the voice by job, not by a random voice id. Standard voices cover everyday speech; personas make the output feel intentional.
Warm, measured narration for books, podcasts, and story agents.
Rich premium tone for luxury UX, product moments, and polished clips.
Crisp voice for intros, news, alerts, and high-clarity updates.
Friendly support voice for chatbots, help systems, and guided flows.
Fast attention voice for warnings, monitoring, and incident audio.
Small calls stay cheap. Longer speech uses the matching bucket. Agents can probe, pay, and speak without accounts, dashboards, or monthly minimums.
Launch an English-speaking agent first, then reuse the same voice layer for global clips, product demos, stories, alerts, and support flows.
Your agent chooses what needs to be said and which voice fits the moment.
It calls the voice endpoint with text, voice, language, and expression settings.
The x402 challenge is paid automatically with USDC on Base.
The WAV returns for playback, storage, alerts, clips, or agent replies.
For the 10 base voices, keep the familiar speech request shape and point it at ForgeMesh Voice. Use Custom routes when you want persona voices like Assistant, Storyteller, Velvet, or Urgent.
Send input, voice, and model fields the way your app already expects.
Use tts.forgemesh.io as the speech base URL.
Your agent receives the payment challenge and retries with USDC on Base.
The response is audio your app, agent, or workflow can play or store.
POST https://tts.forgemesh.io/v1/audio/speech
content-type: application/json
{
"input": "Your shipment is delayed by 18 minutes. I will keep monitoring.",
"voice": "M1",
"model": "forgemesh-voice"
}