Every cloud TTS service is the same story. Send text, get audio, pay per character, pray the API doesn’t change. Your audio goes to someone else’s servers. Your costs scale …
I keep ending up in the same situation. I’ve got a server somewhere with files that need media processing — videos to transcode, audio to convert, images to resize — …