Hacker News new | past | comments | ask | show | jobs | submit login

First question, does it pronounce numbers > 9 correctly? At least OpenAI's model doesn't perform at all, marking garbage out of almost every number it finds. I actually dont remember if I checked with EleventLabs... But I was shocked enough that in 2024, someone could release a TTS model that doesn't do numbers correctly. As if the AI industry was approaching Xerox level of failings. However, the TTS models are way worse then the Xerox compression algo ever was.

I believe verifying numbers up to at least 100000 should be a requirement for new TTS models.






Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: