this post was submitted on 05 Oct 2025
TechTakes
Ignorant question ahead: how do the voices work with these things? A face, or an entire body, is limited in its range of motion by our skeleton and muscles (for the most part; puff your cheeks and there’s one of many exceptions). A voice, though, is MUCH more dynamic. Programming the lyrics and notes wouldn’t be nearly enough. Just getting the tone and inflections right seems like it would be an absolute nightmare.
I was wondering the same thing about that AI “actress”. In the case of a talented professional (or even a hack who’s terrible but trying their best), a LOT of care and thought goes into the emotion behind each word. How do they program that? Or are these just fancy 3D puppets with human voice actors behind them?
I don’t know why I didn’t think of that. Audio prompts can be just as approximate as video prompts. Very simple answer. Thank you.
Of course, I’m still not crazy about it, but that’s a different topic.