xAI has rolled out its Grok Voice API, enabling developers to create interactive voice applications that can listen, understand, and respond with natural language. The new capability supports real-time voice conversations with multilingual speech recognition, giving users access to five distinct voices—Ara, Rex, Sal, Eve, and Leo—each engineered for clarity and natural sound quality. Low latency ensures smooth interactions, while integrated Web and X search tools allow apps to access current information during conversations. This opens up possibilities for voice-driven AI applications across multiple platforms and use cases.

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 3
  • Repost
  • Share
Comment
0/400
GasWaster69vip
· 2025-12-18 10:59
Grok Voice API is out, and a bunch of people are starting to tinker with it, but does anyone really use it?
View OriginalReply0
EthMaximalistvip
· 2025-12-17 20:58
Grok Voice API is out, with five voice options, which is quite diverse. However, the applications that can actually be used will have to wait. Most developers are probably still observing.
View OriginalReply0
TestnetScholarvip
· 2025-12-17 20:52
Sound API is out, with five voice options available. It sounds pretty good, but I don't know how the latency is.
View OriginalReply0
  • Pin