Low latency voice interaction is key to establishing a natural way of interacting with AI. There have already been some great examples of small companies making large progress in this arena but finally OpenAI, the juggernaut behind ChatGPT, has entered the fray with guns blazing. We should get ready for some very interesting use cases… and probably more than a few scandalous developments too!
OpenAI is expanding its Advanced Voice Mode (AVM) to more of its ChatGPT users, specifically those in the Plus and Teams tiers, with Enterprise and Edu customers set to receive access next week. This new rollout includes a redesigned interface and additional features aimed at making interactions with ChatGPT more natural and user-friendly.
How It Works
Advanced Voice Mode now features a blue animated sphere, replacing the previous black dots. Users will see a pop-up notification in the ChatGPT app when AVM becomes available to them. The feature aims to make ChatGPT more conversational and lifelike by incorporating five new voices—Arbor, Maple, Sol, Spruce, and Vale—bringing the total number of voices to nine.
Benefits
- Enhanced User Experience: The new voices and improved accents make interactions smoother and more natural.
- Customization: Users can personalize responses with Custom Instructions and have ChatGPT remember past conversations with the Memory feature.
- Multilingual Capabilities: The system can now say “Sorry I’m late” in over 50 languages, broadening its accessibility.
Concerns
Despite these advancements, there are some concerns. The AVM feature is not yet available in several regions, including the EU, the U.K., and parts of Scandinavia. Additionally, the absence of the Sky voice, which was removed due to legal concerns from Scarlett Johansson, highlights potential issues with voice likeness and intellectual property.
Possible Business Use Cases
- Virtual Customer Service: Use AVM to create more natural and efficient customer service bots for businesses.
- Language Learning Apps: Develop apps that leverage AVM’s multilingual capabilities to teach languages more interactively.
- Healthcare Assistants: Implement AVM in healthcare settings to provide patients with a more comforting and human-like interaction.
As we see these advancements in voice technology, it raises an important question: How will the increasing naturalness of AI voices impact our daily interactions and the way we perceive artificial intelligence?
Image Credit: DALL-E
—
Want to get the RAIZOR Report with all the latest AI news, tools, and jobs? We even have a daily mini-podcast version for all the news in less than 5 minutes! You can subscribe here.
RAIZOR helps our clients cut costs, save time, and boost revenue with custom AI automations. Book an Exploration Call if you’d like to learn more about how we can help you grow your business.