OpenAI rolls out voice mode after delaying it for safety reasons
OpenAI is launching a new voice mode for ChatGPT, capable of detecting tones and processing audio directly. It will be available to paying customers by fall, starting with limited users.
OpenAI has announced the rollout of its new voice mode for ChatGPT, following a delay for additional safety testing. First showcased in May, the voice mode can detect various tones and respond to interruptions, mimicking human conversation. However, it faced criticism for potentially reinforcing sexist stereotypes, particularly regarding the portrayal of female assistants. Actress Scarlett Johansson alleged that her voice was used without permission; OpenAI said that the voice in question, named Sky, was sourced from a different actor, and it has since been removed from the product. The new voice mode will be gradually made available to all paying customers by fall, starting with a limited user group. Unlike previous versions that transcribed spoken input into text, the updated model processes audio directly, allowing it to interpret multiple voices and emotional tones simultaneously. OpenAI has collaborated with speakers of 45 languages from 29 regions to enhance the model's capabilities. Initially, users will have access to four unique voices, with safeguards in place to prevent the generation of voices resembling real individuals. This development reflects ongoing efforts by tech companies to create advanced conversational AI akin to that depicted in science fiction.
Related
OpenAI releases ChatGPT on your desktop for macOS
OpenAI released ChatGPT for macOS, enabling desktop users to chat about various topics, access features like screenshots and file sharing, and enhance productivity. The app plans to expand to Windows.
ChatGPT just (accidentally) shared all of its secret rules
ChatGPT's internal guidelines were accidentally exposed on Reddit, revealing operational boundaries and AI limitations. Discussions ensued on AI vulnerabilities, personality variations, and security measures, prompting OpenAI to address the issue.
AI speech generator 'reaches human parity' – but it's too dangerous to release
Microsoft's VALL-E 2 AI speech generator replicates human voices accurately using minimal audio input. Despite its potential in various fields, Microsoft refrains from public release due to misuse concerns.
OpenAI slashes the cost of using its AI with a "mini" model
OpenAI launches GPT-4o mini, a cheaper model enhancing AI accessibility. Meta to release Llama 3. Market sees a mix of small and large models for cost-effective AI solutions.
IRL 25: Evaluating Language Models on Life's Curveballs
A study evaluated four AI models—Claude 3.5 Sonnet, GPT-4o, Gemini 1.5 Pro, and Mistral Large—on real-life communication tasks, revealing strengths in professionalism but weaknesses in humor and creativity.
Oh, and please drop the requirement for Google Play on Android. It's so annoying. I have Google Play on my phone but no Google account, and the ChatGPT app keeps triggering the Play login screen and refuses to proceed.