US venture firm OpenAI says it has developed generative artificial intelligence technology that can recreate human speech by using a short recorded voice sample of the original speaker.
OpenAI revealed its newly developed AI "Voice Engine" on Friday.
The company says "Voice Engine" uses a 15-second audio sample and input text to generate human speech. "Voice Engine" reads the text in a voice that closely resembles the original speaker's.
The firm says the new model can also translate the AI-generated speech into foreign languages. The model can reportedly preserve the native accent of the original speaker as it does so.
OpenAI says the AI technology can be used to support people who have trouble speaking because of illness or other reasons. The firm says businesses can also use the AI technology to transmit information to foreign countries.
But there is concern that this kind of technology could be used to spread fake information during election campaigns or in other situations.
OpenAI said, "We recognize that generating speech that resembles people's voices has serious risks, which are especially top of mind in an election year."
The company says it does not plan to widely release the technology for the time being, because of the possible risks.
It says it still needs to take steps to prevent the technology from being misused. It cited the generation of the voices of celebrities as one type of misuse.
URL: https://www3.nhk.or.jp/nhkworld/en/news/20240331_01/
Date: March 31, 2024