Logo Logo
directory schraeg
Camcorders· Cinema-Kamera· Computers· Drohnen· GPU· Kamera-Zubehör· Video-DSLR· accessories
Compositing· Color correction· DV Editing

Shooting· Sound
Forschung· KI· Reviews· Streaming
/// News
Synthetic AI voices are competing with professional speakers

Synthetic AI voices are competing with professional speakers

[13:26 Sun,21.May 2023   by ]    

Generative AIs can now create texts that sound as if they were written by humans, conjure up photorealistic images out of thin air, and last but not least - as has often been the subject of this article - credibly synthesize human voices. Both voices that exist in real life and artificial new ones can be generated, or rather, recordings with such voices can be created.


Unlike the robot voices that tried to sell us dubious services over the phone years ago, the algorithms that are now speaking actually sound confusingly similar to us humans. Not only is their pronunciation virtually perfect, they can even simulate emotions. This is already being used for fraudulent purposes (for example, in grandchildren's tricks or blackmail calls), but of course many other uses are obvious - namely, practically everywhere where sound recordings of speakers have been used up to now. We would guess that we already encounter AI voices more often than we think.

For example, according to a recent report at www.digitaljournal.com/life/audio-book-narrators-say-ai-is-already-taking-away-business/article, audiobook narrators are complaining about dwindling orders - in some cases, revenues are said to have halved compared to the previous year. The main culprit is said to be competition from AI-based dubbing services. There are several services on the Internet that offer to create audiobooks at a fraction of the usual price. "Spoken" by artificial voices with a trained emotional register. In some cases actual voice actors have been cloned and their creators receive royalties when their voices are used for commissions, but this is not always the case.

While traditional narrators are in danger of losing their livelihoods due to AI, the new services are being touted as democratizing the audiobook industry. Even the smallest publishers can now afford audiobook versions, the argument goes. However, only a handful of services are likely to make money from the largely automated production of audiobooks. With a labeling obligation for AI-based audiobook productions, as some speakers are calling for, the audience could still decide for themselves who reads stories to them.

The situation is likely to be similar in the dubbing business. The forerunner here seems to be Latin America, at least an article made the rounds in February, describing how speakers there are increasingly coming under pressure from automatic dubbing services that have these very speakers read in voice samples for their AI voice training - at dumping prices and without further involvement.

All these automatically generated AI voices may sound human and be able to adapt their intonation to the spoken content. Of course, that doesn't mean they can compete with real (voice) actors. A good dubbed version, for instance, was classically recorded by trained voice actors with proper direction and in some cases could be at least as good in quality as the original (sometimes even better - at least funnier, as in the legendary case of The Two).

Today's dubbed versions often seem much more loveless (especially on TV), and are usually produced much cheaper and faster. In some cases, it may not make that much of a difference if synthetic voices are used. It is also well known that AI can even adapt the lip movements in the image to the new, spoken text, so that it would soon be possible, for example, to have Ryan Gosling in Blade Runner deliver his text in German, in his "own" voice.

Not uncool at all, if this should really work out well. But the prospect of a cheap, AI-generated synchro-feeling mishmash everywhere is less pleasing.

deutsche Version dieser Seite: Synthetische KI-Stimmen machen professionellen Sprechern Konkurrenz


  Vorige News lesen Nächste News lesen 
bildHyliumX: Neue Drohe bleibt mehr als 5 Stunden in der Luft - dank Wasserstoff bildNeue Fuji-X Objektive: Voigtländer Ultron 27mm f2.0 und Tamron 11-20mm F/2.8

related news:1E0Making of - How the viral Wes Anderson AI videos are created 29.May 2023
Microsoft Olive: New free tool doubles performance of Stable Diffusion 24.May 2023
Generative Fill - AI image enhancement in Photoshop Beta, easy for everyone! 23.May 2023
Diffusae - Stable diffusion for After Effects plugin 23.May 2023
Drag your GAN - Change AI motifs simply by dragging them 19.May 2023
Stable Diffusion rejuvenates Harrison Ford 16.May 2023
Google introduces About this image and a label for AI generated images 14.May 2023
alle Newsmeldungen zum Thema KI

[nach oben]

Archiv Newsmeldungen


May - April - March - February - January

December - November - October - September - August - July - June - May - April - March - February - January























deutsche Version dieser Seite: Synthetische KI-Stimmen machen professionellen Sprechern Konkurrenz

last update : 30.Mai 2023 - 12:02 - slashCAM is a project by channelunit GmbH- mail : slashcam@--antispam:7465--slashcam.de - deutsche Version