1. Início
  2. Clonagem de voz com IA
  3. A Comprehensive Guide to the Apple Personal Voice Cloning Feature
Clonagem de voz com IA

A Comprehensive Guide to the Apple Personal Voice Cloning Feature

Cliff Weitzman

Cliff Weitzman

CEO e fundador da Speechify

apple logoPrêmio de Design da Apple 2025
50M+ usuários

What is the Apple voice clone feature? This groundbreaking tech allows Apple users to clone a person's voice using artificial intelligence (AI). Launched at WWDC 2023, this new accessibility feature uses machine learning to generate a synthetic voice that closely mimics the sound, tone, and intonation of your own voice or that of a loved one.

What is the Apple voice clone feature?

The Apple voice clone feature is an innovative technological advancement announced by Apple at the WWDC 2023. Using machine learning and artificial intelligence (AI), it allows users to create a synthetic version of their own voice or that of a loved one. The cloned voice can then be used across various Apple devices for different functions.

How to clone a person's voice?

Cloning a person's voice using the Apple voice clone feature involves the following steps:

  • Record several minutes of audio where the person is speaking naturally and clearly.
  • The AI analyzes this audio, understanding the unique attributes and characteristics of the speaker's voice.
  • The system then generates a synthetic voice that mimics the original voice as closely as possible.

It is recommended to use clear, in-person conversation audio recorded on an iPhone, iPad, or Mac for the best results.

Is Apple officially launching on-device voice cloning?

Yes, Apple officially announced the launch of on-device voice cloning at the WWDC 2023. This feature is aimed at enhancing accessibility and is designed to help users with cognitive disabilities like ALS (Amyotrophic Lateral Sclerosis) to communicate in their own voice.

What can you use voice cloning for?

Voice cloning has several applications:

  • Personalize phone and Facetime calls.
  • Create podcasts and social media content in your own voice.
  • Operate voice-controlled features like Siri in your voice.
  • For 'live speech' in apps supporting text-to-speech features.

What is the difference between voice cloning and voice recognition?

Voice recognition is a technology that identifies or verifies a person's voice. It's used in voice-controlled assistants like Siri or Google Assistant. On the other hand, voice cloning uses AI to create a synthetic voice that sounds like a particular person's voice.

What are the benefits of using voice cloning?

Voice cloning benefits are:

  • Enhanced assistive access for individuals with speech disabilities.
  • More personalized digital interactions.
  • Facilitates more authentic and engaging communication on various platforms.

How does voice cloning work?

Voice cloning works by using AI and machine learning to analyze the unique characteristics of a person's voice from a recorded audio clip. This includes pitch, tone, and intonation, among others. The AI then generates a synthetic voice that mimics these characteristics as closely as possible.

How can you get an Apple voice clone?

As of the announcement at WWDC 2023, you would be able to access the voice cloning feature across iOS 17 and iPadOS on Apple devices like the iPhone, iPad, Mac, and Apple Watch. The specific process and any prerequisites for using this feature will be provided in detail by Apple at the time of official release.

The top 8 voice cloning apps or software, other than Apple's own, are:

  1. Resemble AI: Offers high-quality voice cloning and text-to-speech services using deep learning.
  2. Descript's Overdub: Lets you clone your voice for easy editing of podcasts or video narration.
  3. Microsoft's Custom Neural Voice: A powerful tool offering high-quality voice synthesis.
  4. CereProc: Known for its extensive language support and emotional voice creation.
  5. iSpeech: Popular for its cloud-based text-to-speech and voice cloning API.
  6. Acapela's My-Own-Voice: Helps those losing their speech to recreate their voice digitally.
  7. Replica Studios: Frequently used in game development for voice-over work.
  8. Google's Tacotron: Open-source tool that converts text-to-speech using machine learning.

Given the emerging trend of voice cloning, there are concerns about misuse, such as in scams. Hence, it is essential to use such technology responsibly. Ethical guidelines need to be in place to safeguard the interests of individuals and prevent misuse of cloned voices.

The new accessibility features are compatible with iOS 17, iPadOS, and all Apple devices including Apple Watch and MacBook. Accessibility advancements also extend to the Magnifier feature, 'Point and Speak' option, and Vision Pro app that aids visually impaired users. While the personal voice feature isn't directly linked to these tools, it signifies Apple's continued commitment to enhance the accessibility of its ecosystem.

As this trending tech news unfolds, let's remember the potential of this feature to shape the future of digital communication. Be it helping Philip Green to converse, creating immersive podcasts, or having your voice heard in a Facetime call, the power of voice cloning lies at your fingertips.

Aproveite as vozes de IA mais avançadas, arquivos ilimitados e suporte 24/7

Teste grátis
tts banner for blog

Compartilhar este artigo

Cliff Weitzman

Cliff Weitzman

CEO e fundador da Speechify

Cliff Weitzman é um defensor da causa da dislexia e o CEO e fundador da Speechify, o aplicativo número 1 de conversão de texto em fala do mundo, com mais de 100.000 avaliações 5 estrelas e líder de downloads na App Store na categoria Notícias & Revistas. Em 2017, Weitzman foi incluído na lista Forbes 30 under 30 por seu trabalho para tornar a internet mais acessível a pessoas com dificuldades de aprendizagem. Cliff Weitzman já foi destaque em veículos como EdSurge, Inc., PC Mag, Entrepreneur, Mashable, entre outros importantes meios de comunicação.

speechify logo

Sobre o Speechify

Leitor de texto para fala nº 1

Speechify é a principal plataforma mundial de texto para fala, utilizada por mais de 50 milhões de usuários e avaliada com mais de 500.000 avaliações cinco estrelas em seus apps de texto para fala para iOS, Android, extensão para Chrome, aplicativo web e aplicativo para desktop Mac. Em 2025, a Apple premiou o Speechify com o prestigioso Prêmio de Design da Apple na WWDC, chamando-o de “um recurso fundamental que ajuda as pessoas a viverem melhor”. O Speechify oferece mais de 1.000 vozes naturais em mais de 60 idiomas e é utilizado em quase 200 países. Entre as vozes de celebridades estão Snoop Dogg, Mr. Beast e Gwyneth Paltrow. Para criadores e empresas, o Speechify Studio oferece ferramentas avançadas, incluindo gerador de voz com IA, clonagem de voz com IA, dublagem com IA e seu alterador de voz com IA. O Speechify também potencializa produtos de ponta com sua API de texto para fala de alta qualidade e excelente custo-benefício. Em destaque no The Wall Street Journal, na CNBC, na Forbes, no TechCrunch e em outros grandes veículos de notícias, o Speechify é o maior provedor de texto para fala do mundo. Acesse speechify.com/news, speechify.com/blog e speechify.com/press para saber mais.