1. Início
  2. TTS
  3. Speech to Text on Google Docs
TTS

Speech to Text on Google Docs

Cliff Weitzman

Cliff Weitzman

CEO e fundador da Speechify

apple logoPrêmio de Design da Apple 2025
50M+ usuários

If you've ever wished you could simply speak and have your words magically appear on the screen, then Google Docs' voice typing feature is here to make that dream a reality. I’m going to walk you through how to use this powerful tool, step-by-step, in a smart casual yet educational tone.

Getting Started with Google Docs Voice Typing

First things first, you need to open Google Docs in your Chrome browser. This functionality works best in Google Chrome, so make sure you're using that.

  1. Open Google Docs. If you don't have a Google account, you'll need to create one. Once you're logged in, open a new document.
  2. Go to the "Tools" menu in the top bar and select "Voice typing…". A microphone icon will appear on the left side of your document.
  3. Click on the microphone icon to start speaking. Make sure your microphone is enabled and functioning correctly.
  4. Start speaking clearly in your preferred language. Google’s voice typing supports multiple languages, including English, French, Spanish, and many more. The tool will transcribe your speech in real-time.

Using Voice Commands

Google Docs' voice typing feature isn’t just about transcribing your speech. You can also use voice commands to format your document. Here are some handy commands you can use:

  • New line: Moves the cursor to a new line.
  • New paragraph: Starts a new paragraph.
  • Comma, period, question mark: Inserts the respective punctuation marks.
  • Bold, italics, underline: Applies the formatting to the selected text.
  • Select paragraph: Selects the current paragraph.
  • Go to end of line: Moves the cursor to the end of the current line.

You can even say "right-click" to bring up the context menu or use "ctrl+shift+s" as a keyboard shortcut for additional speech-to-text options.

Voice Typing on Different Devices

On Windows and Mac

The voice typing feature works seamlessly on both Windows and Mac systems as long as you're using the Chrome browser. The process is the same: open Google Docs, activate voice typing, and start speaking.

On Android

For those on Android devices, the process is equally straightforward. Open Google Docs via the Google Drive app, tap on the document to start editing, and use the built-in voice typing feature on your keyboard.

Tips for Better Transcription

To ensure high-quality transcription:

  • Speak clearly and at a steady pace.
  • Use a good quality microphone.
  • Avoid background noise.

Formatting with Voice Commands

One of the standout features of Google Docs' voice typing is its ability to handle formatting commands. For example:

  • Say "comma" to insert a comma.
  • Say "new paragraph" to start a new paragraph.
  • Say "underline" before and after the word you want to underline.

This functionality helps streamline your workflow, allowing you to dictate not only the text but also the formatting, which can be a huge time-saver.

Top 5 speech-to-text apps

  1. Google Docs Voice Typing: Google Docs offers built-in speech recognition through its Voice Typing feature. Simply select Voice Typing from the dropdown menu under Tools to start dictation and convert your speech to text effortlessly.
  2. Microsoft Dictate: Microsoft Dictate is an add-on for Office applications, utilizing advanced voice recognition technology to transcribe spoken words into text. It integrates seamlessly with Word, Outlook, and PowerPoint, enabling efficient dictation.
  3. Otter.ai: Otter.ai provides real-time speech recognition and transcription services. It's ideal for meetings, lectures, and notes, offering high accuracy and the ability to integrate with Google Slides for live captioning.
  4. Dragon Anywhere: Dragon Anywhere by Nuance offers professional-grade speech recognition for mobile devices. It allows continuous dictation and voice commands to edit and format text, making it perfect for on-the-go users in Canada and beyond.
  5. Speechnotes: Speechnotes is a user-friendly speech-to-text app that provides accurate dictation and voice recognition. With easy access via a pop-up or dropdown menu, it’s great for quick transcriptions and note-taking.

These apps utilize advanced speech recognition technology to make dictation easy and efficient, whether you're using Google Slides, Microsoft applications, or other platforms.

Speechify Speech API

The Speechify Text to Speech API is a powerful tool designed to convert written text into spoken words, enhancing accessibility and user experience across various applications. It leverages advanced speech synthesis technology to deliver natural-sounding voices in multiple languages, making it an ideal solution for developers looking to implement audio reading features in apps, websites, and e-learning platforms.

With its easy-to-use API, Speechify enables seamless integration and customization, allowing for a wide range of applications from reading aids for the visually impaired to interactive voice response systems.

Troubleshooting

If the voice typing feature isn't working:

  • Check your microphone settings: Ensure your microphone is properly connected and enabled in Chrome.
  • Clear browser cache: Sometimes, clearing your browser’s cache can resolve minor issues.
  • Update Chrome: Make sure you’re using the latest version of Google Chrome.

Google Docs' voice typing feature is a powerful tool that can enhance your productivity by allowing you to transcribe your speech quickly and accurately. Whether you're using it for personal notes, business documents, or academic papers, this feature is versatile and easy to use. By incorporating voice commands, you can further streamline your workflow and focus on your content rather than the mechanics of typing.

Give it a try and see how it transforms your document creation process. Whether you're on Windows, Mac, or Android, Google Docs' voice typing is a game-changer for anyone looking to use speech-to-text technology.

Aproveite as vozes de IA mais avançadas, arquivos ilimitados e suporte 24/7

Teste grátis
tts banner for blog

Compartilhar este artigo

Cliff Weitzman

Cliff Weitzman

CEO e fundador da Speechify

Cliff Weitzman é um defensor da causa da dislexia e o CEO e fundador da Speechify, o aplicativo número 1 de conversão de texto em fala do mundo, com mais de 100.000 avaliações 5 estrelas e líder de downloads na App Store na categoria Notícias & Revistas. Em 2017, Weitzman foi incluído na lista Forbes 30 under 30 por seu trabalho para tornar a internet mais acessível a pessoas com dificuldades de aprendizagem. Cliff Weitzman já foi destaque em veículos como EdSurge, Inc., PC Mag, Entrepreneur, Mashable, entre outros importantes meios de comunicação.

speechify logo

Sobre o Speechify

Leitor de texto para fala nº 1

Speechify é a principal plataforma mundial de texto para fala, utilizada por mais de 50 milhões de usuários e avaliada com mais de 500.000 avaliações cinco estrelas em seus apps de texto para fala para iOS, Android, extensão para Chrome, aplicativo web e aplicativo para desktop Mac. Em 2025, a Apple premiou o Speechify com o prestigioso Prêmio de Design da Apple na WWDC, chamando-o de “um recurso fundamental que ajuda as pessoas a viverem melhor”. O Speechify oferece mais de 1.000 vozes naturais em mais de 60 idiomas e é utilizado em quase 200 países. Entre as vozes de celebridades estão Snoop Dogg, Mr. Beast e Gwyneth Paltrow. Para criadores e empresas, o Speechify Studio oferece ferramentas avançadas, incluindo gerador de voz com IA, clonagem de voz com IA, dublagem com IA e seu alterador de voz com IA. O Speechify também potencializa produtos de ponta com sua API de texto para fala de alta qualidade e excelente custo-benefício. Em destaque no The Wall Street Journal, na CNBC, na Forbes, no TechCrunch e em outros grandes veículos de notícias, o Speechify é o maior provedor de texto para fala do mundo. Acesse speechify.com/news, speechify.com/blog e speechify.com/press para saber mais.