1. Inici
  2. TTS
  3. Speech to Text on Google Docs
Publicat el TTS

Speech to Text on Google Docs

Cliff Weitzman

Cliff Weitzman

CEO i fundador de Speechify

apple logoPremi de Disseny Apple 2025
Més de 50 M d'usuaris

If you've ever wished you could simply speak and have your words magically appear on the screen, then Google Docs' voice typing feature is here to make that dream a reality. I’m going to walk you through how to use this powerful tool, step-by-step, in a smart casual yet educational tone.

Getting Started with Google Docs Voice Typing

First things first, you need to open Google Docs in your Chrome browser. This functionality works best in Google Chrome, so make sure you're using that.

  1. Open Google Docs. If you don't have a Google account, you'll need to create one. Once you're logged in, open a new document.
  2. Go to the "Tools" menu in the top bar and select "Voice typing…". A microphone icon will appear on the left side of your document.
  3. Click on the microphone icon to start speaking. Make sure your microphone is enabled and functioning correctly.
  4. Start speaking clearly in your preferred language. Google’s voice typing supports multiple languages, including English, French, Spanish, and many more. The tool will transcribe your speech in real-time.

Using Voice Commands

Google Docs' voice typing feature isn’t just about transcribing your speech. You can also use voice commands to format your document. Here are some handy commands you can use:

  • New line: Moves the cursor to a new line.
  • New paragraph: Starts a new paragraph.
  • Comma, period, question mark: Inserts the respective punctuation marks.
  • Bold, italics, underline: Applies the formatting to the selected text.
  • Select paragraph: Selects the current paragraph.
  • Go to end of line: Moves the cursor to the end of the current line.

You can even say "right-click" to bring up the context menu or use "ctrl+shift+s" as a keyboard shortcut for additional speech-to-text options.

Voice Typing on Different Devices

On Windows and Mac

The voice typing feature works seamlessly on both Windows and Mac systems as long as you're using the Chrome browser. The process is the same: open Google Docs, activate voice typing, and start speaking.

On Android

For those on Android devices, the process is equally straightforward. Open Google Docs via the Google Drive app, tap on the document to start editing, and use the built-in voice typing feature on your keyboard.

Tips for Better Transcription

To ensure high-quality transcription:

  • Speak clearly and at a steady pace.
  • Use a good quality microphone.
  • Avoid background noise.

Formatting with Voice Commands

One of the standout features of Google Docs' voice typing is its ability to handle formatting commands. For example:

  • Say "comma" to insert a comma.
  • Say "new paragraph" to start a new paragraph.
  • Say "underline" before and after the word you want to underline.

This functionality helps streamline your workflow, allowing you to dictate not only the text but also the formatting, which can be a huge time-saver.

Top 5 speech-to-text apps

  1. Google Docs Voice Typing: Google Docs offers built-in speech recognition through its Voice Typing feature. Simply select Voice Typing from the dropdown menu under Tools to start dictation and convert your speech to text effortlessly.
  2. Microsoft Dictate: Microsoft Dictate is an add-on for Office applications, utilizing advanced voice recognition technology to transcribe spoken words into text. It integrates seamlessly with Word, Outlook, and PowerPoint, enabling efficient dictation.
  3. Otter.ai: Otter.ai provides real-time speech recognition and transcription services. It's ideal for meetings, lectures, and notes, offering high accuracy and the ability to integrate with Google Slides for live captioning.
  4. Dragon Anywhere: Dragon Anywhere by Nuance offers professional-grade speech recognition for mobile devices. It allows continuous dictation and voice commands to edit and format text, making it perfect for on-the-go users in Canada and beyond.
  5. Speechnotes: Speechnotes is a user-friendly speech-to-text app that provides accurate dictation and voice recognition. With easy access via a pop-up or dropdown menu, it’s great for quick transcriptions and note-taking.

These apps utilize advanced speech recognition technology to make dictation easy and efficient, whether you're using Google Slides, Microsoft applications, or other platforms.

Speechify Speech API

The Speechify Text to Speech API is a powerful tool designed to convert written text into spoken words, enhancing accessibility and user experience across various applications. It leverages advanced speech synthesis technology to deliver natural-sounding voices in multiple languages, making it an ideal solution for developers looking to implement audio reading features in apps, websites, and e-learning platforms.

With its easy-to-use API, Speechify enables seamless integration and customization, allowing for a wide range of applications from reading aids for the visually impaired to interactive voice response systems.

Troubleshooting

If the voice typing feature isn't working:

  • Check your microphone settings: Ensure your microphone is properly connected and enabled in Chrome.
  • Clear browser cache: Sometimes, clearing your browser’s cache can resolve minor issues.
  • Update Chrome: Make sure you’re using the latest version of Google Chrome.

Google Docs' voice typing feature is a powerful tool that can enhance your productivity by allowing you to transcribe your speech quickly and accurately. Whether you're using it for personal notes, business documents, or academic papers, this feature is versatile and easy to use. By incorporating voice commands, you can further streamline your workflow and focus on your content rather than the mechanics of typing.

Give it a try and see how it transforms your document creation process. Whether you're on Windows, Mac, or Android, Google Docs' voice typing is a game-changer for anyone looking to use speech-to-text technology.

Gaudeix de les veus amb IA més avançades, arxius il·limitats i suport 24/7

Prova-ho gratis
tts banner for blog

Comparteix aquest article

Cliff Weitzman

Cliff Weitzman

CEO i fundador de Speechify

Cliff Weitzman és un defensor de la dislèxia i el CEO i fundador de Speechify, l'app de text a veu número 1 al món, amb més de 100.000 ressenyes de 5 estrelles i líder del rànquing de l'App Store en Notícies i Revistes. El 2017, Weitzman va entrar a la llista Forbes 30 under 30 per la seva tasca fent internet més accessible per a persones amb dificultats d'aprenentatge. Cliff Weitzman ha aparegut a EdSurge, Inc., PC Mag, Entrepreneur, Mashable i altres mitjans destacats.

speechify logo

Sobre Speechify

El millor lector de text a veu

Speechify és la plataforma líder mundial de text a veu, de confiança per a més de 50 milions d'usuaris i avalada per més de 500.000 ressenyes de cinc estrelles a les seves aplicacions de text a veu per a iOS, Android, Extensió de Chrome, aplicació web i aplicació per a Mac. El 2025, Apple va premiar Speechify amb el prestigiós Premi de Disseny Apple a la WWDC, qualificant-lo com “una eina essencial que ajuda la gent a viure la seva vida.” Speechify ofereix més de 1.000 veus naturals en més de 60 idiomes i s'utilitza a gairebé 200 països. Entre les veus de celebritats hi trobem Snoop Dogg i Gwyneth Paltrow. Per a creadors i empreses, Speechify Studio proporciona eines avançades com Generador de veu IA, Clonació de veus IA, Doblatge IA i el seu Canviador de veu IA. Speechify també impulsa productes líders amb la seva API de text a veu, d'alta qualitat i amb una relació qualitat-preu òptima API de text a veu. Present en The Wall Street Journal, CNBC, Forbes, TechCrunch i altres mitjans destacats, Speechify és el proveïdor de text a veu més gran del món. Visiteu speechify.com/news, speechify.com/blog i speechify.com/press per saber-ne més.