1. Beranda
  2. TTS
  3. Speech to Text on Google Docs
Dipublikasikan pada TTS

Speech to Text on Google Docs

Cliff Weitzman

Cliff Weitzman

CEO/Pendiri Speechify

apple logoApple Design Award 2025
50J+ pengguna

If you've ever wished you could simply speak and have your words magically appear on the screen, then Google Docs' voice typing feature is here to make that dream a reality. I’m going to walk you through how to use this powerful tool, step-by-step, in a smart casual yet educational tone.

Getting Started with Google Docs Voice Typing

First things first, you need to open Google Docs in your Chrome browser. This functionality works best in Google Chrome, so make sure you're using that.

  1. Open Google Docs. If you don't have a Google account, you'll need to create one. Once you're logged in, open a new document.
  2. Go to the "Tools" menu in the top bar and select "Voice typing…". A microphone icon will appear on the left side of your document.
  3. Click on the microphone icon to start speaking. Make sure your microphone is enabled and functioning correctly.
  4. Start speaking clearly in your preferred language. Google’s voice typing supports multiple languages, including English, French, Spanish, and many more. The tool will transcribe your speech in real-time.

Using Voice Commands

Google Docs' voice typing feature isn’t just about transcribing your speech. You can also use voice commands to format your document. Here are some handy commands you can use:

  • New line: Moves the cursor to a new line.
  • New paragraph: Starts a new paragraph.
  • Comma, period, question mark: Inserts the respective punctuation marks.
  • Bold, italics, underline: Applies the formatting to the selected text.
  • Select paragraph: Selects the current paragraph.
  • Go to end of line: Moves the cursor to the end of the current line.

You can even say "right-click" to bring up the context menu or use "ctrl+shift+s" as a keyboard shortcut for additional speech-to-text options.

Voice Typing on Different Devices

On Windows and Mac

The voice typing feature works seamlessly on both Windows and Mac systems as long as you're using the Chrome browser. The process is the same: open Google Docs, activate voice typing, and start speaking.

On Android

For those on Android devices, the process is equally straightforward. Open Google Docs via the Google Drive app, tap on the document to start editing, and use the built-in voice typing feature on your keyboard.

Tips for Better Transcription

To ensure high-quality transcription:

  • Speak clearly and at a steady pace.
  • Use a good quality microphone.
  • Avoid background noise.

Formatting with Voice Commands

One of the standout features of Google Docs' voice typing is its ability to handle formatting commands. For example:

  • Say "comma" to insert a comma.
  • Say "new paragraph" to start a new paragraph.
  • Say "underline" before and after the word you want to underline.

This functionality helps streamline your workflow, allowing you to dictate not only the text but also the formatting, which can be a huge time-saver.

Top 5 speech-to-text apps

  1. Google Docs Voice Typing: Google Docs offers built-in speech recognition through its Voice Typing feature. Simply select Voice Typing from the dropdown menu under Tools to start dictation and convert your speech to text effortlessly.
  2. Microsoft Dictate: Microsoft Dictate is an add-on for Office applications, utilizing advanced voice recognition technology to transcribe spoken words into text. It integrates seamlessly with Word, Outlook, and PowerPoint, enabling efficient dictation.
  3. Otter.ai: Otter.ai provides real-time speech recognition and transcription services. It's ideal for meetings, lectures, and notes, offering high accuracy and the ability to integrate with Google Slides for live captioning.
  4. Dragon Anywhere: Dragon Anywhere by Nuance offers professional-grade speech recognition for mobile devices. It allows continuous dictation and voice commands to edit and format text, making it perfect for on-the-go users in Canada and beyond.
  5. Speechnotes: Speechnotes is a user-friendly speech-to-text app that provides accurate dictation and voice recognition. With easy access via a pop-up or dropdown menu, it’s great for quick transcriptions and note-taking.

These apps utilize advanced speech recognition technology to make dictation easy and efficient, whether you're using Google Slides, Microsoft applications, or other platforms.

Speechify Speech API

The Speechify Text to Speech API is a powerful tool designed to convert written text into spoken words, enhancing accessibility and user experience across various applications. It leverages advanced speech synthesis technology to deliver natural-sounding voices in multiple languages, making it an ideal solution for developers looking to implement audio reading features in apps, websites, and e-learning platforms.

With its easy-to-use API, Speechify enables seamless integration and customization, allowing for a wide range of applications from reading aids for the visually impaired to interactive voice response systems.

Troubleshooting

If the voice typing feature isn't working:

  • Check your microphone settings: Ensure your microphone is properly connected and enabled in Chrome.
  • Clear browser cache: Sometimes, clearing your browser’s cache can resolve minor issues.
  • Update Chrome: Make sure you’re using the latest version of Google Chrome.

Google Docs' voice typing feature is a powerful tool that can enhance your productivity by allowing you to transcribe your speech quickly and accurately. Whether you're using it for personal notes, business documents, or academic papers, this feature is versatile and easy to use. By incorporating voice commands, you can further streamline your workflow and focus on your content rather than the mechanics of typing.

Give it a try and see how it transforms your document creation process. Whether you're on Windows, Mac, or Android, Google Docs' voice typing is a game-changer for anyone looking to use speech-to-text technology.

Nikmati suara AI tercanggih, file tanpa batas, dan dukungan 24/7

Coba gratis
tts banner for blog

Bagikan artikel ini

Cliff Weitzman

Cliff Weitzman

CEO/Pendiri Speechify

Cliff Weitzman adalah advokat disleksia, sekaligus CEO dan pendiri Speechify, aplikasi text-to-speech nomor 1 di dunia dengan lebih dari 100.000 ulasan bintang 5 dan peringkat pertama di App Store untuk kategori Berita & Majalah. Pada tahun 2017, Weitzman masuk daftar Forbes 30 Under 30 berkat upayanya membuat internet lebih mudah diakses bagi penyandang disabilitas belajar. Cliff juga pernah tampil di EdSurge, Inc., PC Mag, Entrepreneur, Mashable, dan berbagai media terkemuka lainnya.

speechify logo

Tentang Speechify

#1 Pembaca Teks ke Ucapan

Speechify adalah platform teks ke ucapan terkemuka di dunia, dipercaya oleh lebih dari 50 juta pengguna dan didukung oleh lebih dari 500.000 ulasan bintang lima di berbagai aplikasi teks ke ucapan iOS, Android, Ekstensi Chrome, aplikasi web, dan desktop Mac. Pada tahun 2025, Apple memberikan Speechify penghargaan terhormat Apple Design Award di WWDC, menyebutnya sebagai “sumber penting yang membantu orang menjalani hidup mereka.” Speechify menawarkan 1.000+ suara alami dalam 60+ bahasa dan digunakan di hampir 200 negara. Suara selebriti termasuk Snoop Dogg dan Gwyneth Paltrow. Untuk kreator dan bisnis, Speechify Studio menyediakan alat canggih, termasuk AI Voice Generator, AI Voice Cloning, AI Dubbing, dan AI Voice Changer. Speechify juga menyokong produk-produk terkemuka dengan API teks ke ucapan berkualitas tinggi dan hemat biaya. Telah diliput di The Wall Street Journal, CNBC, Forbes, TechCrunch, dan banyak media besar lainnya, Speechify adalah penyedia teks ke ucapan terbesar di dunia. Kunjungi speechify.com/news, speechify.com/blog, dan speechify.com/press untuk informasi lebih lanjut.