In this guide, you will:

  1. 1️⃣ See 7 dictation tools ranked by offline support, price, and real user reports.
  2. 2️⃣ Learn when free built-in Win+H is enough and when it is not.
  3. 3️⃣ Pick the right tool for your workflow, whether that is any-app dictation, meeting notes, or pro use.
SayOnce - Voice Dictation Software Screenshot.
Eugene - CEO at SoftOrbits, Candidate of Technical Sciences, has more than 16 years of expertise in software development, photo and multimedia applications, enhancing and transforming digital images and videos.
📅 Last updated on:  2026-07-02

Looking for the best voice to text software for Windows because you are tired of typing? The honest answer is that there is no single winner. It depends on whether you value privacy, price, or raw convenience. Some tools run fully offline on your PC. Others send every word to a cloud server and charge a monthly fee for the privilege. A few are not really dictation tools at all; they transcribe meetings. Below we compare seven options, from the free Win+H feature already built into Windows 10 and 11 to paid desktop apps, so you can match a tool to how you actually work.

3x
faster than typing in a Stanford and Baidu study, where dictation entered English text about three times quicker than a smartphone keyboard, and with a lower error rate too
Source

That speed is why voice input keeps pulling people away from the keyboard. The catch is that the tools differ wildly in where your voice is processed and what they charge for it.
Disclosure: SoftOrbits makes SayOnce, one of the tools below. We ranked every pick using the same criteria, including our own, and kept the honest limits on each card so you can compare before you download.

What you will learn
Apply in 12 min Saves 3 hEasy

Quick comparison: 7 voice-to-text tools at a glance

ToolBest forPrice modelPlatformOffline?
SayOnce (our pick)Private any-app dictationOne-timeWindows 10/11Yes
Windows Voice Typing (Win+H)Free casual dictationFreeWindows 10/11No
Wispr FlowAI cleanup, cross-deviceSubscriptionWindows, MacNo
Dragon ProfessionalMedical/legal specialistsOne-time (check vendor)WindowsYes
Otter.aiMeeting transcriptionSubscriptionWindows, webNo
WhisperTypingWhisper-grade accuracySubscriptionWindowsNo
Braina ProDictation + voice assistantSubscriptionWindowsNo

How we picked (and what "our take" means)

TL;DR

We ranked on five things. The biggest is where your voice gets processed, locally or in the cloud, which is really the privacy question. The rest are the price model, whether it types into any app or just one, the accuracy people actually report, and a real track record on Windows 10 and 11.

Voice tools are easy to make sound good in a demo, so we used five criteria that hold up in daily use.

  • Offline and privacy. Does your audio stay on your PC, or does it get uploaded to a vendor's cloud? On Microsoft's own community forums this is the single most-requested feature.
  • Price model. A one-time purchase versus a subscription you pay forever.
  • Reach. Can it type into any window, or is it stuck inside one browser or one meeting app?
  • Accuracy that holds up in real use, judged from what users report on accents and technical terms, not from a vendor's marketing number.
  • Windows support plus a real track record on Windows 10 and 11.

A word on what "Our take" means on each card below. We ran the free and one-time tools first-hand on Windows 10 and 11. That means Win+H, our own SayOnce, and the Wispr Flow free tier. We did not buy a Dragon Professional license or every paid plan, so those verdicts rely on vendor docs, pricing pages, and paraphrased user reports. We flag that openly on the relevant cards, rather than pretend we tested everything for weeks.

The 7 best voice to text tools for Windows, reviewed

1. SayOnce: Voice Dictation Software - best for private, any-app dictation you buy once

SayOnce puts a global hotkey on your keyboard, and whatever you say lands as text in the window you are already using. Email, chat box, Word document - it types wherever your cursor already sits. What sets it apart is that recognition runs locally on your PC through an on-device model, so your audio never leaves the machine. It ships with three presets (Prose, Code, and Email), a personal dictionary, and snippets, and a batch mode turns audio and video files into plain-text transcripts across 25 languages. Full disclosure, since we make it: we still ranked it against the same yardstick as everyone else, and it has real limits.

6.34%
average word error rate for NVIDIA Parakeet TDT 0.6B v3, the on-device engine inside SayOnce, on the HuggingFace Open ASR Leaderboard (a benchmark across eight English datasets, not our own test)
Source

Pros:

Runs fully offline; your audio stays on your PC

One-time purchase, no monthly subscription

Types into any Windows app through a global hotkey

Batch-transcribes audio and video files to text

Prose, Code, and Email presets plus a personal dictionary

Cons:

Windows 10/11 only; no Mac or Linux build in v1

Rules-based cleanup, no AI proofread in v1 (it will not rewrite a rambling sentence for you)

25 speech languages, not the "100+" some cloud tools advertise

Newer brand with little third-party review history yet

Verdict: Pick SayOnce if offline privacy and a one-time price matter more to you than AI rewriting, and you want dictation that works in every app.
Our take: This is the tool we would reach for on a work PC where nothing should touch a cloud. We make it, so read the Cons above. Even so, the offline-plus-one-time combination is genuinely rare in this list.

If keeping your voice off the cloud is the deciding factor, try SayOnce: Voice Dictation Software on Windows 10 or 11.

2. Windows Voice Typing (Win+H) - best free starting point

Windows Voice Typing is already on your PC. Press Win+H in any text field on Windows 10 or 11 and start talking, with zero install and zero cost. For a few dictated messages a day it is fine, and it is the honest baseline every paid tool is measured against. The limits arrive fast, though. It runs on Microsoft's Azure Speech service, so per Microsoft Support your PC needs an internet connection. There is no offline mode. Users on Windows forums also report it stumbles on names and specialist vocabulary, and edit commands like "scratch that" often get typed out as literal text instead of deleting anything.

Pros:

Free and already built into Windows 10 and 11

Zero install; works instantly in any text field

Good enough for short, casual dictation

Cons:

Requires an internet connection; audio is processed in the cloud

Accuracy drops on technical terms, proper nouns, and strong accents

Voice-edit commands are unreliable ("scratch that" is hit or miss)

No presets, custom vocabulary, or batch transcription

Verdict: Start here to see if voice input suits you at all; move on when accuracy or privacy starts to bite.
Our take: We use Win+H for quick notes, and as the free default it does the job. We would not draft anything sensitive or long-form with it, though.

3. Wispr Flow - best for AI cleanup across devices

Wispr Flow is a slick, system-wide dictation app that cleans up your speech with AI as you go, so half-formed sentences come out tidier than you said them. People who like it report jumping from typing speed to well over 100 effective words per minute, and it works across Windows and Mac. Two recurring themes show up in user threads. The first is privacy. As one reviewer put it, everything you say gets sent to the cloud to be transcribed. That is fine for quick notes but uncomfortable for client data. The second is price and reliability. Its pricing page lists $15 a month (or $12 billed annually), which some users balked at paying forever, and there was a rough patch where the community complained loudly about support and stability after a funding round.

Pros:

AI cleanup makes dictated text read well with little editing

Works system-wide across any app, on Windows and Mac

Strong accuracy on conversational English

Cons:

Audio is sent to the cloud for processing; no true offline mode

$15/month (or $12/month billed annually) adds up over time

Free tier is capped at 2,000 words per week on desktop

Users have reported outages and shaky support during growth spurts

Verdict: A strong pick if you want AI-polished text and do not mind a subscription or cloud processing.
Our take: On the free tier it felt fast and genuinely useful; the cloud dependency and the monthly bill are what would push a privacy-minded Windows user elsewhere.

4. Dragon Professional - best for medical and legal specialists

Dragon Professional is the veteran of serious dictation, built for people who dictate documents all day and need deep customization, custom vocabulary, and voice commands. Nuance claims it dictates up to three times faster than typing with up to 99% accuracy, though that is the vendor's own figure, not an independent test. It runs offline on Windows, which many professionals need. The friction is cost and consistency. Nuance does not publish a price on its site. It is a contact-sales, one-time license, so check the vendor site for a current quote. And in medical communities, some long-time users say pricing keeps climbing and quality has felt inconsistent since Nuance's acquisition by Microsoft, with a few moving back to manual typing plus saved templates.

Pros:

Deep custom vocabulary and voice commands for specialist work

Runs fully offline on Windows

Strong track record in legal and medical dictation

Cons:

Pricing is quote-only (contact sales); expect a high one-time cost

Windows-only; the Mac version was discontinued

Users report weaker recognition outside English

Community reports of rising prices and inconsistent quality since the Microsoft acquisition

Verdict: Still the specialist's choice if you need custom vocabularies and can budget for a professional license.
Our take: We did not buy a Dragon license, so this verdict comes from Nuance's docs and user threads, not hands-on time. The capability runs deep, and so do the cost and the recent complaints.

5. Otter.ai - best for meeting transcription rather than any-app dictation

Otter.ai is excellent at one job that people often confuse with dictation, and that job is transcribing meetings. It joins your Zoom, Teams, or Meet calls and produces searchable, shareable notes with speaker labels. It is not a general any-app dictation tool. You cannot press a hotkey and dictate into a Word document with it the way you can with the others here. Users report accuracy suffers with strong accents, jargon, or people talking over each other, and speaker labels often come out as "Speaker 1 / Speaker 2." Per its pricing page, the free tier gives 300 transcription minutes a month, which longer sessions burn through quickly, and Pro runs $16.99 a month.

Pros:

Real-time meeting transcription with searchable, shareable notes

Native Windows desktop app plus web access

Good for capturing calls you would otherwise take notes on by hand

Cons:

Not an any-app dictation tool; it is built for meetings

Accuracy drops with accents, jargon, and overlapping speakers

Free tier capped at 300 minutes per month; Pro is $16.99/month

Privacy concerns when the bot auto-joins and emails transcripts to everyone

Verdict: The right tool if your problem is meeting notes, the wrong tool if you want to dictate into any app.
Our take: We would keep Otter for calls and pair it with a real dictation tool for everything else. They solve different problems.

6. WhisperTyping - best for Whisper-grade accuracy with rewrite modes

WhisperTyping is a newer desktop tool built on Whisper-family models, with a simple hotkey and AI rewrite and translate modes. Users on Capterra praise how it slots into email and messaging without switching apps, and it works in any Windows text field. It is worth being honest that its community footprint is still thin compared with Wispr Flow or Dragon, so there is less independent feedback to lean on. One recurring question in reviews is whether the free tier stays generous long-term, given the API costs behind the underlying models. The paid tiers scale up fairly quickly for full features.

Pros:

Whisper-family accuracy with AI rewrite and translate modes

Works in any Windows text field via hotkey

Generous free tier to start

Cons:

Thin review history; a newer, less-proven product

Cloud-based processing on the AI features

Subscription tiers climb quickly ($5 to $30/month) for full features

Verdict: Worth a look if you want Whisper-grade accuracy and rewrite tools and do not mind an evolving product.
Our take: The Whisper lineage is a good sign for accuracy, but we would watch how the free tier and pricing settle before committing.

7. Braina Pro - best for dictation bundled with a voice assistant

Braina Pro pairs dictation with a Windows voice assistant, so you can dictate text and also fire off PC commands and simple automations by voice. Reviewers on Capterra call its accuracy solid and note it copes well with background noise and recognizes multiple speakers without training a profile. One even preferred it to Dragon for their use case. The trade-offs are that it needs an internet connection to work, it is Windows-only, and the onboarding is thin, so there is a trial-and-error learning curve. Support is described as slow, and the price sits at $99/year for one PC.

Pros:

Bundles dictation with a voice assistant and PC commands

Handles background noise and multiple speakers well

More affordable than a Dragon license

Cons:

Needs an active internet connection to work; no offline mode

Windows-only with weak onboarding and documentation

Subscription at $99/year; support reported as slow

Dictation punctuation can disappoint despite clear speech

Verdict: A fit if you want voice control of your PC alongside dictation and can live with a subscription.
Our take: The assistant angle is the real draw; as a pure dictation tool it faces stiffer competition on this list.

Which voice-to-text tools work offline, without the cloud?

TL;DR

Only two of the seven run fully offline. Those are SayOnce and Dragon Professional. The rest send audio to a cloud server to transcribe it, and that includes the free Win+H along with Wispr Flow, Otter, WhisperTyping, and Braina.

Offline matters more than it looks. When a tool is offline, your voice is turned into text by a model on your own PC, so nothing gets uploaded. That is why people dictating emails, client information, or medical notes keep asking for it, and why cloud-only tools keep drawing privacy complaints on Reddit and Microsoft's forums. Among the seven, SayOnce and Dragon Professional are the offline options. Dragon is powerful but gated behind quote-only pricing, which leaves SayOnce as the offline pick you can just buy. Everything else, including the free Win+H, needs an internet connection because the recognition happens on the vendor's servers, not yours.

Cloud dictation

You talk, your audio streams to a vendor's server, and the text comes back. Fast, but every word passes through someone else's machine and you have to be online.

On-device dictation

A model on your own PC turns speech into text right there. Nothing gets uploaded, and it keeps working with no internet connection.

For client emails, medical notes, or legal text, remember that every cloud tool ships your recording off to a remote server for processing. That can collide with an NDA, patient confidentiality, or plain client trust. When the words are sensitive, an offline tool that keeps everything on your PC is the safer default.

Is Windows Voice Typing (Win+H) good enough on its own?

TL;DR

For a few casual messages a day, yes. Once you need long documents or offline work, or edit commands that actually fire, no. That is where a dedicated tool pays off.

Windows does have a built-in voice-to-text feature, and it costs nothing, so it is the right first stop. Press Win+H and try it. Dictate the odd message or search box and you may never need more. But Win+H has hard edges. Because it processes speech in the cloud, Microsoft Support confirms you must be online to use it, so there is no offline option. Accuracy slips on unusual names and jargon, and edit commands are inconsistent. One long-running Windows forum thread notes "scratch that" often gets typed instead of executed. On Microsoft's community hub, people ask for a better tool for exactly these reasons. There is no custom vocabulary and no batch transcription. If any of that describes your day, a dedicated Windows voice typing alternative earns its keep.

Which voice-to-text software for Windows is the most accurate?

TL;DR

It depends on your speech and setup, but real-world reports favor offline or Whisper-based tools for clean English, and Dragon for trained specialist vocabulary. Built-in Win+H trails on technical terms and accents.

Accuracy is not one number. Vendors love headline figures. Nuance claims up to 99% for Dragon, but those are marketing claims, not independent tests, so treat them as a claim, not a verdict. What users actually report is more useful. Built-in Win+H struggles with complex sentences, proper nouns, and strong accents, per repeated threads on Microsoft's community hub. Whisper-based tools and modern on-device models generally handle clean English well. Dragon shines when you train it on a specialist vocabulary, but users say it lags outside English. And accuracy is only half the story: a tool that transcribes perfectly but cannot reliably delete a word on command still slows you down. If precise voice-edit commands matter, test that specifically before you commit.

Accuracy also depends heavily on who is speaking, and that is not a soft claim. A peer-reviewed study measured it directly.

35% vs 19%
average word error rate for Black speakers versus white speakers across five major speech engines (Amazon, Apple, Google, IBM, Microsoft) in a peer-reviewed PNAS study, hard evidence that accent and dialect move accuracy more than any spec sheet
Source

Your microphone matters, and so does the noise around you. Speaking pace plays a part, and overlapping voices are the hardest case of all. That is why one person's "flawless" tool is another's daily frustration.

Watch the gap between the spec sheet and daily use. Microsoft's own docs say Win+H just works in any text box, yet in practice it needs a live internet connection and stumbles on names and jargon. Nuance markets Dragon at up to 99% accuracy, but some medical users report the recognition has felt inconsistent since the Microsoft acquisition. Treat a vendor's accuracy number as a claim to test, not a promise.

One-time price vs monthly subscription: which is cheaper long-term?

TL;DR

A one-time purchase wins on total cost once you pass the first year or two. Subscriptions make sense if you need cross-device sync or constant AI updates and would rather pay monthly.

The math is simple. A subscription like Wispr Flow at $15/month is $180 a year, every year. Otter's Pro plan runs $16.99/month. A one-time tool like SayOnce is paid once at around $49 and then it is yours. Over three years, that is the difference between roughly $50 and well over $500.

$540 vs $49
three years of a $15/month subscription (Wispr Flow) adds up to $540, against about $49 once for a one-time tool like SayOnce, a gap of roughly $491 for the same daily task
Source

This is exactly the frustration that shows up in user threads. People resent paying a recurring fee just to dictate a few prompts, and medical users report Dragon's pricing creeping upward. Subscriptions are not automatically bad; if you want your dictation synced across a phone and two laptops with regular AI improvements, monthly billing buys that. But if you work on one Windows PC and want to stop paying, a one-time license is the cheaper road.

Any-app dictation vs meeting transcription: are they the same?

TL;DR

No. Any-app dictation types your speech into whatever window you are in, whether that is an email, a doc, or a chat. Meeting transcription (Otter's job) records a call and turns the conversation into notes. Do not buy one expecting the other.

This trips people up constantly. Any-app dictation tools put a hotkey on your keyboard and drop text at your cursor, wherever that cursor happens to be. SayOnce, Win+H, Wispr Flow, WhisperTyping, and Braina all work this way. That is what most people mean by voice-to-text. You talk, and the words appear where you are typing. Meeting transcription is a different job. Otter.ai listens to a Zoom or Teams call and produces a shared, searchable transcript with speaker labels after the fact, and it does not dictate into your Word document. Turning an existing recording into text is different again; for that, see our video-to-text converter guide. Some tools also lock you to one place. Google Docs voice typing only works inside a browser and only in Docs, and Speechnotes is a browser tool, not a system-wide one. Decide which job you actually have, typing by voice or capturing meetings, before you compare features.

How to choose the right voice-to-text software for Windows

TL;DR

Start with the free Win+H to test the habit. Need offline privacy or a one-time price? Look at SayOnce. Want AI cleanup and cross-device sync? Wispr Flow. For specialist medical or legal work, Dragon. For meetings, Otter.

Choosing the right speech to text software for Windows is easier once you name your priority. Never dictated before? Spend a week with the free Win+H and see whether talking-to-type suits you at all; there is no reason to pay before you know. Once privacy becomes the deciding factor, pick an offline, one-time tool so your audio never leaves the PC. For AI polish across several devices, a cloud subscription like Wispr Flow fits, as long as you accept the monthly cost and cloud processing. And when your real problem is meeting notes rather than typing, Otter is the answer, not a dictation app.

To make that concrete, here is how the picks map to real people:

  • The author or long-form writer drafting chapters or articles wants no monthly word cap and no cloud sitting between them and the page. SayOnce (one-time, offline) fits, or Wispr Flow if you value AI polish over privacy.
  • The developer dictating comments, commit messages, and pseudocode needs technical terms to survive. SayOnce's Code preset sidesteps the word filter that trips up Win+H on legitimate jargon, and WhisperTyping's rewrite modes are an option.
  • The clinician with a medical vocabulary still leans to Dragon and its trained dictionaries, eyes open on the climbing price and the recent quality complaints.
  • The student or anyone with RSI who needs hands-free input for accessibility can start free with Win+H, or with Microsoft's Voice Access, which is built around hands-free control rather than fast drafting.
  • The meeting-heavy manager does not need dictation at all; they need Otter to capture and summarize calls.

For professional dictation with a trained vocabulary - the medical or legal work where a word filter blocking legitimate terms is a daily headache on the built-in tool - Dragon is still the specialist route, priced accordingly. Working in healthcare? Our guide to medical dictation software digs deeper into that niche.
For a broader look at desktop dictation beyond this Windows list, see our roundup of the best dictation software. Match the tool to the job and the trade-offs sort themselves out.

One last practical note: if noisy recordings are dragging your accuracy down, cleaning the audio first with noise reduction software often helps before you transcribe.

Also considered: tools that did not make our top 7

TL;DR

We looked at more than seven. These are the ones we left off, with the honest reason for each. A few are browser-only, some are locked to Office, a couple do a different job altogether, and others are simply too young to judge fairly.

Naming what we cut, and why, is as useful as the shortlist itself. It shows the top seven are picks, not just the first results we found.

  • Google Docs Voice Typing. Free and decent, but it runs only inside a browser and only inside Google Docs or Slides, per Google's own documentation. It is not any-app dictation and not a standalone Windows program, so it cannot type into your email client or a chat box.
  • Speechnotes. A browser-based notepad with no offline mode; any-app use comes only through a Chrome extension, not a system-wide hotkey. Fine for jotting notes in a tab, not a full desktop tool.
  • Microsoft Dictate (Office add-in). Works only inside the main Office apps such as Word and Outlook, not system-wide the way Win+H is. If you spend your day outside Office, it does not help.
  • Sonix.ai. Not live dictation at all. It transcribes recordings you have already made, a different job, and it bills per hour of transcription. For turning existing files into text, that is a category of its own.
  • Superwhisper. A real, capable tool that does have a Windows build, but that build launched in 2025 and is newer and less polished than its flagship Mac version, with some features still Mac-only. Worth watching as the Windows side matures.
  • VoiceDash and Talkpad. Both are genuine products, though Talkpad's Windows app is still waitlist or beta rather than a full download. They are just very young. VoiceDash launched in early 2025 and Talkpad is a small Sydney release. Both run on cloud back-ends and have thin third-party track records. Independent reviews are still scarce, so it is too early to rank them against established picks.

SayOnce - Voice Dictation Software SayOnce - Voice Dictation Software
We compared the best voice to text software for Windows in 2026 - offline vs cloud, one-time vs subscription, any-app vs meetings. 7 picks ranked.
SayOnce - Voice Dictation Software Screenshot.


🙋Frequently Asked Questions

Yes. Windows 10 and 11 include Windows Voice Typing. Press Win+H in any text field to start dictating. It is free and needs no install, but it requires an internet connection and is best for short, casual use rather than long or technical documents.

Win+H is already built into Windows 10 and 11, so there is nothing to download for basic dictation. Several third-party tools also offer free tiers. Wispr Flow gives 2,000 words per week, and WhisperTyping has a free plan, though paid versions add more. SayOnce offers a free trial with a daily dictation cap before you buy.

There is no single winner. Accuracy depends on your accent, your microphone, and your vocabulary. Whisper-based and on-device tools handle clean English well; Dragon leads for trained specialist terms but lags outside English. Built-in Win+H is the weakest on technical words and strong accents, based on repeated user reports.

Press the Windows key and H together (Win+H) in any text field, then start speaking; the words appear at your cursor. To stop, press Win+H again or say "stop listening." You will need a working microphone and an internet connection for the built-in tool.

Yes. SayOnce and Dragon Professional both run recognition locally, so your audio stays on your PC with no cloud upload. Most other options, including the built-in Win+H, Wispr Flow, and Otter, send audio to a server to transcribe it, so they need an internet connection.

Dragon Professional is still the specialist standard, with custom vocabularies and voice commands built for legal and medical work, and it runs offline. Pricing is quote-only, so check the vendor site. Some users have moved to newer offline or Whisper-based tools as Dragon's price has risen.

Some tools do this better than others. Built-in Win+H supports basic commands, but users report that ones like "scratch that" often get typed as text instead of deleting anything. If reliable voice editing matters, test that specific feature on any tool before you commit.

Yes, the same Windows Voice Typing found in Windows 10. Press Win+H to open it in any text field. Windows 11 also includes Voice Access, which is aimed at hands-free control and accessibility rather than fast document dictation, and it supports a shorter list of languages.

Sources