After reading this guide, you will be able to:

  1. 1️⃣ See 8 real alternatives to Windows' built-in dictation, ranked with honest pros and cons.
  2. 2️⃣ Know which ones work offline and keep your voice off the cloud.
  3. 3️⃣ Pick the right one for your apps, your languages, and your budget.
SayOnce - Voice Dictation Software Screenshot.
Eugene - CEO at SoftOrbits, Candidate of Technical Sciences, has more than 16 years of expertise in software development, photo and multimedia applications, enhancing and transforming digital images and videos.
📅 Last updated on:  2026-07-02

Windows already gives you two ways to talk instead of type, and this guide is about what to use when they let you down. Win+H voice typing leans on Microsoft's cloud and wants an internet connection every session. Voice Access runs on your PC but reaches only about 15 locale variants (six of them English) and was built for hands-free control first. If you have hit either wall, here are 8 Win+H and Voice Access alternatives for Windows that fix one gap or both, starting with SayOnce, our own on-device dictation app. This roundup is for Windows 10 and 11 users who want a clear pick rather than a feature checklist.

Disclosure: SoftOrbits makes SayOnce, our voice dictation software. We ranked every tool below on the same criteria, including our own, so you can compare before you download.

What you will learn
Apply in 9 min Saves 3 hEasy

Why look for a Win+H or Voice Access alternative?

TL;DR

Each built-in tool has one hard limit. Win+H needs the internet and sends your speech to Microsoft's cloud, while Voice Access stays on your PC but reaches only about 15 locale variants (mostly English) and leans on voice commands rather than clean writing.

Windows dictation is genuinely useful until you reach the edge of what it was designed for. Win+H, or Windows Voice Typing, the panel you open with the Windows key plus H, is the one most people know. The panel is free and needs no install, yet Microsoft's own documentation is clear that voice typing requires an internet connection and routes your microphone audio to cloud speech services. For anyone on a shaky connection, a locked-down work laptop, or who simply does not want their voice leaving the machine, that alone ends the conversation before the first word.

Reliability is the other sore spot, and it shows up in real user threads rather than in reviews. On the Windows 11 forum, people report the Voice Access toggle refusing to stay on and Win+H breaking after updates, alongside error codes and dictation that stops once the PC wakes from sleep. Punctuation frustrates people too. A long-running Microsoft Q&A thread describes the auto-punctuation setting quietly resetting between sessions, so you switch it back on again and again. That last one wears people down fastest, because it turns a hands-free tool into a fix-it-by-hand tool. None of this makes the built-in options bad. It makes them a starting point that a lot of Windows users outgrow.

Win+H vs Voice Access: which one are you fighting with?

TL;DR

Win+H is cloud voice typing that needs the internet. Voice Access is a separate on-device feature in Windows 11 that works offline once a language pack is downloaded, but it covers roughly 15 locale variants and centers on voice commands.

Separating the two matters, because they fail for opposite reasons and the right replacement depends on which one you use. Win+H, the voice typing panel, is the quick way to dictate into any text box. Its limit is architectural. The panel is cloud based, so no internet means no dictation, and your audio always makes the round trip to a server. Voice Access is the newer accessibility feature in Windows 11 version 22H2 and later. Microsoft states that Voice Access lets you control your PC and write text without an internet connection, and the feature genuinely runs on your machine once the first-time language file download finishes. So Voice Access is not a cloud tool at all. Its limit is coverage. The feature officially supports about 15 locale variants, and it is designed around spoken commands like "click" and "scroll down" as much as free dictation. Write in a language outside that short list, or just want a plain hold-a-key-and-speak flow without command grammar, and neither built-in tool is a great fit. The picks below close that gap, and choosing the right Win+H alternative comes down to a few questions we work through next.

Quick comparison: 8 Windows dictation alternatives

ToolBest forProcessingPrice modelPlatform
SayOnce (ours)One-time price plus on-device privacyOn-deviceOne-timeWindows 10/11
WhisperstreamCheapest local, no subscriptionOn-deviceOne-timeWindows
Wispr FlowAI cleanup with cross-device syncCloudSubscriptionWin/Mac/iOS
Dragon ProfessionalCustom vocabulary and voice commandsOn-deviceOne-time (high)Windows
SuperwhisperOn-device with an optional cloud boostOn-device / cloudFree / subscriptionWin/Mac/iOS
WillowLow-latency cloud dictationCloudSubscriptionWin/Mac/iOS
VoicySimple shortcut dictation everywhereCloudSubscription / lifetimeWin/Mac/Browser
OpenWhisprFree, open-source, self-hostedOn-device / BYOK cloudFreeWin/Mac/Linux

How we picked (and tested on Windows 11)

TL;DR

We scored each tool on five things that shape daily use. Those are offline privacy, accuracy and punctuation, whether it works in every app, the price model, and setup effort.

A feature list will not tell you whether a tool is pleasant to live with, so we ranked on the qualities that show up in real complaints about the built-in options. Offline privacy came first. Accuracy and punctuation came next, since punctuation is where Win+H frustrates people. Then we weighed system-wide use, because a tool that types into any window beats one locked to a single editor. Price model mattered too, as a monthly fee stacks up for anyone who dictates daily. Setup mattered less. On the "Our take" lines below, we speak from hands-on use for the tools we can actually run on Windows, which means our own SayOnce plus the free and open-source options. For the paid-only tools like Dragon we go by their documentation and track record, plus public user reports, and we say so rather than pretending we bought a $699 license.

The 8 best alternatives, reviewed

1. SayOnce, our pick for one-time-price private dictation

SayOnce is our own tool, and we built it for the one thing Win+H cannot do, which is dictate without ever touching the cloud on software you pay for once. Recognition runs on your machine with the NVIDIA Parakeet V3 model, about 478 MB downloaded on first run, and it is CPU-first so a graphics card stays optional. You hold a global hotkey, speak, and the text drops into the app you are working in, whether that is Word, a browser, or a VS Code editor. Three rules-based presets clean up the output. Prose mode fixes punctuation and capitalization, Code mode leaves your identifiers alone, and Email mode tightens a short message. A personal dictionary catches names the model would miss, and a Files tab batch-transcribes recordings to plain text. Think of it as the offline, buy-once end of our wider roundup of the best dictation software for Windows.

Pros:

Pay once, no subscription

Audio stays on your PC, fully offline after setup

Types system-wide from one keyboard shortcut

Batch-transcribes audio and video files to text

Cleans up output with presets and a personal dictionary

Cons:

Windows 10 and 11 only, no Mac or mobile build

Recognizes 25 languages, not the 100-plus some cloud tools advertise

Presets are rules-based, so they will not rephrase or repair a genuine misread the way an AI cleanup tool does

Free tier stops at 21 dictations a day before it prompts you to buy

Verdict: Choose SayOnce if you want accurate Windows dictation offline. It pastes into any app and costs one payment, not a monthly bill.
Our take: This is the tool we reach for on our own Windows 11 machines. Local processing is the reason. Nothing waits on a server, and the Code preset alone earns its keep in an editor.

2. Whisperstream, the cheapest way to go fully local

Whisperstream sits closest to our own thinking on this list. Processing happens on your machine, you pay once, and there is no subscription anywhere. The app runs the NVIDIA Parakeet model locally, supports CPU-only machines, and uses a push-to-talk key that drops text into any window. At $29 one-time on the vendor's site, with a 30-day refund, it is the budget choice for anyone who wants their voice to stay on the machine and refuses to rent software by the month.

Pros:

One-time $29 price (per the vendor site), no subscription

On-device processing, so audio never leaves the PC

Runs on CPU-only machines and works in any window

30-day refund lowers the risk of trying it

Cons:

Newer, smaller vendor with a thinner review trail than the big names

Windows-focused, so cross-platform users get less

Fewer polish features than the pricier AI tools

Verdict: Get Whisperstream if your priorities are local processing and a rock-bottom one-time price, and you do not need a big-brand support history.
Our take: We have not run it for months. But the pitch is honest, and it matches what Windows users in "not cloud-only" threads keep asking for.

3. Wispr Flow, the polished cloud option with AI cleanup

Wispr Flow is the most refined of the cloud tools. It dictates system-wide and cleans up your speech with AI, so filler and false starts get trimmed automatically, then syncs your settings across Windows, Mac, and iOS. If you live across several devices and want dictation that reads edited rather than raw, this is the one people point to. The trade-off is the model itself. Wispr Flow is cloud based, so your audio leaves the device and you need a connection, and it bills monthly rather than once. Pricing on the vendor site runs in the low-teens per month depending on billing. When Wispr Flow specifically is what you are weighing, we go deeper in our Wispr Flow alternatives guide.

Pros:

Strong AI cleanup that trims filler on its own

Works system-wide across your apps

Syncs your settings across Windows and Mac, with an iOS app too

Cons:

Cloud-only, so your audio leaves the device and you need internet

Subscription pricing rather than a one-time purchase

Overkill if you only dictate on one Windows PC

Verdict: Pick Wispr Flow if you want cross-device dictation with AI cleanup and a subscription does not bother you.
Our take: The cleanup is the selling point, and it is good, but it is also the reason your voice has to reach a server, which is the exact thing many people leave Win+H to avoid.

4. Dragon Professional, the accuracy benchmark that is coasting

Dragon has been the serious name in dictation for years. After training, its accuracy and voice-command control still set the bar others get measured against. The engine runs offline, and it handles long documents and custom vocabulary better than almost anything. The problems sit around that engine. Users describe reading training passages for a good while before it fully understands them, the consumer edition was discontinued after Nuance joined Microsoft, and no new mainstream version has shipped in roughly three years. Pricing lands around $699 one-time based on reseller listings. Nuance does not publish a single clean consumer price page, so treat that figure as a reseller number rather than an official one.

Pros:

Top-tier accuracy and custom vocabulary once trained

Deep voice commands for editing and control

Runs offline

Cons:

Around $699 one-time (reseller listings), the priciest pick here

Long training before it performs at its best

Consumer edition discontinued, and it feels in maintenance mode

Windows-focused and heavy to set up

Verdict: Consider Dragon only if you need its command depth for professional or specialist work and can absorb the cost and setup.
Our take: We have not bought a current license, so we go by its documented feature set and long track record. The recognition pedigree is real, the momentum is not.

5. Superwhisper, on-device with an optional cloud boost

Superwhisper started on Mac and now ships a Windows build. It can run small speech models on your machine for free, keeping audio local, and it offers larger, more accurate models on paid tiers. That flexibility is the appeal, since you choose between fully local and a cloud boost as your needs change. The caveat is the maturity of the Windows build. The tool's own public feedback board carries tickets about the Windows app crashing during dictation or freezing the app you are dictating into, so test it on your setup before you rely on it. Current pricing is best checked on the vendor site, since the tiers move around.

Pros:

On-device option keeps audio local at no cost

Optional larger models for higher accuracy

Runs on Windows and Mac, with an iOS app too

Cons:

Windows build has reported crashes and freezes

Mac-first history, so Windows can lag behind

Best models sit behind a paid tier

Verdict: Try Superwhisper if you want the choice between local and cloud and you are willing to test the Windows build first.
Our take: The local-or-cloud switch is a smart idea. Just go in expecting to verify stability on your own machine, given the public bug reports.

6. Willow, built around speed

Willow (Willow Voice) sells one thing hard, and that thing is low latency. The vendor says text appears within about 200 milliseconds and claims it beats built-in dictation on accuracy by a wide margin, and the app learns your vocabulary over time. Treat those as the vendor's figures rather than independently tested numbers, but speed is clearly the focus. Willow is cloud based and subscription priced, and Windows support arrived only in early 2026, so it is younger on this OS than on Mac.

Pros:

Vendor cites very low latency (about 200 ms) for near-instant text

Adapts to your vocabulary over time

Runs across Windows, Mac, and iOS

Cons:

Cloud-only, so audio leaves the device and you need internet

Subscription pricing

Windows support only arrived in early 2026, so it is young here

Verdict: Look at Willow if raw dictation speed is your top priority and you are comfortable in the cloud.
Our take: The latency pitch reads well on paper. We would want to test the speed and accuracy claims against a real accent before committing, since both are vendor-stated.

7. Voicy, simple shortcut dictation everywhere

Voicy aims for a clean, shortcut-driven experience that works across your apps with little fuss. The tool fits people who want to press a key and talk without configuring much. One detail matters for this list. Voicy routes audio to the cloud, by way of Groq, at every tier, so there is no on-device option at any price. If offline is a must for you, that rules it out. If offline is not a must, the simplicity is real. Check the vendor site for current subscription and lifetime pricing.

Pros:

Simple, shortcut-first workflow

Works across many apps

Offers a lifetime option alongside subscriptions

Cons:

Cloud-only via Groq, with no on-device tier at any price

Needs an internet connection to work

Not an option for privacy-first or offline users

Verdict: Voicy suits people who want dead-simple dictation everywhere and do not need offline processing.
Our take: As a plug-and-talk cloud tool it is pleasant, but "no local option at any price" is a hard stop for the offline crowd this guide is written for.

8. OpenWhispr, the free open-source route

OpenWhispr is the pick for technical users who want full control at zero cost. The project is MIT-licensed and open-source, runs local Whisper or NVIDIA Parakeet models on your machine, and can optionally call a cloud model with your own API key. Nothing is billed, and nothing phones home unless you tell it to. The cost is your time, because setup is a do-it-yourself affair and there is no formal support line when something breaks.

Pros:

Free and open-source (MIT), with no telemetry

Runs local Whisper or Parakeet models on your machine

Optional cloud model with your own API key

Cons:

Technical setup, not for non-technical users

No formal support

Less polished than the commercial tools

Verdict: Choose OpenWhispr if you are comfortable with a bit of setup and want a free, private, self-hosted stack.
Our take: We like that it exists. For a developer who wants local dictation without paying, it is a genuinely good answer, though a non-technical user will bounce off the setup.

Which alternatives work offline and keep your voice private?

TL;DR

For fully offline dictation, look at SayOnce and Whisperstream, plus OpenWhispr and Dragon. The cloud tools (Wispr Flow, Willow, Voicy) send your audio to a server, and so does Win+H.

Privacy is the single biggest reason people leave Win+H, so it deserves a direct answer. The on-device picks keep your voice on the machine, and that group is SayOnce and Whisperstream, plus OpenWhispr and Dragon Professional. Superwhisper belongs there too when you run its local models, though its larger models are cloud based. On the other side, those cloud tools all process on a server, which is what makes their AI cleanup and device syncing possible while also meaning your audio leaves your PC. There is a useful docs-versus-practice note here. Voice Access runs on your machine, so the built-in option is not automatically the least private one. The privacy problem is specifically Win+H, the shortcut most people actually use. The framing is not "built-in bad, third-party good." Win+H is the one real privacy problem here. Want the hotkey without the cloud? An on-device pick like ours is the fix.

Which ones type into every app, not just a text box?

TL;DR

Almost every dedicated pick here pastes system-wide through a global hotkey, and OpenWhispr does the same once set up. Voice Access and Win+H work in standard fields but stumble in some older or non-standard apps.

A dictation tool is only as good as the apps it can reach. The common complaint with the built-in options is that they behave in a plain text box yet turn flaky in legacy software or non-standard editors, and the Win+H panel can even drop text if focus shifts at the wrong moment. The dedicated tools take a different approach. They capture your speech and paste the finished words into the active window, so a code editor behaves the same way as a chat box or a Word document. That is why our own tool and most of the paid options advertise "works everywhere" so heavily. It is the practical fix for the app-compatibility gap. When your main job is dictating into Microsoft Word specifically, that has its own quirks, and we cover them in our guide to voice to text in Word.

Which is most accurate, and does the punctuation behave?

TL;DR

Dragon still leads on trained accuracy. For everyday use the modern tools are close, and the real difference is punctuation handling. Rules-based cleanup like our presets is fast and local, while AI cleanup like Wispr Flow's reads better but needs the cloud.

Accuracy questions usually come down to how well the engine hears you and what it does with punctuation. On raw recognition after training, Dragon is still the benchmark. For everyday dictation the gap has narrowed, and most people will not notice a big difference between the modern engines on clear speech. Punctuation is where they split. The built-in tools frustrate people because, as one Microsoft Q&A thread describes, a command like "period" sometimes gets typed out literally and quotes do not always close. The alternatives handle this two ways. Rules-based cleanup, which is what our presets use, applies punctuation and capitalization locally and instantly, though it will not rewrite a genuine misread. AI cleanup, like Wispr Flow's, reads more naturally because a model rewrites the text, at the cost of sending your audio to a server. Neither wins outright. You are trading privacy for polish.

Free, one-time, or subscription: what will it cost?

TL;DR

The free route covers the built-in Win+H and Voice Access, plus OpenWhispr and Talon. One-time purchases are SayOnce and Whisperstream at the low end, with Dragon far pricier. Subscriptions are the cloud tools (Wispr Flow, Willow, Voicy). Match the model to how often you dictate.

Price model matters more than the sticker number when you dictate every day. The free options start with the built-in Win+H and Voice Access, then add open-source OpenWhispr and, for developers, Talon. One-time purchases run from our own SayOnce and Whisperstream at the affordable end up to Dragon at the high end, which is around $699 on reseller listings. Those cloud subscriptions generally run in the low-teens per month depending on billing, and each vendor's page carries the current figure since these change often. The math is simple. Dictate daily for years and a one-time tool usually wins on total cost, while a subscription makes sense when you want continuous AI cleanup and device syncing and are fine paying for it. For a wider price-by-price breakdown across more tools, our companion roundup of dictation apps lays them out side by side.

How many languages do they cover versus Voice Access's ~15 locales?

TL;DR

Voice Access supports about 15 locale variants, six of them English. SayOnce covers 25 speech languages and 20 interface languages. Cloud tools often advertise the highest counts, so check that your language is genuinely supported before you buy.

Language coverage is where the built-in Voice Access is narrowest, and it is a common reason to look elsewhere. Microsoft lists about 15 supported locale variants for Voice Access, six of them English and the rest a handful of major languages with a handful of other major languages. Write in something outside that list, and the built-in option simply is not for you. Among the alternatives, our own tool recognizes 25 speech languages and ships a 20-language interface, which covers most of Europe plus Russian and Ukrainian. The cloud tools tend to advertise the biggest numbers, sometimes past 100, because a server can host many models at once. The practical advice holds either way. Do not trust the headline count, find your specific language on the vendor's supported list, and test it with your own accent before you commit.

Meetings and accessibility: Otter and Talon

TL;DR

Otter.ai and Talon Voice solve adjacent problems. Otter is for meeting transcription, not live dictation into your apps. Talon is a free, powerful hands-free control tool for developers and accessibility users, though the setup is steep.

Two more tools come up in these searches, and both are worth naming even though neither is a general Win+H replacement. Otter.ai is excellent at meeting transcription and speaker notes, yet it turns a recording into a document after the fact rather than typing into your apps live, and it is cloud based with no offline mode, so it answers a different question than "replace my dictation hotkey." Talon Voice sits at the other end. The tool is a free, on-device option for full hands-free computer control, popular with developers and in repetitive-strain and accessibility communities, and it does far more than dictate. The trade-off is a steep configuration curve that most casual users will not want to climb. When your real need is meetings, look at Otter. When it is voice coding or accessibility, Talon is the serious option.

SayOnce - Voice Dictation Software SayOnce - Voice Dictation Software
Win+H needs the cloud and Voice Access covers only about 15 languages. Here are 8 offline, cross-app dictation alternatives for Windows - our picks compared.
SayOnce - Voice Dictation Software Screenshot.


🙋Frequently Asked Questions

For a free and private option, OpenWhispr is the strongest pick because it is open-source and runs on your machine, though it needs a technical setup. Voice Access is the free built-in choice when your language sits among its roughly 15 supported locales and you mainly need on-device dictation.

No. Microsoft's documentation states that Win+H voice typing needs an internet connection and sends your audio to cloud speech services. For offline dictation, use an on-device tool such as SayOnce or Whisperstream, or use Voice Access, which runs locally after its first-time language download.

No. Win+H is cloud-based voice typing for any text box. Voice Access is a separate Windows 11 accessibility feature that runs on your machine and centers on voice commands as well as dictation, but it supports only about 15 locale variants, most of them English.

The on-device tools keep audio on the PC, and that group is SayOnce and Whisperstream, plus OpenWhispr, Dragon and Superwhisper's local models. The cloud tools process on a server, so with those your audio is sent off the device.

Yes. SayOnce and Whisperstream are one-time purchases, and Dragon Professional is a one-time, though expensive, license. The cloud tools bill monthly, so a one-time tool usually costs less over time when you dictate regularly.

Users report on Microsoft's Q&A that the auto-punctuation setting for the built-in tools does not always persist between sessions, so it reverts and has to be re-enabled. Dedicated alternatives handle punctuation with their own cleanup, either rules-based and local or AI-based in the cloud.

Yes for the dedicated tools. SayOnce and most paid alternatives paste text into the window you are working in, including code editors and terminals. The built-in tools work in standard text fields but can be unreliable in some legacy or non-standard apps.

Sources