🟣🎙️
4 Ways to Dictate into Obsidian
System hotkey: MetaWhisp Right Option
Built-in: Apple Dictation Fn-Fn
Plugin path: Obsidian Whisper
Capture-first: Drafts → Obsidian
TL;DR: Four practical ways to dictate into Obsidian on Mac in 2026. (1) Use a system-wide voice-to-text app like MetaWhisp with global hotkey — works in Obsidian like any other app, free, on-device. (2) Apple's built-in Dictation via Fn-Fn — free but lower accuracy, frequent issues in Electron apps. (3) Obsidian community plugins like Whisper or Voice2Text — direct file-level recording inside the vault. (4) Drafts as a capture layer — dictate to Drafts via iOS or Mac, action-send to Obsidian vault. For most Obsidian users, the system-wide hotkey approach is the lowest-friction option because it works in every part of Obsidian (editor, command palette, search) without plugin overhead.
Four methods to dictate into Obsidian on Mac diagram showing MetaWhisp hotkey Apple Dictation Whisper plugin Drafts pipeline workflows

Which Dictation Method Works Best with Obsidian?

The right answer depends on what part of Obsidian you're dictating into and how much setup friction you're willing to accept: I'm Andrew Dyuzhov, solo founder of MetaWhisp. I built MetaWhisp as a system-wide voice-to-text tool that works in any Mac app, including Obsidian. This guide covers all four methods because the right choice depends on your specific Obsidian workflow — whether you're capturing daily notes, working on long-form drafts, or building a research database.
Obsidian is built on Electron, the cross-platform framework that powers VS Code, Slack, Discord, and many other modern apps. Electron apps render their UI in a Chromium-based webview rather than using native macOS text input controls. This architectural choice creates a known compatibility issue: macOS Dictation's "Enhanced" mode often fails to insert text into Electron app text fields because the dictation engine expects to interact with native NSTextView controls. System-wide voice-to-text tools like MetaWhisp work around this by using the macOS Accessibility API to simulate a Cmd+V paste, which Electron apps handle correctly because they implement standard clipboard handling. The practical result: tools that use Accessibility paste (MetaWhisp, Wispr Flow, SuperWhisper) work reliably in Obsidian, while tools that depend on native text-input APIs (Apple Dictation) work inconsistently. The fix isn't to wait for Apple Dictation to improve — it's to use a tool architected around the Accessibility API from the start.

Method 1: MetaWhisp Global Hotkey (Recommended)

The lowest-friction option for Obsidian users is a system-wide voice-to-text app with a global hotkey. MetaWhisp presses Right Option, captures audio, transcribes with Whisper large-v3-turbo on Apple Neural Engine, and pastes the result wherever your cursor is — including inside Obsidian's editor. Setup:
  1. Download MetaWhisp from metawhisp.com (free, no account)
  2. Launch MetaWhisp; it downloads Whisper large-v3-turbo (~800 MB, one-time)
  3. Grant Accessibility permission in System Settings → Privacy & Security → Accessibility
  4. Open MetaWhisp Settings → Global Hotkey → assign Right Option
  5. Choose Hold-to-talk or Toggle mode
  6. Open Obsidian, position cursor in any note, hold Right Option, speak, release
  7. Text appears at the cursor
Where MetaWhisp works in Obsidian: Where MetaWhisp doesn't work in Obsidian:
Pro tip: Combine MetaWhisp's Clean mode with Obsidian's daily notes template. Press Right Option in your daily note, speak your stream-of-consciousness journal entry, release. The transcript appears with filler words removed and grammar fixed, ready for tags and linking. This workflow turns Obsidian into a fast-capture journaling tool without the friction of typing every entry.

Method 2: Apple's Built-in Dictation

Every Mac on macOS 14+ has built-in Dictation. Free, no install, available immediately. The official documentation lives at Apple's Dictation support page. Setup:
  1. System Settings → Keyboard → Dictation → toggle on
  2. Set shortcut: default is press Fn twice; can be remapped
  3. On Apple Silicon: enable "Use Enhanced Dictation" for on-device processing
  4. Open Obsidian, position cursor in a note
  5. Press Fn-Fn (or your configured shortcut)
  6. Speak; text appears at cursor
What works: What doesn't work well: For occasional dictation in a single note where you're willing to tolerate errors, Apple Dictation is fine. For daily Obsidian use, the reliability issues become annoying quickly. Most heavy Obsidian users move to a Whisper-based tool within their first month.

Method 3: Obsidian Community Plugins (Whisper, Voice2Text)

The Obsidian plugin ecosystem includes several voice-to-text plugins that integrate directly with the vault. The most popular options: Setup (Whisper plugin example):
  1. Open Obsidian Settings → Community plugins → Browse
  2. Search "Whisper" → Install → Enable
  3. Plugin settings → enter OpenAI API key (for cloud Whisper)
  4. Configure: where to save audio files, transcript format, language
  5. Use the command palette to trigger "Whisper: Start recording"
  6. Speak; recording saves as MP3 in your vault
  7. Transcript inserts into the current note
Strengths: Weaknesses:
The choice between an Obsidian-specific plugin versus a system-wide tool like MetaWhisp comes down to whether you want the audio files in your vault. Plugin-based workflows save the .mp3 of each recording next to the markdown transcript, which is great for journalists, researchers, podcasters, and anyone who needs to verify quotes against the original audio. System-wide tools like MetaWhisp produce only the text — no audio file. For pure note-taking and capture, the system-wide tool is faster and works across all your apps; for journalism-style research where the audio matters, the plugin approach gives you better archival properties. Some users run both: MetaWhisp for daily capture in any app, an Obsidian plugin for specific research interviews where the recording matters. This dual-tool approach costs no more than a single tool (MetaWhisp is free, most plugins are free with bring-your-own-API-key) and gives you the best of both architectures depending on which workflow you're in at the moment.
Obsidian dictation system-wide MetaWhisp versus plugin comparison diagram showing integration scope and audio storage for Mac

Method 4: Drafts as a Capture Layer for Obsidian

Drafts by Greg Pierce is the canonical "capture-first" text app on iOS and macOS. Its voice handling is excellent — fast, low-friction, integrated with iOS Voice-to-Text and iCloud sync. Many Obsidian users use Drafts as a capture layer that feeds into their Obsidian vault. Workflow:
  1. Install Drafts on Mac and iOS (free with Pro tier for advanced features at $19.99/year)
  2. On iOS: tap the microphone button on the Drafts capture screen, dictate, save
  3. On Mac: Cmd+N for new draft, use Apple Dictation Fn-Fn or paste from clipboard
  4. Drafts syncs via iCloud to your Mac automatically
  5. Create a Drafts "Action" that exports to your Obsidian vault folder (Pro feature)
  6. Trigger the action; the draft becomes a markdown file in your vault
Why this works for Obsidian users: Why this doesn't work for Obsidian users: The Drafts approach is excellent if you're already a Drafts user; it's overkill if you only need Mac dictation in Obsidian.

How Do I Dictate Markdown Formatting Into Obsidian?

Voice-to-text tools produce plain text without markdown syntax. To dictate formatted notes into Obsidian, you have three workflow options: The pragmatic approach: dictate plain prose, then use Obsidian's built-in formatting shortcuts after to add structure. The Obsidian community forum has extensive discussion of post-dictation formatting workflows, including community-built plugins that auto-convert structural verbal cues into markdown syntax. Cmd+B for bold, Cmd+I for italic, Cmd+K for links. Faster than trying to speak markdown punctuation. For long-form drafting, the workflow is: dictate stream-of-consciousness into a daily note, edit later with keyboard to add structure, then linkify and tag. This separation of capture from formatting matches how most ADHD writers and journalists already work — see our voice-to-text for ADHD writers guide for the cognitive pattern.

What's the Privacy Story for Each Obsidian Dictation Method?

MethodWhere audio goesWhere transcript goesPrivacy posture
MetaWhisp + ObsidianLocal memory onlyObsidian vault on diskFully on-device
Apple Enhanced DictationLocal memory onlyObsidian vault on diskFully on-device
Apple Standard DictationApple serversObsidian vault on diskCloud audio upload
Obsidian Whisper plugin (OpenAI API)OpenAI serversObsidian vault on diskCloud audio upload
Obsidian Whisper plugin (local model)Local memory onlyObsidian vault on diskFully on-device
Drafts capture + Apple DictationApple serversDrafts → vault on diskCloud audio upload
Obsidian itself is local-first by default — your vault is a folder of markdown files on your disk, with optional sync via Obsidian Sync, iCloud, or third-party services. The dictation method determines whether audio uploads to a vendor's cloud or stays local. For users who chose Obsidian specifically for local-first privacy, the matching dictation method is one that also keeps audio local: MetaWhisp, Apple Enhanced Dictation on Apple Silicon, or an Obsidian plugin configured to use a local Whisper model. Cloud-based options (OpenAI API, Apple Standard Dictation) defeat the privacy story.
For Obsidian users storing sensitive content — therapy journals, medical research notes, legal case notes, financial planning — the privacy posture of your dictation tool matters as much as your sync setup. Audio is often more revealing than text because intonation, pauses, and tone communicate emotional content the typed transcript doesn't capture.

How Do Obsidian Plugins Compare to MetaWhisp for Dictation?

The fundamental difference between Obsidian community plugins and system-wide voice tools comes down to scope and architecture: The decision becomes obvious when you map your typical daily workflow: For users on the fence, install MetaWhisp first (5 minutes, free) and try it inside Obsidian. If the no-audio-storage limitation becomes a blocker, add an Obsidian plugin as the second tool from the official Obsidian community plugins directory. Most users find the system-wide tool is sufficient, and the audio-storage workflow turns out to be a "nice to have" rather than a must-have.
Obsidian cross-device sync architecture diagram showing Mac dictation via MetaWhisp syncing through Obsidian Sync iCloud Git Syncthing to iPhone vault

How Do I Sync Dictated Notes Across Mac and iPhone?

Obsidian's primary sync options: For dictation-heavy workflows, the sync method matters because you'll often capture on iPhone and edit on Mac. Obsidian Sync is the most reliable for this pattern. iCloud Drive works but has occasional sync delays. Git-based sync requires manual commits which interrupts dictation flow. The cleanest cross-device dictation workflow for Obsidian:
  1. On Mac: dictate via MetaWhisp into the open vault. Save (Cmd+S, auto-saves anyway).
  2. Obsidian Sync or iCloud uploads the change
  3. On iPhone: open Obsidian Mobile, see the synced note within 5-30 seconds
  4. Continue dictating on iPhone via iOS Voice-to-Text (long-press microphone on keyboard)
  5. Changes sync back to Mac
Obsidian dictation methods privacy comparison matrix showing audio paths transcript locations and privacy posture for MetaWhisp Apple Enhanced Dictation and Whisper plugins

Can I Dictate Code Blocks or Mermaid Diagrams Into Obsidian?

Generally no — voice-to-text struggles with exact syntax for code, math, and diagram languages. Whisper transcribes natural language, not symbolic notation. What works for code in Obsidian via voice: What doesn't work: The pattern from our voice coding on Mac guide applies here: voice for intent, keyboard for syntax. For Obsidian users maintaining technical knowledge bases with code snippets, the workflow is to dictate the surrounding prose and explanation by voice, then drop in code blocks via keyboard.

Frequently Asked Questions About Dictating to Obsidian

What is the best voice-to-text app for Obsidian on Mac?

For most users, a system-wide voice-to-text app like MetaWhisp works best. It uses a global hotkey (Right Option) to dictate into any part of Obsidian — editor, command palette, search, frontmatter — and runs on-device with Whisper large-v3-turbo. No plugin maintenance, no Obsidian-specific setup, no cloud upload. For users who specifically want audio files saved in their vault, an Obsidian Whisper plugin is the alternative.

Why doesn't Apple Dictation work reliably in Obsidian?

Obsidian is built on Electron, the cross-platform framework. Electron apps render their UI in a Chromium webview rather than using native macOS text input controls. Apple's Dictation expects to interact with native NSTextView controls, which causes inconsistent behavior in Electron-based editors. System-wide voice-to-text tools that use the macOS Accessibility API for paste (MetaWhisp, Wispr Flow, SuperWhisper) work reliably because Electron apps handle standard clipboard paste correctly.

Can I dictate directly into Obsidian without a plugin?

Yes. A system-wide voice-to-text app like MetaWhisp uses a global hotkey to dictate into any Mac app, including Obsidian's editor. No plugin needed. The transcribed text auto-pastes at your cursor position. This approach works in every part of Obsidian and also works in your other apps (Slack, Cursor, Mail), so one tool handles all your dictation across the Mac.

Are there free voice-to-text plugins for Obsidian?

Yes. The community plugin marketplace includes free voice-to-text plugins like Whisper (by djmango) and Voice Notes. Most free plugins require an OpenAI API key for cloud Whisper transcription, which costs roughly $0.006 per minute of audio — typical cost is $1-5 per month for moderate use. A few plugins support local Whisper models for fully free operation, but setup requires installing additional dependencies.

How do I dictate on Obsidian Mobile for iPhone?

Use iOS Voice-to-Text built into the iPhone keyboard. Open Obsidian Mobile, position cursor in a note, long-press the microphone button on the keyboard, dictate. Text inserts as you speak. Apple Intelligence on iPhone 15 Pro and newer runs voice-to-text on-device for privacy. For users who want better accuracy than iOS Voice-to-Text, third-party iOS voice apps with share-sheet integration (like Whisper Memos) can dictate then share-send into Obsidian.

Can I save audio files in my Obsidian vault?

Yes, via Obsidian community plugins like Whisper or Voice Notes. These record audio (typically as .mp3 or .m4a) directly into your vault folder, attach the file link to the note, and transcribe the audio in the same operation. Useful for journalism, research interviews, and any workflow where you need to verify quotes against the original audio. System-wide tools like MetaWhisp don't save audio — they only insert the transcript text.

What's the accuracy difference between Whisper and Apple Dictation in Obsidian?

Whisper large-v3-turbo (used by MetaWhisp and most Whisper-based plugins) achieves 5-7% word error rate on clean English. Apple's Dictation produces roughly 11-14% WER on the same audio per OpenAI's published benchmarks. For Obsidian users who often dictate technical content (project names, library terms, proper nouns), the accuracy gap is meaningful — Whisper produces materially cleaner transcripts that require less manual correction.

Does dictation work in Obsidian's command palette and quick switcher?

System-wide voice-to-text tools like MetaWhisp work reliably in Obsidian's command palette (Cmd+P), quick switcher (Cmd+O), and search bar — these are standard text input fields that respond to keyboard paste. Apple Dictation works inconsistently in these UI elements. Obsidian plugins typically don't help here because they're scoped to the main editor, not the command palette.

About the Author

Andrew Dyuzhov is the solo founder and CEO of MetaWhisp, a free on-device voice-to-text app for macOS that runs Whisper large-v3-turbo on Apple Neural Engine. He uses Obsidian for personal knowledge management and built MetaWhisp's system-wide hotkey to work reliably in Electron-based apps like Obsidian, Slack, Cursor, and Discord where Apple Dictation often fails. The four methods compared in this article reflect hands-on testing inside Obsidian's editor, command palette, and quick switcher on his M3 MacBook Air. Connect on X or GitHub.

Related Reading