Diction flips the script.
A terminal-based notetaking app — yeah, you read that right — with built-in local transcription. No clouds, no subscriptions, just your machine humming along. /u/Bian- dropped this gem on r/opensource, pitching it as a keybind-heavy Markdown beast for class notes or meetings. And here’s the hook: it transcribes device mic or external audio using open-source Whisper models, entirely offline.
Think about it. We’re drowning in Electron apps that suck 2GB RAM for a simple text pad. Diction? Lightweight as a feather, terminal-native. Vim users, rejoice — it’s got that muscle memory vibe, but for notes.
Why Drag Notetaking Back to the Terminal in 2024?
Terminals never died; they evolved. LazyGit for repos, Lazydocker for containers — TUIs (text user interfaces) are sneaking back because they’re fast. No GPU-hogging renders, no context-switching to a GUI. Diction taps that vein.
But — and this is key — it layers on transcription. Fire up Whisper (local models like tiny or base), speak your lecture ramblings, and boom: Markdown output. Keybinds for everything: jump sections, toggle record, export. Bian- nails the use case: quick class or meeting notes, no laptop fan spinning up like a jet engine.
It’s not just nostalgia. Privacy’s the killer app here. SuperWhisper and kin? They whisper to servers sometimes. Diction stays put. Your voice data? Yours alone.
I am working on a Terminal UI and key bind heavy local note taker (Markdown) with a built in component of transcription of either device or outside audio for personal use.
That’s straight from Bian-’s post. Raw, unfiltered ambition.
How Does Local Whisper Transcription Actually Work Here?
Pulls from Whisper.cpp or similar — ultra-efficient C++ port of OpenAI’s model. Runs on CPU, sips power. Device audio? Pipe in from mic. External? FFmpeg grabs it, transcribes chunks real-time or batch.
Architecture’s clever. Terminal UI via something like Textual or Bubble Tea (Ratatui in Rust?). Markdown renderer live, so you see formatted notes sans leaving tmux. Keybinds remappable — hjkl navigation, leader prefixes like Vim. Export to file, sync via git if you’re fancy.
Wander a bit: imagine pairing with tmux sessions. One pane Diction, one chat, one code. No app switcher hell.
But does it beat the hype? Reddit’s buzzing — folks dig the local angle, but whisper for meetings? Accents, noise — it’s iffy without fine-tuning. Still, for personal use, gold.
Is Diction Set to Kill Obsidian or Notion?
Nah. Not yet. Obsidian’s graph views and plugins are crack for knowledge nerds. Notion? Collaborative dream. Diction’s for solo warriors — the power user who lives in i3 or Hyprland, hates mice.
Here’s my unique take, absent from Bian-’s post: this echoes the 90s Unix wars, when tools like vi and mutt turned terminals into power suites. Today, with AI commoditized locally (thanks Ollama, Whisper.cpp), we’re seeing TUI 2.0. Prediction? Diction sparks a wave — notetakers, task boards, even IDEs in text mode. Amid Big Tech’s data grabs, offline TUIs become the underground railroad for creators.
Corporate spin? None here — it’s indie dev grit. No VC fluff, just code.
Skepticism check: is it vaporware? Post seeks similar projects, opinions. Shoutouts to Whisper-Writer, ufal’s streaming Whisper. Nothing exactly matches — terminal + notes + transcrip. Closest? Maybe Neovim plugins, but fragmented.
Build it right, and it’s disruptive. Lightweight wins in battery life, portability. Script kiddies could fork for custom models — domain-specific jargon transcription.
Roadblocks? UI polish. Terminals intimidate noobs. Accessibility — screen readers? Workable, but tweaks needed. Still, for devs, academics? Perfect storm.
Look, I’ve pounded keys in terminals for decades. Diction feels like home — efficient, unintrusive. If Bian- ships, it’ll carve a niche faster than you think.
Why Does This Matter for Open Source Devs?
Open source thrives on niches. Diction fills the ‘local TUI note + AI’ gap. Contributions? UI tweaks, model integrations (faster inference?), mobile ports via termux?
Broader shift: AI’s going on-device. Apple Intelligence, Qualcomm chips — hardware’s ready. Software lags; Diction nudges it.
And yeah, it’s flexible. Class notes? Dictate theorems. Meetings? Transcribe rants, edit live. No $10/month Notion tax.
🧬 Related Insights
- Read more: EU’s CRA Draft Guidance Exposes Open Source Stewards to Chaos – Four Gaps That Could Break Everything
- Read more: Open-Source MCP Server Puts Quran Search in Your AI Agent’s Hands, Offline
Frequently Asked Questions
What is the Diction app? Diction’s a terminal-based Markdown notetaker with local Whisper transcription for audio-to-text, all offline and lightweight.
Is Diction open source? Yes — pitched on r/opensource, so expect GitHub drop soon. Fork-friendly for custom tweaks.
Diction vs SuperWhisper? Diction’s terminal-only, fully local notes + transcrip; SuperWhisper’s GUI-focused, sometimes cloud-reliant. Diction wins on privacy, speed for power users.