Every professional has experienced it: you dictate “prescribe metformin 500mg twice daily” and your screen reads “prescribe met for men 500 milligrams twice daily.” You say “motion for estoppel” and get “motion for a stop all.” A developer dictates “deploy the Kubernetes cluster” and sees “deploy the Cooper Nettie’s cluster.” Voice dictation custom vocabulary is the solution to these maddening misrecognitions — and setting it up correctly can transform your dictation accuracy from frustrating to flawless.
Generic speech recognition achieves 95-99% accuracy on everyday conversation, but research from AssemblyAI shows that accuracy drops to 70-80% when deployed in specialised domains like healthcare. This guide walks you through why custom vocabulary matters, how different tools handle it, and exactly how to configure Weesper Neon Flow to recognise your profession’s terminology with precision.
Why Generic Dictation Fails on Specialised Terminology
Speech recognition models learn from massive datasets of general conversational speech. When you dictate a word like “pneumothorax,” the model has seen it perhaps once per million training examples — compared to thousands of occurrences for common words like “new” or “more.” This statistical imbalance creates a fundamental problem: the model defaults to common words that sound similar, producing embarrassing and sometimes dangerous errors.
The Real Cost of Misrecognition
The consequences extend far beyond minor inconvenience:
- Medical professionals face clinically significant errors in over 63% of notes generated by general speech recognition, according to research published in npj Digital Medicine. A misrecognised drug name or dosage could endanger patient safety.
- Legal professionals cannot afford “a stop all” when they mean “estoppel” in court filings. Incorrect legal terminology undermines credibility and can alter the meaning of arguments. Our guide to voice dictation for lawyers covers these challenges in detail.
- Software developers lose time correcting framework names, CLI commands, and variable naming conventions that generic dictation consistently garbles.
- Academic researchers find that discipline-specific terminology, citation formats, and theoretical frameworks are routinely mangled.
The pattern is clear: the more specialised your vocabulary, the more you need custom terminology dictation to maintain both accuracy and productivity.
How Voice Dictation Custom Vocabulary Works
Custom vocabulary features operate on a straightforward principle: they tell the speech recognition engine which specialised terms to expect, shifting the statistical probability in favour of your domain-specific words. The implementation varies significantly between tools.
Prompt Conditioning (Whisper-Based Tools)
Modern Whisper-based systems, including Weesper Neon Flow, use prompt conditioning — a technique where a text list of expected terms is fed into the language model’s context window. Research from arXiv demonstrates that contextual biasing can reduce word error rates by 40-60% for domain-specific vocabulary without any model fine-tuning.
When you add “metformin” to your custom prompt, the Whisper engine recognises that this pharmaceutical term is likely to appear in your dictation. Instead of defaulting to “met for men,” it correctly transcribes the drug name. The effect is immediate — no voice training, no waiting period.
Voice Training (Legacy Systems)
Older dictation software like Dragon NaturallySpeaking uses voice training: you read passages aloud so the software learns your pronunciation patterns. While effective, this approach requires 2-3 hours of initial training, is tied to a single voice profile, and each new custom word must be trained individually through the Vocabulary Editor.
Text Replacement (OS-Level)
Apple’s Voice Control offers a custom vocabulary feature through System Settings > Accessibility, supporting up to 1,000 terms per language. However, this operates at the operating system level, separate from the dictation engine itself, and lacks the contextual intelligence of prompt conditioning.
Comparison: Custom Vocabulary Features Across Dictation Tools
Understanding how different tools handle technical vocabulary speech recognition helps you choose the right approach for your profession.
| Feature | Weesper Neon Flow | Dragon Professional | Apple Voice Control | Google Docs Voice Typing |
|---|---|---|---|---|
| Custom vocabulary method | Prompt conditioning + custom dictionary | Vocabulary Editor + voice training | System-level custom terms | No custom vocabulary |
| Setup time | 2-5 minutes | 2-3 hours initial training | 5-10 minutes | N/A |
| Terms supported | ~150-180 terms (prompt window) | Unlimited (individual training) | 1,000 per language | None |
| Voice training required | No | Yes (per word) | Optional pronunciation recording | N/A |
| Immediate effect | Yes | After training each word | After adding term | N/A |
| Offline processing | Yes (100% local) | Yes (desktop version) | Yes | No (cloud-only) |
| Multi-device sync | Via settings export | Per-profile only | Via iCloud | N/A |
| Cost | From £4.40/month | £200-700 licence | Free (macOS only) | Free |
| Technical term accuracy | 95-98% with custom prompts | 95-98% after training | 85-92% | 80-90% |
The key differentiator is speed of setup versus depth of customisation. Weesper’s prompt conditioning offers the best balance: near-instant setup with high accuracy, processed entirely offline for privacy and security.
Setting Up Custom Prompts in Weesper Neon Flow
Weesper uses Whisper’s prompt conditioning system to bias recognition towards your specialised terminology. Here is how to configure it for different professions.
Step 1: Access Custom Prompt Settings
- Open Weesper Neon Flow on your Mac or Windows machine
- Navigate to Settings (gear icon)
- Select the Custom Prompts section
- You will see a text field where you can enter your terminology list
Step 2: Structure Your Custom Prompt
Organise your terms by category for clarity. The prompt window supports approximately 150-180 terms (224 tokens). Prioritise the terms you use most frequently and that generic dictation consistently misrecognises.
Medical Professionals
If you work in healthcare, your custom prompt should include drug names, procedures, and diagnostic codes that you dictate daily. Without custom vocabulary, common medications like “metformin” become “met for men,” “lisinopril” becomes “listen oh prill,” and “omeprazole” becomes “oh me pra sole.” These errors are not merely inconvenient — they represent clinical safety risks when they appear in patient records. For HIPAA compliance considerations when using voice dictation in clinical settings, see our comprehensive HIPAA-compliant dictation guide.
Example medical custom prompt:
Medications: metformin, lisinopril, atorvastatin, omeprazole, levothyroxine, amlodipine, gabapentin, pantoprazole, duloxetine, pregabalin
Procedures: cholecystectomy, arthroscopy, colonoscopy, endoscopy, laparoscopy, bronchoscopy, thoracentesis, paracentesis
Diagnoses: pneumothorax, tachycardia, bradycardia, atrial fibrillation, hypertriglyceridemia, hypothyroidism, osteoarthritis
Codes: ICD-10, CPT, E11.9, I10, J06.9, M54.5, Z23
Legal Professionals
Legal terminology presents unique challenges for speech recognition. Latin phrases, case citation formats, and court-specific vocabulary are rarely encountered in general training data. A solicitor dictating a brief cannot afford to manually correct every instance of “certiorari” that appears as “search your worry” or “res judicata” rendered as “race Judy Carter.” The reputational cost of submitting documents with garbled legal terminology is significant.
Example legal custom prompt:
Latin terms: estoppel, habeas corpus, certiorari, mandamus, subpoena duces tecum, res judicata, prima facie, voir dire, amicus curiae, stare decisis
Court terms: plaintiff, defendant, appellant, appellee, interlocutory, adjudication, deposition, interrogatories, demurrer
Citations: v. (versus), supra, infra, Id., ibid., et seq., cf., e.g.
Software Developers and IT Professionals
Framework names, CLI commands, and technical acronyms are consistently misrecognised by general dictation engines. “Kubernetes” becomes “Cooper Nettie’s,” “PostgreSQL” becomes “post Gres sequel,” and “nginx” becomes “engine X” or worse. For developers who dictate documentation, code comments, or technical specifications, these errors multiply across every dictation session and slow down the workflow they adopted dictation to accelerate.
Example developer custom prompt:
Frameworks: Kubernetes, kubectl, Docker, Terraform, Ansible, PostgreSQL, MongoDB, GraphQL, WebSocket, nginx
Languages: TypeScript, JavaScript, Python, Rust, Golang, Kotlin, Swift, C#
Commands: npm install, git commit, pip install, cargo build, dotenv, localhost, sudo, chmod, grep, rsync
Terms: API, SDK, CI/CD, DevOps, microservices, serverless, OAuth, JWT, WebRTC, CORS
Academic Researchers
Discipline-specific terminology, methodological terms, and citation conventions benefit from custom prompts.
Example academic custom prompt:
Methods: ethnography, phenomenology, grounded theory, mixed-methods, meta-analysis, regression analysis, ANOVA, chi-squared
Statistics: p-value, confidence interval, standard deviation, effect size, Cohen's d, Cronbach's alpha, Likert scale
Citations: et al., ibid., op. cit., APA, MLA, Chicago, DOI, ORCID
Step 3: Test and Refine
After entering your custom prompt:
- Dictate a test passage containing your specialised terms
- Review the transcription for accuracy on custom terms
- Adjust your prompt — add terms that were misrecognised, remove common terms that do not need prompting
- Iterate until your most-used terminology is consistently recognised
For general tips on improving dictation accuracy beyond custom vocabulary, see our guide to fixing common dictation mistakes.
Advanced Tips for Maintaining Your Custom Vocabulary
Setting up your initial custom prompt is only the beginning. Here are strategies for keeping your custom dictionary setup effective as your terminology evolves.
Prioritise High-Frequency Misrecognitions
Track which terms are consistently misrecognised during your first week of use. These are your highest-priority additions. A term you dictate 50 times per day that fails 30% of the time costs you far more than a rare term that fails occasionally.
Group Terms by Context
Whisper’s prompt conditioning works best when terms appear in a logical context. Grouping medications together, legal terms together, and technical frameworks together helps the model understand the domain you are working in, improving recognition even for terms not explicitly listed.
Update Regularly
As new medications, frameworks, or legal precedents enter your workflow, add them to your custom prompt. Weesper’s prompt changes take effect immediately — no retraining required.
Use Custom Prompts Alongside Custom Dictionaries
Weesper Neon Flow also supports a custom dictionary feature for direct word mappings. While custom prompts bias the Whisper engine towards certain terms, custom dictionaries allow you to define exact text replacements:
- Dictate “patient ID number” → output “Patient ID No.”
- Dictate “standard operating procedure” → output “SOP”
- Dictate “with respect to” → output “w.r.t.”
Combining both features — prompt conditioning for recognition accuracy and custom dictionaries for output formatting — delivers the highest level of customisation.
Share Vocabulary Across Teams
If you work in a practice, firm, or development team, export your custom prompt configuration and share it with colleagues. This ensures consistent terminology recognition across all team members using Weesper. A cardiology department, for example, can maintain a shared custom prompt covering cardiac medications, procedures, and diagnostic codes. New team members simply import the file and begin dictating with full terminology coverage from day one — no individual training required.
Create Profession-Specific Profiles
If you work across multiple domains — for instance, a medical-legal consultant who dictates both clinical assessments and legal reports — maintain separate custom prompt profiles. Switch between them depending on the type of document you are dictating. This prevents the legal prompt from diluting the medical prompt’s effectiveness and vice versa, keeping recognition accuracy high in both contexts.
The Science Behind Custom Vocabulary Accuracy
Research validates the impact of custom vocabulary on speech recognition accuracy. A 2024 study published on arXiv demonstrated that contextual biasing reduced Whisper’s word error rate from 40.27% to 29.26% on the tiny model, and from 27.82% to 11.12% on the medium model — all without fine-tuning the underlying model. Entity detection accuracy improved from 67% to 98%.
Separately, research using the Vosk toolkit showed that custom language models “consistently reduced word error rates, especially in domain-specific scenarios involving technical terminology, varied accents, or background noise.” The custom model demonstrated a “clear and consistent advantage over the default model across all test samples.”
These findings confirm what professionals experience daily: general models struggle with specialised vocabulary, and custom vocabulary configuration is the most effective remedy. For a deeper look at how modern speech recognition achieves its accuracy benchmarks, see our comprehensive accuracy analysis.
Getting Started With Custom Vocabulary Today
Every minute you spend correcting misrecognised terminology is a minute lost to productive work. Whether you are a clinician dictating patient notes, a solicitor preparing court filings, or a developer documenting code, voice dictation custom vocabulary eliminates the friction between your expertise and your documentation.
Here is your action plan:
- Download Weesper Neon Flow and install it on your Mac or Windows machine
- Open Settings > Custom Prompts and enter your 20 most frequently misrecognised terms
- Dictate a test passage from your typical workflow
- Refine your prompt based on results — add missing terms, remove unnecessary ones
- Expand gradually to cover your full professional vocabulary over the first week
With custom prompts configured, you can expect specialised terminology accuracy to rise from 70-85% to 95-98% — turning voice dictation from a frustrating experiment into an indispensable professional tool. Visit our Help Centre for detailed setup guides and troubleshooting support.