Every professional has experienced it: you dictate “prescribe metformin 500mg twice daily” and your screen reads “prescribe met for men 500 milligrams twice daily.” You say “motion for estoppel” and get “motion for a stop all.” A developer dictates “deploy the Kubernetes cluster” and sees “deploy the Cooper Nettie’s cluster.” Voice dictation custom vocabulary is the solution to these maddening misrecognitions — and setting it up correctly can transform your dictation accuracy from frustrating to flawless.

Generic speech recognition achieves 95-99% accuracy on everyday conversation, but research from AssemblyAI shows that accuracy drops to 70-80% when deployed in specialised domains like healthcare. This guide walks you through why custom vocabulary matters, how different tools handle it, and exactly how to configure Weesper Neon Flow to recognise your profession’s terminology with precision.

Why Generic Dictation Fails on Specialised Terminology

Speech recognition models learn from massive datasets of general conversational speech. When you dictate a word like “pneumothorax,” the model has seen it perhaps once per million training examples — compared to thousands of occurrences for common words like “new” or “more.” This statistical imbalance creates a fundamental problem: the model defaults to common words that sound similar, producing embarrassing and sometimes dangerous errors.

The Real Cost of Misrecognition

The consequences extend far beyond minor inconvenience:

The pattern is clear: the more specialised your vocabulary, the more you need custom terminology dictation to maintain both accuracy and productivity.

How Voice Dictation Custom Vocabulary Works

Custom vocabulary features operate on a straightforward principle: they tell the speech recognition engine which specialised terms to expect, shifting the statistical probability in favour of your domain-specific words. The implementation varies significantly between tools.

Prompt Conditioning (Whisper-Based Tools)

Modern Whisper-based systems, including Weesper Neon Flow, use prompt conditioning — a technique where a text list of expected terms is fed into the language model’s context window. Research from arXiv demonstrates that contextual biasing can reduce word error rates by 40-60% for domain-specific vocabulary without any model fine-tuning.

When you add “metformin” to your custom prompt, the Whisper engine recognises that this pharmaceutical term is likely to appear in your dictation. Instead of defaulting to “met for men,” it correctly transcribes the drug name. The effect is immediate — no voice training, no waiting period.

Voice Training (Legacy Systems)

Older dictation software like Dragon NaturallySpeaking uses voice training: you read passages aloud so the software learns your pronunciation patterns. While effective, this approach requires 2-3 hours of initial training, is tied to a single voice profile, and each new custom word must be trained individually through the Vocabulary Editor.

Text Replacement (OS-Level)

Apple’s Voice Control offers a custom vocabulary feature through System Settings > Accessibility, supporting up to 1,000 terms per language. However, this operates at the operating system level, separate from the dictation engine itself, and lacks the contextual intelligence of prompt conditioning.

Comparison: Custom Vocabulary Features Across Dictation Tools

Understanding how different tools handle technical vocabulary speech recognition helps you choose the right approach for your profession.

FeatureWeesper Neon FlowDragon ProfessionalApple Voice ControlGoogle Docs Voice Typing
Custom vocabulary methodPrompt conditioning + custom dictionaryVocabulary Editor + voice trainingSystem-level custom termsNo custom vocabulary
Setup time2-5 minutes2-3 hours initial training5-10 minutesN/A
Terms supported~150-180 terms (prompt window)Unlimited (individual training)1,000 per languageNone
Voice training requiredNoYes (per word)Optional pronunciation recordingN/A
Immediate effectYesAfter training each wordAfter adding termN/A
Offline processingYes (100% local)Yes (desktop version)YesNo (cloud-only)
Multi-device syncVia settings exportPer-profile onlyVia iCloudN/A
CostFrom £4.40/month£200-700 licenceFree (macOS only)Free
Technical term accuracy95-98% with custom prompts95-98% after training85-92%80-90%

The key differentiator is speed of setup versus depth of customisation. Weesper’s prompt conditioning offers the best balance: near-instant setup with high accuracy, processed entirely offline for privacy and security.

Setting Up Custom Prompts in Weesper Neon Flow

Weesper uses Whisper’s prompt conditioning system to bias recognition towards your specialised terminology. Here is how to configure it for different professions.

Step 1: Access Custom Prompt Settings

  1. Open Weesper Neon Flow on your Mac or Windows machine
  2. Navigate to Settings (gear icon)
  3. Select the Custom Prompts section
  4. You will see a text field where you can enter your terminology list

Step 2: Structure Your Custom Prompt

Organise your terms by category for clarity. The prompt window supports approximately 150-180 terms (224 tokens). Prioritise the terms you use most frequently and that generic dictation consistently misrecognises.

Medical Professionals

If you work in healthcare, your custom prompt should include drug names, procedures, and diagnostic codes that you dictate daily. Without custom vocabulary, common medications like “metformin” become “met for men,” “lisinopril” becomes “listen oh prill,” and “omeprazole” becomes “oh me pra sole.” These errors are not merely inconvenient — they represent clinical safety risks when they appear in patient records. For HIPAA compliance considerations when using voice dictation in clinical settings, see our comprehensive HIPAA-compliant dictation guide.

Example medical custom prompt:

Medications: metformin, lisinopril, atorvastatin, omeprazole, levothyroxine, amlodipine, gabapentin, pantoprazole, duloxetine, pregabalin

Procedures: cholecystectomy, arthroscopy, colonoscopy, endoscopy, laparoscopy, bronchoscopy, thoracentesis, paracentesis

Diagnoses: pneumothorax, tachycardia, bradycardia, atrial fibrillation, hypertriglyceridemia, hypothyroidism, osteoarthritis

Codes: ICD-10, CPT, E11.9, I10, J06.9, M54.5, Z23

Legal terminology presents unique challenges for speech recognition. Latin phrases, case citation formats, and court-specific vocabulary are rarely encountered in general training data. A solicitor dictating a brief cannot afford to manually correct every instance of “certiorari” that appears as “search your worry” or “res judicata” rendered as “race Judy Carter.” The reputational cost of submitting documents with garbled legal terminology is significant.

Example legal custom prompt:

Latin terms: estoppel, habeas corpus, certiorari, mandamus, subpoena duces tecum, res judicata, prima facie, voir dire, amicus curiae, stare decisis

Court terms: plaintiff, defendant, appellant, appellee, interlocutory, adjudication, deposition, interrogatories, demurrer

Citations: v. (versus), supra, infra, Id., ibid., et seq., cf., e.g.

Software Developers and IT Professionals

Framework names, CLI commands, and technical acronyms are consistently misrecognised by general dictation engines. “Kubernetes” becomes “Cooper Nettie’s,” “PostgreSQL” becomes “post Gres sequel,” and “nginx” becomes “engine X” or worse. For developers who dictate documentation, code comments, or technical specifications, these errors multiply across every dictation session and slow down the workflow they adopted dictation to accelerate.

Example developer custom prompt:

Frameworks: Kubernetes, kubectl, Docker, Terraform, Ansible, PostgreSQL, MongoDB, GraphQL, WebSocket, nginx

Languages: TypeScript, JavaScript, Python, Rust, Golang, Kotlin, Swift, C#

Commands: npm install, git commit, pip install, cargo build, dotenv, localhost, sudo, chmod, grep, rsync

Terms: API, SDK, CI/CD, DevOps, microservices, serverless, OAuth, JWT, WebRTC, CORS

Academic Researchers

Discipline-specific terminology, methodological terms, and citation conventions benefit from custom prompts.

Example academic custom prompt:

Methods: ethnography, phenomenology, grounded theory, mixed-methods, meta-analysis, regression analysis, ANOVA, chi-squared

Statistics: p-value, confidence interval, standard deviation, effect size, Cohen's d, Cronbach's alpha, Likert scale

Citations: et al., ibid., op. cit., APA, MLA, Chicago, DOI, ORCID

Step 3: Test and Refine

After entering your custom prompt:

  1. Dictate a test passage containing your specialised terms
  2. Review the transcription for accuracy on custom terms
  3. Adjust your prompt — add terms that were misrecognised, remove common terms that do not need prompting
  4. Iterate until your most-used terminology is consistently recognised

For general tips on improving dictation accuracy beyond custom vocabulary, see our guide to fixing common dictation mistakes.

Advanced Tips for Maintaining Your Custom Vocabulary

Setting up your initial custom prompt is only the beginning. Here are strategies for keeping your custom dictionary setup effective as your terminology evolves.

Prioritise High-Frequency Misrecognitions

Track which terms are consistently misrecognised during your first week of use. These are your highest-priority additions. A term you dictate 50 times per day that fails 30% of the time costs you far more than a rare term that fails occasionally.

Group Terms by Context

Whisper’s prompt conditioning works best when terms appear in a logical context. Grouping medications together, legal terms together, and technical frameworks together helps the model understand the domain you are working in, improving recognition even for terms not explicitly listed.

Update Regularly

As new medications, frameworks, or legal precedents enter your workflow, add them to your custom prompt. Weesper’s prompt changes take effect immediately — no retraining required.

Use Custom Prompts Alongside Custom Dictionaries

Weesper Neon Flow also supports a custom dictionary feature for direct word mappings. While custom prompts bias the Whisper engine towards certain terms, custom dictionaries allow you to define exact text replacements:

Combining both features — prompt conditioning for recognition accuracy and custom dictionaries for output formatting — delivers the highest level of customisation.

Share Vocabulary Across Teams

If you work in a practice, firm, or development team, export your custom prompt configuration and share it with colleagues. This ensures consistent terminology recognition across all team members using Weesper. A cardiology department, for example, can maintain a shared custom prompt covering cardiac medications, procedures, and diagnostic codes. New team members simply import the file and begin dictating with full terminology coverage from day one — no individual training required.

Create Profession-Specific Profiles

If you work across multiple domains — for instance, a medical-legal consultant who dictates both clinical assessments and legal reports — maintain separate custom prompt profiles. Switch between them depending on the type of document you are dictating. This prevents the legal prompt from diluting the medical prompt’s effectiveness and vice versa, keeping recognition accuracy high in both contexts.

The Science Behind Custom Vocabulary Accuracy

Research validates the impact of custom vocabulary on speech recognition accuracy. A 2024 study published on arXiv demonstrated that contextual biasing reduced Whisper’s word error rate from 40.27% to 29.26% on the tiny model, and from 27.82% to 11.12% on the medium model — all without fine-tuning the underlying model. Entity detection accuracy improved from 67% to 98%.

Separately, research using the Vosk toolkit showed that custom language models “consistently reduced word error rates, especially in domain-specific scenarios involving technical terminology, varied accents, or background noise.” The custom model demonstrated a “clear and consistent advantage over the default model across all test samples.”

These findings confirm what professionals experience daily: general models struggle with specialised vocabulary, and custom vocabulary configuration is the most effective remedy. For a deeper look at how modern speech recognition achieves its accuracy benchmarks, see our comprehensive accuracy analysis.

Getting Started With Custom Vocabulary Today

Every minute you spend correcting misrecognised terminology is a minute lost to productive work. Whether you are a clinician dictating patient notes, a solicitor preparing court filings, or a developer documenting code, voice dictation custom vocabulary eliminates the friction between your expertise and your documentation.

Here is your action plan:

  1. Download Weesper Neon Flow and install it on your Mac or Windows machine
  2. Open Settings > Custom Prompts and enter your 20 most frequently misrecognised terms
  3. Dictate a test passage from your typical workflow
  4. Refine your prompt based on results — add missing terms, remove unnecessary ones
  5. Expand gradually to cover your full professional vocabulary over the first week

With custom prompts configured, you can expect specialised terminology accuracy to rise from 70-85% to 95-98% — turning voice dictation from a frustrating experiment into an indispensable professional tool. Visit our Help Centre for detailed setup guides and troubleshooting support.