How do I add custom vocabulary to my voice dictation software?

The method depends on your tool. In Weesper Neon Flow, open Settings, navigate to the Custom Prompts section, and type your specialised terms, acronyms, and preferred spellings. Weesper uses prompt conditioning with Whisper to bias recognition towards your terminology. In Dragon, use the Vocabulary Editor to add words individually or import a text file list. Apple Voice Control allows up to 1,000 custom terms via System Settings > Accessibility > Voice Control > Vocabulary.

Why does my dictation software misrecognise technical terms?

General-purpose speech recognition models are trained on everyday conversational speech, where specialised terms like 'metformin,' 'estoppel,' or 'Kubernetes' appear extremely rarely. A medical term might occur once per million words in general training data, creating a statistical bias towards common homophones. Custom vocabulary features solve this by explicitly telling the recognition engine which specialised terms to expect, shifting the probability in favour of your domain-specific words.

Can Whisper-based dictation handle medical and legal terminology?

Yes, with proper configuration. Research shows that Whisper with prompt conditioning can reduce word error rates by 40-60% for domain-specific vocabulary. Weesper Neon Flow uses Whisper.cpp with custom prompts that let you list medical terms, legal phrases, and technical jargon. The engine then biases its recognition towards those terms, achieving 95-98% accuracy on specialised vocabulary that would otherwise be transcribed at 70-85% accuracy.

What is the difference between custom prompts and voice training?

Voice training (used by older tools like Dragon) requires you to read passages aloud so the software learns your voice patterns and pronunciation. Custom prompts (used by Whisper-based tools like Weesper) provide a text list of expected terms that biases the language model's output without voice samples. Custom prompts are faster to set up, instantly effective, and portable between devices. Voice training adapts to your accent but requires hours of reading and is tied to a single voice profile.

How many custom vocabulary terms can I add to Weesper?

Weesper's custom prompt system supports up to 224 tokens (approximately 150-180 specialised terms), which is the Whisper prompt conditioning window. For most professionals, this covers all critical domain terminology. You can organise terms by category — drug names, procedure codes, case citation formats — and update them as your practice evolves. Apple Voice Control supports up to 1,000 terms, while Dragon has no practical limit but requires individual training for each word.

Do I need to retrain my dictation software when I add new terms?

Not with modern Whisper-based tools like Weesper. Custom prompts take effect immediately — simply add the term to your prompt list and start dictating. There is no retraining, no voice samples, and no waiting period. Older systems like Dragon require you to train each new word by reading it aloud multiple times, and Apple's text replacement system needs a manual restart of dictation services to recognise new entries.

Voice Dictation Custom Vocabulary: Technical Terminology Setup Guide

Every professional has experienced it: you dictate “prescribe metformin 500mg twice daily” and your screen reads “prescribe met for men 500 milligrams twice daily.” You say “motion for estoppel” and get “motion for a stop all.” A developer dictates “deploy the Kubernetes cluster” and sees “deploy the Cooper Nettie’s cluster.” Voice dictation custom vocabulary is the solution to these maddening misrecognitions — and setting it up correctly can transform your dictation accuracy from frustrating to flawless.

Generic speech recognition achieves 95-99% accuracy on everyday conversation, but research from AssemblyAI shows that accuracy drops to 70-80% when deployed in specialised domains like healthcare. This guide walks you through why custom vocabulary matters, how different tools handle it, and exactly how to configure Weesper Neon Flow to recognise your profession’s terminology with precision.

Why Generic Dictation Fails on Specialised Terminology

Speech recognition models learn from massive datasets of general conversational speech. When you dictate a word like “pneumothorax,” the model has seen it perhaps once per million training examples — compared to thousands of occurrences for common words like “new” or “more.” This statistical imbalance creates a fundamental problem: the model defaults to common words that sound similar, producing embarrassing and sometimes dangerous errors.

The Real Cost of Misrecognition

The consequences extend far beyond minor inconvenience:

Medical professionals face clinically significant errors in over 63% of notes generated by general speech recognition, according to research published in npj Digital Medicine. A misrecognised drug name or dosage could endanger patient safety.
Legal professionals cannot afford “a stop all” when they mean “estoppel” in court filings. Incorrect legal terminology undermines credibility and can alter the meaning of arguments. Our guide to voice dictation for lawyers covers these challenges in detail.
Software developers lose time correcting framework names, CLI commands, and variable naming conventions that generic dictation consistently garbles.
Academic researchers find that discipline-specific terminology, citation formats, and theoretical frameworks are routinely mangled.

The pattern is clear: the more specialised your vocabulary, the more you need custom terminology dictation to maintain both accuracy and productivity.

How Voice Dictation Custom Vocabulary Works

Custom vocabulary features operate on a straightforward principle: they tell the speech recognition engine which specialised terms to expect, shifting the statistical probability in favour of your domain-specific words. The implementation varies significantly between tools.

Prompt Conditioning (Whisper-Based Tools)

Modern Whisper-based systems, including Weesper Neon Flow, use prompt conditioning — a technique where a text list of expected terms is fed into the language model’s context window. Research from arXiv demonstrates that contextual biasing can reduce word error rates by 40-60% for domain-specific vocabulary without any model fine-tuning.

When you add “metformin” to your custom prompt, the Whisper engine recognises that this pharmaceutical term is likely to appear in your dictation. Instead of defaulting to “met for men,” it correctly transcribes the drug name. The effect is immediate — no voice training, no waiting period.

Voice Training (Legacy Systems)

Older dictation software like Dragon NaturallySpeaking uses voice training: you read passages aloud so the software learns your pronunciation patterns. While effective, this approach requires 2-3 hours of initial training, is tied to a single voice profile, and each new custom word must be trained individually through the Vocabulary Editor.

Text Replacement (OS-Level)

Apple’s Voice Control offers a custom vocabulary feature through System Settings > Accessibility, supporting up to 1,000 terms per language. However, this operates at the operating system level, separate from the dictation engine itself, and lacks the contextual intelligence of prompt conditioning.

Comparison: Custom Vocabulary Features Across Dictation Tools

Understanding how different tools handle technical vocabulary speech recognition helps you choose the right approach for your profession.

Feature	Weesper Neon Flow	Dragon Professional	Apple Voice Control	Google Docs Voice Typing
Custom vocabulary method	Prompt conditioning + custom dictionary	Vocabulary Editor + voice training	System-level custom terms	No custom vocabulary
Setup time	2-5 minutes	2-3 hours initial training	5-10 minutes	N/A
Terms supported	~150-180 terms (prompt window)	Unlimited (individual training)	1,000 per language	None
Voice training required	No	Yes (per word)	Optional pronunciation recording	N/A
Immediate effect	Yes	After training each word	After adding term	N/A
Offline processing	Yes (100% local)	Yes (desktop version)	Yes	No (cloud-only)
Multi-device sync	Via settings export	Per-profile only	Via iCloud	N/A
Cost	From £4.40/month	£200-700 licence	Free (macOS only)	Free
Technical term accuracy	95-98% with custom prompts	95-98% after training	85-92%	80-90%

The key differentiator is speed of setup versus depth of customisation. Weesper’s prompt conditioning offers the best balance: near-instant setup with high accuracy, processed entirely offline for privacy and security.

Setting Up Custom Prompts in Weesper Neon Flow

Weesper uses Whisper’s prompt conditioning system to bias recognition towards your specialised terminology. Here is how to configure it for different professions.

Step 1: Access Custom Prompt Settings

Open Weesper Neon Flow on your Mac or Windows machine
Navigate to Settings (gear icon)
Select the Custom Prompts section
You will see a text field where you can enter your terminology list

Step 2: Structure Your Custom Prompt

Organise your terms by category for clarity. The prompt window supports approximately 150-180 terms (224 tokens). Prioritise the terms you use most frequently and that generic dictation consistently misrecognises.

Medical Professionals

If you work in healthcare, your custom prompt should include drug names, procedures, and diagnostic codes that you dictate daily. Without custom vocabulary, common medications like “metformin” become “met for men,” “lisinopril” becomes “listen oh prill,” and “omeprazole” becomes “oh me pra sole.” These errors are not merely inconvenient — they represent clinical safety risks when they appear in patient records. For HIPAA compliance considerations when using voice dictation in clinical settings, see our comprehensive HIPAA-compliant dictation guide.

Example medical custom prompt:

Medications: metformin, lisinopril, atorvastatin, omeprazole, levothyroxine, amlodipine, gabapentin, pantoprazole, duloxetine, pregabalin

Procedures: cholecystectomy, arthroscopy, colonoscopy, endoscopy, laparoscopy, bronchoscopy, thoracentesis, paracentesis

Diagnoses: pneumothorax, tachycardia, bradycardia, atrial fibrillation, hypertriglyceridemia, hypothyroidism, osteoarthritis

Codes: ICD-10, CPT, E11.9, I10, J06.9, M54.5, Z23

Legal Professionals

Legal terminology presents unique challenges for speech recognition. Latin phrases, case citation formats, and court-specific vocabulary are rarely encountered in general training data. A solicitor dictating a brief cannot afford to manually correct every instance of “certiorari” that appears as “search your worry” or “res judicata” rendered as “race Judy Carter.” The reputational cost of submitting documents with garbled legal terminology is significant.

Example legal custom prompt:

Latin terms: estoppel, habeas corpus, certiorari, mandamus, subpoena duces tecum, res judicata, prima facie, voir dire, amicus curiae, stare decisis

Court terms: plaintiff, defendant, appellant, appellee, interlocutory, adjudication, deposition, interrogatories, demurrer

Citations: v. (versus), supra, infra, Id., ibid., et seq., cf., e.g.

Software Developers and IT Professionals

Framework names, CLI commands, and technical acronyms are consistently misrecognised by general dictation engines. “Kubernetes” becomes “Cooper Nettie’s,” “PostgreSQL” becomes “post Gres sequel,” and “nginx” becomes “engine X” or worse. For developers who dictate documentation, code comments, or technical specifications, these errors multiply across every dictation session and slow down the workflow they adopted dictation to accelerate.

Example developer custom prompt:

Frameworks: Kubernetes, kubectl, Docker, Terraform, Ansible, PostgreSQL, MongoDB, GraphQL, WebSocket, nginx

Languages: TypeScript, JavaScript, Python, Rust, Golang, Kotlin, Swift, C#

Commands: npm install, git commit, pip install, cargo build, dotenv, localhost, sudo, chmod, grep, rsync

Terms: API, SDK, CI/CD, DevOps, microservices, serverless, OAuth, JWT, WebRTC, CORS

Academic Researchers

Discipline-specific terminology, methodological terms, and citation conventions benefit from custom prompts.

Example academic custom prompt:

Methods: ethnography, phenomenology, grounded theory, mixed-methods, meta-analysis, regression analysis, ANOVA, chi-squared

Statistics: p-value, confidence interval, standard deviation, effect size, Cohen's d, Cronbach's alpha, Likert scale

Citations: et al., ibid., op. cit., APA, MLA, Chicago, DOI, ORCID

Step 3: Test and Refine

After entering your custom prompt:

Dictate a test passage containing your specialised terms
Review the transcription for accuracy on custom terms
Adjust your prompt — add terms that were misrecognised, remove common terms that do not need prompting
Iterate until your most-used terminology is consistently recognised

For general tips on improving dictation accuracy beyond custom vocabulary, see our guide to fixing common dictation mistakes.

Advanced Tips for Maintaining Your Custom Vocabulary

Setting up your initial custom prompt is only the beginning. Here are strategies for keeping your custom dictionary setup effective as your terminology evolves.

Prioritise High-Frequency Misrecognitions

Track which terms are consistently misrecognised during your first week of use. These are your highest-priority additions. A term you dictate 50 times per day that fails 30% of the time costs you far more than a rare term that fails occasionally.

Group Terms by Context

Whisper’s prompt conditioning works best when terms appear in a logical context. Grouping medications together, legal terms together, and technical frameworks together helps the model understand the domain you are working in, improving recognition even for terms not explicitly listed.

Update Regularly

As new medications, frameworks, or legal precedents enter your workflow, add them to your custom prompt. Weesper’s prompt changes take effect immediately — no retraining required.

Use Custom Prompts Alongside Custom Dictionaries

Weesper Neon Flow also supports a custom dictionary feature for direct word mappings. While custom prompts bias the Whisper engine towards certain terms, custom dictionaries allow you to define exact text replacements:

Dictate “patient ID number” → output “Patient ID No.”
Dictate “standard operating procedure” → output “SOP”
Dictate “with respect to” → output “w.r.t.”

Combining both features — prompt conditioning for recognition accuracy and custom dictionaries for output formatting — delivers the highest level of customisation.

If you work in a practice, firm, or development team, export your custom prompt configuration and share it with colleagues. This ensures consistent terminology recognition across all team members using Weesper. A cardiology department, for example, can maintain a shared custom prompt covering cardiac medications, procedures, and diagnostic codes. New team members simply import the file and begin dictating with full terminology coverage from day one — no individual training required.

Create Profession-Specific Profiles

If you work across multiple domains — for instance, a medical-legal consultant who dictates both clinical assessments and legal reports — maintain separate custom prompt profiles. Switch between them depending on the type of document you are dictating. This prevents the legal prompt from diluting the medical prompt’s effectiveness and vice versa, keeping recognition accuracy high in both contexts.

The Science Behind Custom Vocabulary Accuracy

Research validates the impact of custom vocabulary on speech recognition accuracy. A 2024 study published on arXiv demonstrated that contextual biasing reduced Whisper’s word error rate from 40.27% to 29.26% on the tiny model, and from 27.82% to 11.12% on the medium model — all without fine-tuning the underlying model. Entity detection accuracy improved from 67% to 98%.

Separately, research using the Vosk toolkit showed that custom language models “consistently reduced word error rates, especially in domain-specific scenarios involving technical terminology, varied accents, or background noise.” The custom model demonstrated a “clear and consistent advantage over the default model across all test samples.”

These findings confirm what professionals experience daily: general models struggle with specialised vocabulary, and custom vocabulary configuration is the most effective remedy. For a deeper look at how modern speech recognition achieves its accuracy benchmarks, see our comprehensive accuracy analysis.

Getting Started With Custom Vocabulary Today

Every minute you spend correcting misrecognised terminology is a minute lost to productive work. Whether you are a clinician dictating patient notes, a solicitor preparing court filings, or a developer documenting code, voice dictation custom vocabulary eliminates the friction between your expertise and your documentation.

Here is your action plan:

Download Weesper Neon Flow and install it on your Mac or Windows machine
Open Settings > Custom Prompts and enter your 20 most frequently misrecognised terms
Dictate a test passage from your typical workflow
Refine your prompt based on results — add missing terms, remove unnecessary ones
Expand gradually to cover your full professional vocabulary over the first week

With custom prompts configured, you can expect specialised terminology accuracy to rise from 70-85% to 95-98% — turning voice dictation from a frustrating experiment into an indispensable professional tool. Visit our Help Centre for detailed setup guides and troubleshooting support.

Weesper is a desktop app

Got it!

Voice Dictation Custom Vocabulary: Technical Terminology Setup Guide

Why Generic Dictation Fails on Specialised Terminology

The Real Cost of Misrecognition

How Voice Dictation Custom Vocabulary Works

Prompt Conditioning (Whisper-Based Tools)

Voice Training (Legacy Systems)

Text Replacement (OS-Level)

Comparison: Custom Vocabulary Features Across Dictation Tools

Setting Up Custom Prompts in Weesper Neon Flow

Step 1: Access Custom Prompt Settings

Step 2: Structure Your Custom Prompt

Medical Professionals

Legal Professionals

Software Developers and IT Professionals

Academic Researchers

Step 3: Test and Refine

Advanced Tips for Maintaining Your Custom Vocabulary

Prioritise High-Frequency Misrecognitions

Group Terms by Context

Update Regularly

Use Custom Prompts Alongside Custom Dictionaries

Create Profession-Specific Profiles

The Science Behind Custom Vocabulary Accuracy

Getting Started With Custom Vocabulary Today

Simple pricing, no surprises

About the Author

FAQ

Sources & References

Why Generic Dictation Fails on Specialised Terminology

The Real Cost of Misrecognition

How Voice Dictation Custom Vocabulary Works

Prompt Conditioning (Whisper-Based Tools)

Voice Training (Legacy Systems)

Text Replacement (OS-Level)

Comparison: Custom Vocabulary Features Across Dictation Tools

Setting Up Custom Prompts in Weesper Neon Flow

Step 1: Access Custom Prompt Settings

Step 2: Structure Your Custom Prompt

Medical Professionals

Legal Professionals

Software Developers and IT Professionals

Academic Researchers

Step 3: Test and Refine

Advanced Tips for Maintaining Your Custom Vocabulary

Prioritise High-Frequency Misrecognitions

Group Terms by Context

Update Regularly

Use Custom Prompts Alongside Custom Dictionaries

Share Vocabulary Across Teams

Create Profession-Specific Profiles

The Science Behind Custom Vocabulary Accuracy

Getting Started With Custom Vocabulary Today

Simple pricing, no surprises

About the Author

FAQ

Sources & References