Why speed varies
Weesper Neon Flow processes your voice locally on your computer. This is excellent for privacy, but speed depends directly on your machine.
The main factors are:
- the selected recognition profile;
- the quality level;
- processor performance;
- available memory;
- the load from other applications;
- available hardware acceleration depending on your version.
A larger profile can be more accurate, but it requires more time and memory.
Fast mode is not real-time text
A profile called “Instant”, “Fast”, or similar means Weesper Neon Flow prioritizes speed. It does not mean the text appears while you speak.
The workflow stays the same:
- you hold the shortcut;
- you speak;
- you release it;
- Weesper transcribes;
- the text is inserted.
Recommended settings if it is too slow
Try these steps in order:
- Switch to a lighter profile.
- Reduce the quality level.
- Test with a 5 to 10 second dictation.
- Close heavy applications.
- Restart Weesper Neon Flow.
- Check that your computer is not in battery saver mode.
If the lighter profile is much faster, your previous profile was probably too heavy for your setup.
Windows ARM and Snapdragon machines
Some recent Windows machines use ARM processors, for example Snapdragon X Elite.
In the current version, Weesper Neon Flow may not use the Qualcomm Hexagon NPU. It is therefore possible for Task Manager to show 0% NPU usage, even during transcription. This is not necessarily a bug: it simply means NPU acceleration is not used by this version.
GPU and hardware acceleration
Depending on your Weesper Neon Flow version and setup, some GPU acceleration may be available or being improved. If a GPU option exists in your settings, you can test it.
After enabling it, compare:
- the same sentence;
- the same profile;
- the same quality;
- with and without acceleration.
This lets you see whether your machine actually benefits from it.
When to contact support
Contact us if a short dictation remains very slow with a light profile.
Include:
- your Windows version;
- your processor;
- your amount of RAM;
- the selected recognition profile;
- the quality level;
- the approximate duration of a 10 second dictation.