Show HN: Chirp – Local Windows dictation with ParakeetV3, no executable required

Original link: https://github.com/Whamp/chirp

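## Chirp: Fast local voice dictation for Windows

Chirp is a privacy-focused voice dictation app for Windows that runs entirely locally and needs only Python, with no executable files. It uses the ParakeetV3 speech-to-text (STT) engine, which offers accuracy comparable to OpenAI's Whisper-large-V3 while being **17x faster** and running **on CPU only** (Whisper typically requires a GPU).

Chirp is highly configurable: users can customize the hotkey (default: Ctrl+Shift), the language, and the post-processing style. It also provides clipboard management, audio feedback, and custom word overrides (for example, correcting "parrakeat" to "parakeet").

The app uses ONNX for efficient processing and exposes settings for thread usage and model quantization to tune performance. By keeping all processing on the user's machine, Chirp prioritizes user control and privacy.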

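## Chirp: Local Windows voice dictation with ParakeetV3

Chirp is a new Windows voice dictation app designed for restricted environments where installing executables is prohibited. Created by developer whamp, it uses NVIDIA's ParakeetV3 model to provide accurate, fast, and *local* speech-to-text, requiring only a Python environment and no `.exe` files.

Chirp aims to address the limitations of cloud-based dictation and resource-intensive setups. It uses ONNX Runtime and favors the CPU for speed and accessibility. Users can configure settings such as the hotkey, model selection, and word overrides through a `config.toml` file.

Parakeet V3's accuracy is comparable to that of Whisper-large-v3, but it runs markedly faster on a CPU. The developer is seeking feedback from users in restricted environments, users familiar with Parakeet/Whisper/ONNX, and users with specific feature requests.

Many commenters shared similar macOS and Linux projects, highlighting the strengths of tools such as Whisper and MacWhisper. The project is available on GitHub: [https://github.com/Whamp/chirp](https://github.com/Whamp/chirp).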

Original text

Chirp is a Windows dictation app that runs fully locally using ParakeetV3 STT and is managed end-to-end with uv. Chirp does not require the ability to run executable files (like .exe) on Windows. It was designed so that if you're allowed to run Python on your machine, you can run Chirp.

ParakeetV3's accuracy is indistinguishable from that of Whisper-large-V3 (multilingual WER 4.91 vs 5.05), but it is 17x faster and requires only a CPU, while Whisper models of comparable accuracy require GPUs.

primary_shortcut = "ctrl+shift"                 # Hotkey that toggles recording; any combination supported by the `keyboard` library works (e.g. "ctrl+shift+space").
stt_backend = "parakeet"                        # Only "parakeet" is bundled today, but keeping this key lets us add more backends later if needed.
parakeet_model = "nemo-parakeet-tdt-0.6b-v3"    # Deployed ONNX bundle name; keep as-is unless new models are added.
parakeet_quantization = ""                      # Set to "int8" to download/use the quantized model variant; leave blank for default fp16.
onnx_providers = "cpu"                          # ONNX runtime provider string (comma- or pipe-separated if your build supports multiple providers, e.g. "cuda" or "cpu|dml").
threads = 0                                     # 0 (or empty) lets ONNX decide; set a positive integer to pin thread usage.
language = "en"                                 # Optional ISO language code; leave blank to let Parakeet auto-detect.
post_processing = ""                            # Text prompt for the StyleGuide; see docs/post_processing_style_guide.md (e.g. "sentence case", "prepend: >>", "append: — dictated with Chirp").
paste_mode = "ctrl"                             # Non-Windows platforms honor this: "ctrl" -> Ctrl+V, "ctrl+shift" -> Ctrl+Shift+V. Windows types text directly today.
clipboard_behavior = true                       # Keeps clipboard history clean when true by clearing it after `clipboard_clear_delay` seconds.
clipboard_clear_delay = 0.75                    # Seconds to wait before clearing the clipboard (only if `clipboard_behavior` is true).
audio_feedback = true                           # Enables start/stop chime playback.
start_sound_path = ""                           # Leave blank to use bundled asset; default: src/chirp/assets/ping-up.wav
stop_sound_path = ""                            # Leave blank to use bundled asset; default: src/chirp/assets/ping-down.wav

# Word overrides map spoken tokens (case-insensitive) to replacement text.
[word_overrides]
parrakeat = "parakeet"
"parra keat" = "parakeet"  
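
To make two of the settings above more concrete, here is a minimal Python sketch of how a comma- or pipe-separated `onnx_providers` string and a case-insensitive `[word_overrides]` table could be interpreted. This is not Chirp's actual implementation: the function names `split_providers` and `apply_word_overrides` are hypothetical, and the `config` dict simply stands in for what a TOML parser (for example `tomllib` in Python 3.11+) would return for the file above.

```python
import re

# Stand-in for the parsed config.toml (assumption: a TOML parser would
# produce a nested dict shaped like this).
config = {
    "onnx_providers": "cpu|dml",
    "word_overrides": {
        "parrakeat": "parakeet",
        "parra keat": "parakeet",
    },
}


def split_providers(value: str) -> list[str]:
    """Split a comma- or pipe-separated provider string into clean names."""
    return [part.strip() for part in re.split(r"[,|]", value) if part.strip()]


def apply_word_overrides(text: str, overrides: dict[str, str]) -> str:
    """Replace each spoken token with its override, ignoring case."""
    if not overrides:
        return text
    # Try longer keys first so multi-word keys are not shadowed by shorter ones.
    keys = sorted(overrides, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(key) for key in keys) + r")\b",
        re.IGNORECASE,
    )
    lookup = {key.lower(): replacement for key, replacement in overrides.items()}
    return pattern.sub(lambda match: lookup[match.group(0).lower()], text)


providers = split_providers(config["onnx_providers"])
cleaned = apply_word_overrides(
    "The Parrakeat model fixes parra keat typos.",
    config["word_overrides"],
)
print(providers)
print(cleaned)
```

Matching on word boundaries keeps an override from rewriting the inside of a longer word, and lower-casing the matched text before the lookup is one simple way to get the case-insensitive behavior the config comment describes.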