Skip to content

Improve performance for ASR#95

Merged
openminddev merged 3 commits intomainfrom
improve-performance
Feb 13, 2026
Merged

Improve performance for ASR#95
openminddev merged 3 commits intomainfrom
improve-performance

Conversation

@openminddev
Copy link
Contributor

This pull request introduces several improvements to the ASR (Automatic Speech Recognition) processing pipeline, logging configuration, and websocket server responsiveness. The main focus is on making the ASR model more adaptive to incoming audio sample rates, fine-tuning silence detection thresholds, and improving system responsiveness by reducing sleep intervals in async message processing.

ASR Processing Improvements:

  • The ASR processor now waits for the first audio chunk to dynamically set the sample rate, reinitializing the model if the incoming rate differs from the default. This ensures the model matches the actual audio input, improving accuracy and flexibility.
  • Default ASR silence detection thresholds and history durations are now set more thoughtfully, with new values for stop_history_eou, stop_threshold_eou, stop_history, and stop_threshold to balance between timely finalization and avoiding mid-sentence cuts.
  • The default sample rate for ASR (--asr-sample-rate-hz) is increased from 16kHz to 48kHz, supporting higher-fidelity audio.

Logging Enhancements:

  • Logging configuration now uses force=True to ensure that the logging setup overrides any existing configurations, providing more predictable logging behavior.
  • ASR logging messages are improved for clarity, distinguishing between final and interim results.

Websocket Server Responsiveness:

  • The async websocket server reduces sleep intervals when waiting for messages, improving responsiveness for both global and per-connection message processing. [1] [2]

openminddev and others added 3 commits February 12, 2026 15:35
Pass force=True to logging.basicConfig so the logging configuration (level and format) is applied even if logging was previously configured. This ensures the application's logging settings take effect before adjusting individual logger levels in the subsequent loop.
Clean up and reformat conditionals and logging in ASRProcessor for readability: use consistent double quotes, wrap long conditional expressions in parentheses, condense simple checks to single lines, and format multi-line logger calls. No functional behavior changes intended — defaults and logic remain the same; changes are purely stylistic and improve maintainability.
@openminddev openminddev merged commit 008288f into main Feb 13, 2026
7 checks passed
@openminddev openminddev deleted the improve-performance branch February 13, 2026 00:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant