Skip to main content

Denoise

The RNNoise plugin enhances audio quality by removing background noise from your audio input, resulting in improved speech-to-text (STT) accuracy and better overall audio processing performance.

RNNoise is a real-time noise suppression library powered by a recurrent neural network that intelligently filters out environmental noise such as air conditioning, computer fans, and other stationary background sounds while preserving the clarity and quality of speech.

Installation

Install the RNNoise plugin for denoising in VideoSDK Agents package:

pip install "videosdk-plugins-rnnoise"

Importing

from videosdk.plugins.rnnoise import RNNoise

Example Usage

from videosdk.plugins.rnnoise import RNNoise
from videosdk.agents import CascadingPipeline

# Initialize the RNNoise Plugin
rnnoise = RNNoise()

# Add Denoise Plugin to cascading pipeline
pipeline = CascadingPipeline(denoise=rnnoise)

It also works with RealTimePipeline.

Example Usage in RealTime Pipeline

from videosdk.plugins.rnnoise import RNNoise
from videosdk.agents import RealTimePipeline

# Initialize the RNNoise Plugin
rnnoise = RNNoise()

# Add Denoise Plugin to realtime pipeline
pipeline = RealTimePipeline(denoise=rnnoise)

Benefits

  • Enhanced STT Accuracy: Cleaner audio input leads to more accurate speech-to-text transcription
  • Real-time Processing: Processes audio streams with minimal latency for seamless user experience
  • Intelligent Noise Reduction: Effectively removes background noise while preserving speech clarity

Additional Resources

The following resources provide more information about using RNNoise with VideoSDK Agents SDK.

  • Python package: The videosdk-plugins-rnnoise package on PyPI.

  • GitHub repo: View the source or contribute to the VideoSDK RNNoise plugin.

  • RNNoise project: The open source RNNoise library that powers the VideoSDK RNNoise plugin.

Got a Question? Ask us on discord