Deezer source separation library including pretrained models.
-
Updated
Apr 2, 2025 - Python
Deezer source separation library including pretrained models.
A PyTorch-based Speech Toolkit
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Data manipulation and transformation for audio signal processing, powered by PyTorch
🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
A simple GUI application that slices audio with silence detection
Synchronized Translation for Videos. Video dubbing
A fundamental toolkit designed for music, song, and audio generation
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
SincNet is a neural architecture for efficiently processing raw audio samples.
Audio processing by using pytorch 1D convolution network
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
Implementation of the Wave-U-Net for audio source separation
Smarter data pipelines for audio.
Audio Large Language Models
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Add a description, image, and links to the audio-processing topic page so that developers can more easily learn about it.
To associate your repository with the audio-processing topic, visit your repo's landing page and select "manage topics."