🧰 Project

📂 Datasets

sym

LRS3-For-Speech-Separation

Kai Li

Github Repo | GitHub stars

  • Open source audio-visual dataset processing script. Following are the steps to generate training and testing data. There are several parameters to change in order to match different purpose.

🎤 Audio-only Speech Separation Methods

sym

DPRNN-Pytorch

Github Repo | GitHub stars | 知乎: DPRNN阅读笔记

Kai Li

  • Dual-path RNN. Efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch.
sym

Calculate-SNR-SDR

Github Repo | GitHub stars

Kai Li

  • Calculatie Audio‘s SNR and SDR.
sym

Conv-TasNet

Github Repo | GitHub stars | 知乎: Conv-TasNet阅读笔记

Kai Li

  • Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch’s Implement.
sym

UtterancePIT

Github Repo | GitHub stars | 知乎: uPIT阅读笔记

Kai Li

  • According to funcwj’s uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
sym

Deep Clustering

Github Repo | GitHub stars | 知乎: DPCL阅读笔记

Kai Li

  • Deep clustering in the field of speech separation implemented by pytorch.
sym

AFRCNN

Github Repo | GitHub stars | 知乎: AFRCNN阅读笔记

Kai Li

  • Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.

🎬 Audio-visual Speech Separation Methods

sym

Looking to Listen at the Cocktail Party

Github Repo | GitHub stars | 知乎: DPRNN阅读笔记

Kai Li

  • The project is an audiovisual model reproduced by the contents of the paper Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation.

📖 Tutorial

sym

Speech-Separation-Paper-Tutorial

Github Repo | GitHub stars

Kai Li

  • A must-read paper and tutorial list for speech separation based on neural networks.