Kai Li(李凯)

Kai Li is a master student in (Department of Computer Science and Technology, Tsinghua University), supervised by Prof. Xiaolin Hu. And then he will complete his master's career in Hu Lab from 2021 to 2024.

He was an intern at Tencent AI Lab, mainly doing research on causal speech separation, supervi

He got his bachelor's degree from Department of Computer Technology and Application, the Qinghai University, supervised by Prof. Jianqiang Huang and Prof. Chunmei Li in 2020.

His research interests include computer vision, deep learning, speech separation and cross-model speech separation.

   Github      Google Scholar      Email          Twitter          知乎
 
LRS3-For-Speech-Separation
Open source audio-visual dataset processing script.
More Following are the steps to generate training and testing data. There are several parameters to change in order to match different purpose.
[ GithubGitHub stars
DPRNN-Pytorch
Dual-path RNN
More Efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch.
[ Github ] [ 知乎: DPRNN阅读笔记GitHub stars
Looking to Listen at the Cocktail Party
Audio-visual speech separation method :-)
More The project is an audiovisual model reproduced by the contents of the paper Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation.
[ Github ] [ 知乎: LLCP阅读笔记GitHub stars
Calculate-SNR-SDR
Calculatie Audio‘s SNR and SDR.
[ GitHubGitHub stars
Speech-Separation-Paper-Tutorial
A must-read paper and tutorial list for speech separation based on neural networks
More None
[ GithubGitHub stars
Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
[ GitHub ] [ 知乎: Conv-TasNet阅读笔记GitHub stars
UtterancePIT
According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
More None
[ Github ] [ 知乎: uPIT阅读笔记GitHub stars
DPCL
Deep clustering in the field of speech separation implemented by pytorch
[ GitHub ] [ 知乎: DPCL阅读笔记GitHub stars
AFRCNN
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
[ GitHubGitHub stars


News


Publications

Single Image Super-Resolution through Image Pixel Information Clustering and Generative Adversarial Network
Kai Li, Jianqiang Huang, Lingbin Liu, Jinfang Jia, Yu Zhu, Li Wu, Xiaoying Wang,
Submitted IEEE Transactions on Industrial Informatics
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li, Xiaolin Hu, Yi Luo,
Submitted Interpseech 2022
On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems
Kai Li, Yi Luo,
Submitted Interpseech 2022
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network  Audio Demo Page   Speech Enhancement Demo   Github Repository   Paper 
Research on Speech Separation Based on Audio Visual Model
Kai Li, Jianqiang Huang, Xiaolin Hu
Bachelor Degree Thesis, 2020
 Audio Project Page   Audio-visual Project Page 
A Survey of Single Image Super Resolution Reconstruction
Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang, Xiaoying Wang
IET Image Processing, 2020
Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training
Jianqiang Huang*, Kai Li*, Xiaoying Wang
Parallel & Distributed Processing with Applications(ISPA), 2019
Single image super resolution based on generative adversarial networks
Kai Li, Shenghao Yang, Jianqiang Huang, Xiaoying Wang
International Conference on Digital Image Processing (ICDIP), 2019

(* equal contribution, # corresponding author)


© Kai Li | Last updated: April 14th, 2022 | Theme by Xintao Wang