🎊 Welcome to my website!
My name is Kai Li (Chinese name: 李凯). I’m is a first-year master student at Department of Computer Science and Technology, Tsinghua University, supervised by Prof. Xiaolin Hu (胡晓林). I am also a member of TSAIL Group directed by Prof. Bo Zhang (张拨) and Prof. Jun zhu (朱军).
I was an intern at Tencent AI Lab , mainly doing research on causal speech separation, supervised by Yi Luo (罗艺).
I got my bachelor’s degree from Department of Computer Technology and Application, Qinghai University, supervised by Prof. Jianqiang Huang (黄建强) and Prof. Chunmei Li (李春梅)
🤗 These works are open source to the best of my ability.
🤗 I am currently doing research on multimodal speech separation, and am interested in other speech tasks (e.g., pre-training models). If you would like to collaborate, please contact me. Many thanks.
🔥 News
- 2022.06: 🧩 One paper is accepted by Neural Computation:
- Inferring mechanisms of auditory attentional modulation with deep neural networks, Ting-Yu Kuo, Yuanda Liao, Kai Li, Bo Hong, Xiaolin Hu.
- 2022.06: 🎉 One paper is accepted by InterSpeech 2022:
- On the Use of Deep Mask Estimation Module for Neural Source Separation Systems, Kai Li, Xiaolin Hu, Yi Luo.
- 2022.06: 🎲 One paper is appeared by Arxiv:
- On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems, Kai Li, Yi Luo.
- 2022.05: 🧩 One paper to submit in Nature Machine Intelligence.
- 2022.03: 🎉 We won the first prize 🥇 of the Global College Student Supercomputer Challenge (ASC22)
- 2022.03: 🧩 One paper to submit in IEEE Transactions on Industrial Informatics.
- 2022.03: 🧩🧩 Two paper to submit in Interspeech 2022.
Click for More
- 2021.10: 🎉 paper is accepted by NeurIPS 2021:
- Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network, Xiaolin Hu$^*$, Kai Li$^*$, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann.
- 2021.05: 🎉 We won the 5% of the [Global College Student Supercomputer Challenge (ASC20-21)](http://www.asc-events.net/ASC20-21/Finals.php)
- 2021.01: 🎉 We won the first prize 🥇 of the [Global College Student Supercomputer Challenge (ASC20-21)](http://www.asc-events.net/ASC20-21/Finals.php)
- 2020.06: 🎉 Outstanding Bachelor Thesis Award, Qinghai University of Computer Science and Technology !
- 2020.06: 🎉 Outstanding Graduates, Qinghai University of Computer Science and Technology !
- 2020.04: 🧩 One paper is accepted by IET image processing:
- A Survey of Single Image Super Resolution Reconstruction, Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang, Xiaoying Wang.
- 2020.01: 🏢 I am an algorithm intern at Moyin Technology
for speech dirazation and voiceprint recognition.
- 2019.11: 🧩 One paper is accepted by ISPA2019:
- Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training, Jianqiang Huang$^*$, Kai Li$^*$, Xiaoying Wang.
- 2019.11: 🎉 We won the first prize 🥇 of the first "Ganqingning" Innovation and Entrepreneurship Competition !
- 2019.11: 🎉 I won the National Scholarship, Ministry of Education, China !
- 2019.05: 🎉 We won the second prize 🥈 in the Natural Academic Paper category of the National College Student Challenge Cup Qinghai Provincial Trial !
- 2019.05: 🎉 We won the first prize 🥇 in the Qinghai Division of the 6th National Youth Science Innovation Experiment and Work Competition !
- 2019.05: 🎉 One paper is accepted by ICDIP2019:
- Single image super resolution based on generative adversarial networks, Kai Li, Liang Ye, Shenghao Yang, Jianqiang Huang, Xiaoying Wang.
- 2019.04: 🧩 I won the second prize 🥈 at the provincial level in the Blue Bridge Cup Java Group A!
- 2018.12: 🎉 We won the first prize 🥇 of natural academic paper in the first "Principals Cup" Innovation and Entrepreneurship Competition in Qinghai Province !
🧰 Project
📂 Datasets

LRS3-For-Speech-Separation
Kai Li
- Open source audio-visual dataset processing script. Following are the steps to generate training and testing data. There are several parameters to change in order to match different purpose.
🎤 Audio-only Speech Separation Methods

DPRNN-Pytorch
Kai Li
- Dual-path RNN. Efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch.


Conv-TasNet
Github Repo | | 知乎: Conv-TasNet阅读笔记
Kai Li
- Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch’s Implement.

UtterancePIT
Kai Li
- According to funcwj’s uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.

Deep Clustering
Kai Li
- Deep clustering in the field of speech separation implemented by pytorch.

AFRCNN
Github Repo | | 知乎: AFRCNN阅读笔记
Kai Li
- Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.
🎬 Audio-visual Speech Separation Methods

Looking to Listen at the Cocktail Party
Kai Li
- The project is an audiovisual model reproduced by the contents of the paper Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation.
📖 Tutorial

Speech-Separation-Paper-Tutorial
Kai Li
- A must-read paper and tutorial list for speech separation based on neural networks.
📝 Publications
( * equal contribution, # corresponding author)
🎙 Speech Separation

Inferring mechanisms of auditory attentional modulation with deep neural networks
Ting-Yu Kuo, Yuanda Liao, Kai Li, Bo Hong, Xiaolin Hu.

An Auditory-Visual Speech Separation Model Inspired by Cortico-thalamo-cortical Circuits
Kai Li, Fenghua Xie, Hang Chen, Kexin Yuan, Xiaolin Hu.

On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li, Xiaolin Hu, Yi Luo.


Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu*, #, Kai Li$^*$, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann.
Audio Demo Page | Speech Enhancement Demo | | Citations: 6

Research on Speech Separation Based on Audio Visual Model
Kai Li, Xiaolin Hu#, Jianqiang Huang#.
📷 Image Super Resolution

Kai Li, Jianqiang Huang#, Jinfang Jia, Yu Zhu, Li Wu, Xiaoying Wang.

A Survey of Single Image Super Resolution Reconstruction
Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang#, Xiaoying Wang.
Citations: 25

Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training
Jianqiang Huang*, #, Kai Li$^*$, Xiaoying Wang.
Citations: 1

Single image super resolution based on generative adversarial networks
Kai Li, Liang Ye, Shenghao Yang, Jianqiang Huang#, Xiaoying Wang.
🎖 Honors and Awards
- 2020.06 Outstanding Bachelor Thesis Award (Top 1%)
- 2020.06 Outstanding Graduates (Top 1%)
- 2019.11 National Scholarship (Top 1%)
📖 Educations
- 2021.09 - now, Master, Tsinghua University, Beijing.
- 2016.09 - 2020.06, Undergraduate, Department of Computer Technology and Application, Qinghai Univeristy, Xining.
- 2013.09 - 2016.06, Zhengzhou fourth Middle School, Zhengzhou.
💻 Internships
- 2021.07 - 2022.01, Tencent AI Lab, Beijing.
- 2020.09 - 2021.01, Moyincloud, Beijing.
🧑🏫 Teaching
2022 Fall, Head TA in Introduction to Deep Learning (00240332), instructed by Prof. Xiaolin Hu
© Kai Li | Last updated: July 22th, 2022 | Theme by Yi Ren