🎊 Welcome to my website!

My name is Kai Li (Chinese name: 李凯). I’m is a first-year master student at Department of Computer Science and Technology, Tsinghua University, supervised by Prof. Xiaolin Hu (胡晓林). I am also a member of TSAIL Group directed by Prof. Bo Zhang (张拨) and Prof. Jun zhu (朱军).

I was an intern at Tencent AI Lab , mainly doing research on causal speech separation, supervised by Yi Luo (罗艺).

I got my bachelor’s degree from Department of Computer Technology and Application, Qinghai University, supervised by Prof. Jianqiang Huang (黄建强) and Prof. Chunmei Li (李春梅)

🤗   These works are open source to the best of my ability.

🤗   I am currently doing research on multimodal speech separation, and am interested in other speech tasks (e.g., pre-training models). If you would like to collaborate, please contact me. Many thanks.

🔥 News

  • 2022.06: 🧩 One paper is accepted by Neural Computation:
    • Inferring mechanisms of auditory attentional modulation with deep neural networks, Ting-Yu Kuo, Yuanda Liao, Kai Li, Bo Hong, Xiaolin Hu.
  • 2022.06: 🎉 One paper is accepted by InterSpeech 2022:
    • On the Use of Deep Mask Estimation Module for Neural Source Separation Systems, Kai Li, Xiaolin Hu, Yi Luo.
  • 2022.06: 🎲 One paper is appeared by Arxiv:
    • On the Design and Training Strategies for RNN-based Online Neural Speech Separation Systems, Kai Li, Yi Luo.
  • 2022.05: 🧩 One paper to submit in Nature Machine Intelligence.
  • 2022.03: 🎉 We won the first prize 🥇 of the Global College Student Supercomputer Challenge (ASC22)
  • 2022.03: 🧩 One paper to submit in IEEE Transactions on Industrial Informatics.
  • 2022.03: 🧩🧩 Two paper to submit in Interspeech 2022.
Click for More
  • 2021.10: 🎉 paper is accepted by NeurIPS 2021:
    • Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network, Xiaolin Hu$^*$, Kai Li$^*$, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann.
  • 2021.05: 🎉 We won the 5% of the [Global College Student Supercomputer Challenge (ASC20-21)](http://www.asc-events.net/ASC20-21/Finals.php)
  • 2021.01: 🎉 We won the first prize 🥇 of the [Global College Student Supercomputer Challenge (ASC20-21)](http://www.asc-events.net/ASC20-21/Finals.php)
  • 2020.06: 🎉 Outstanding Bachelor Thesis Award, Qinghai University of Computer Science and Technology !
  • 2020.06: 🎉 Outstanding Graduates, Qinghai University of Computer Science and Technology !
  • 2020.04: 🧩 One paper is accepted by IET image processing:
    • A Survey of Single Image Super Resolution Reconstruction, Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang, Xiaoying Wang.
  • 2020.01: 🏢 I am an algorithm intern at Moyin Technology for speech dirazation and voiceprint recognition.
  • 2019.11: 🧩 One paper is accepted by ISPA2019:
    • Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training, Jianqiang Huang$^*$, Kai Li$^*$, Xiaoying Wang.
  • 2019.11: 🎉 We won the first prize 🥇 of the first "Ganqingning" Innovation and Entrepreneurship Competition !
  • 2019.11: 🎉 I won the National Scholarship, Ministry of Education, China !
  • 2019.05: 🎉 We won the second prize 🥈 in the Natural Academic Paper category of the National College Student Challenge Cup Qinghai Provincial Trial !
  • 2019.05: 🎉 We won the first prize 🥇 in the Qinghai Division of the 6th National Youth Science Innovation Experiment and Work Competition !
  • 2019.05: 🎉 One paper is accepted by ICDIP2019:
    • Single image super resolution based on generative adversarial networks, Kai Li, Liang Ye, Shenghao Yang, Jianqiang Huang, Xiaoying Wang.
  • 2019.04: 🧩 I won the second prize 🥈 at the provincial level in the Blue Bridge Cup Java Group A!
  • 2018.12: 🎉 We won the first prize 🥇 of natural academic paper in the first "Principals Cup" Innovation and Entrepreneurship Competition in Qinghai Province !

🧰 Project

📂 Datasets

sym

LRS3-For-Speech-Separation

Kai Li

Github Repo | GitHub stars

  • Open source audio-visual dataset processing script. Following are the steps to generate training and testing data. There are several parameters to change in order to match different purpose.

🎤 Audio-only Speech Separation Methods

sym

DPRNN-Pytorch

Github Repo | GitHub stars | 知乎: DPRNN阅读笔记

Kai Li

  • Dual-path RNN. Efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch.
sym

Calculate-SNR-SDR

Github Repo | GitHub stars

Kai Li

  • Calculatie Audio‘s SNR and SDR.
sym

Conv-TasNet

Github Repo | GitHub stars | 知乎: Conv-TasNet阅读笔记

Kai Li

  • Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch’s Implement.
sym

UtterancePIT

Github Repo | GitHub stars | 知乎: uPIT阅读笔记

Kai Li

  • According to funcwj’s uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
sym

Deep Clustering

Github Repo | GitHub stars | 知乎: DPCL阅读笔记

Kai Li

  • Deep clustering in the field of speech separation implemented by pytorch.
sym

AFRCNN

Github Repo | GitHub stars | 知乎: AFRCNN阅读笔记

Kai Li

  • Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.

🎬 Audio-visual Speech Separation Methods

sym

Looking to Listen at the Cocktail Party

Github Repo | GitHub stars | 知乎: DPRNN阅读笔记

Kai Li

  • The project is an audiovisual model reproduced by the contents of the paper Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation.

📖 Tutorial

sym

Speech-Separation-Paper-Tutorial

Github Repo | GitHub stars

Kai Li

  • A must-read paper and tutorial list for speech separation based on neural networks.

📝 Publications

( * equal contribution, # corresponding author)

🎙 Speech Separation

Neural Computation 2022
sym

Inferring mechanisms of auditory attentional modulation with deep neural networks

Ting-Yu Kuo, Yuanda Liao, Kai Li, Bo Hong, Xiaolin Hu.

Submitted Nature Machine Intelligence
sym
Arxiv 2022
sym

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Xiaolin Hu*, #, Kai Li$^*$, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann.

Audio Demo Page | Speech Enhancement Demo | | Citations: 6

Bachelor Degree Thesis
sym

📷 Image Super Resolution

Submitted IEEE Transactions on Industrial Informatics
sym
IET Image Processing 2020
sym

A Survey of Single Image Super Resolution Reconstruction

Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang#, Xiaoying Wang.

Citations: 25

ISPA 2019
sym

Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training

Jianqiang Huang*, #, Kai Li$^*$, Xiaoying Wang.

Citations: 1

ICDIP 2019
sym

Single image super resolution based on generative adversarial networks

Kai Li, Liang Ye, Shenghao Yang, Jianqiang Huang#, Xiaoying Wang.

🎖 Honors and Awards

  • 2020.06 Outstanding Bachelor Thesis Award (Top 1%)
  • 2020.06 Outstanding Graduates (Top 1%)
  • 2019.11 National Scholarship (Top 1%)

📖 Educations

  • 2021.09 - now, Master, Tsinghua University, Beijing.
  • 2016.09 - 2020.06, Undergraduate, Department of Computer Technology and Application, Qinghai Univeristy, Xining.
  • 2013.09 - 2016.06, Zhengzhou fourth Middle School, Zhengzhou.

💻 Internships

🧑‍🏫 Teaching

2022 Fall, Head TA in Introduction to Deep Learning (00240332), instructed by Prof. Xiaolin Hu


© Kai Li | Last updated: July 22th, 2022 | Theme by Yi Ren