VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music
WAVLab
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music
1:13
[Fall 2023] Speech Recognition and Understanding (Lecture2: Introduction of Speech Recognition)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture2: Introduction of Speech Recognition)
40:30
[Fall 2023] Speech Recognition and Understanding (Lecture 22: Advanced topics on end-to-end ASR II)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 22: Advanced topics on end-to-end ASR II)
35:50
[Fall 2023] Speech Recognition and Understanding (Lecture 4: Feature extraction)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 4: Feature extraction)
53:27
[Fall 2023] Speech Recognition and Understanding (Lecture 5: Alignments)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 5: Alignments)
45:38
[Fall 2023] Speech Recognition and Understanding (Lecture 9: Hidden Markov Models part III)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 9: Hidden Markov Models part III)
48:36
[Fall 2023] Speech Recognition and Understanding (Lecture 10: Forward-backward algorithm for CTC)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 10: Forward-backward algorithm for CTC)
34:58
[Fall 2023] Speech Recognition and Understanding (Lecture 13: Search)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 13: Search)
50:09
[Fall 2023] Speech Recognition and Understanding (Lecture 7: Hidden Markov Models)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 7: Hidden Markov Models)
53:43
[Fall 2023] Speech Recognition and Understanding (Lecture 19: End-to-End ASR: CTC)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 19: End-to-End ASR: CTC)
1:11:43
[Fall 2023] Speech Recognition and Understanding (Lecture 21: Advanced topics on end-to-end ASR)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 21: Advanced topics on end-to-end ASR)
46:56
[Fall 2023] Speech Recognition and Understanding (Lecture 18: End-to-End ASR: Attention)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 18: End-to-End ASR: Attention)
58:27
[Fall 2023] Speech Recognition and Understanding (Lecture 8: Hidden Markov Models part II)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 8: Hidden Markov Models part II)
56:38
[Fall 2023] Speech Recognition and Understanding (Lecture 11: N-gram Language Models)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 11: N-gram Language Models)
1:06:09
[Fall 2023] Speech Recognition and Understanding (Lecture 20: End-to-End ASR: RNN Transducer)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 20: End-to-End ASR: RNN Transducer)
46:45
[Fall 2023] Speech Recognition and Understanding (Lecture 3: Speech Recognition Formulation)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 3: Speech Recognition Formulation)
46:43
[Fall 2023] Speech Recognition and Understanding (Lecture 16: DNN for Acoustic Modeling)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 16: DNN for Acoustic Modeling)
52:04
[Fall 2023] Speech Recognition and Understanding (Lecture 17: Neural Network Language Model)
WAVLab
[Fall 2023] Speech Recognition and Understanding (Lecture 17: Neural Network Language Model)
59:10
Fall2022-SpeechRecognition&Understanding (Lecture 22 - Search)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture 22 - Search)
1:04:16
Fall2022-SpeechRecognition&Understanding (Lecture 21 - Advanced topics on end-to-end ASR)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture 21 - Advanced topics on end-to-end ASR)
1:11:21
Fall2022-SpeechRecognition&Understanding (Lecture19 - End-to-End ASR: CTC)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture19 - End-to-End ASR: CTC)
1:06:51
Fall2022-SpeechRecognition&Understanding (Lecture20 - End-to-End ASR: RNN Transducer)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture20 - End-to-End ASR: RNN Transducer)
1:00:43
Fall2022-SpeechRecognition&Understanding (Lecture16 - Neural Networks for Acoustic Modeling)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture16 - Neural Networks for Acoustic Modeling)
1:02:13
Fall2022-SpeechRecognition&Understanding (Lecture17 - Neural Network Language Modell)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture17 - Neural Network Language Modell)
53:54
Fall2022-SpeechRecognition&Understanding (Lecture18 - End-to-End ASR - Attention)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture18 - End-to-End ASR - Attention)
59:15
Fall2022-SpeechRecognition&Understanding (Lecture13 - N-gram Language Model)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture13 - N-gram Language Model)
46:18
Fall2022-SpeechRecognition&Understanding (Lecture14 - Intro to Deep Learning for Speech Recognition)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture14 - Intro to Deep Learning for Speech Recognition)
44:09
Fall2022-SpeechRecognition&Understanding (Lecture12 - Advanced Acoustic Modeling)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture12 - Advanced Acoustic Modeling)
1:09:50
Fall2022-SpeechRecognition&Understanding (Lecture8 - Alignment Paths)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture8 - Alignment Paths)
1:19:22
Fall2022-SpeechRecognition&Understanding (Lecture11 - Hidden Markov Model III)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture11 - Hidden Markov Model III)
1:00:56
Fall2022-SpeechRecognition&Understanding (Lecture10 - Hidden Markov Model II)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture10 - Hidden Markov Model II)
1:01:19
Fall2022-SpeechRecognition&Understanding (Lecture9 - Hidden Markov Models I)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture9 - Hidden Markov Models I)
1:07:49
Fall2022-SpeechRecognition&Understanding (Lecture7 - ESPnet tutorial2 (New Task))
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture7 - ESPnet tutorial2 (New Task))
33:42
Fall2022-SpeechRecognition&Understanding (Lecture6 - ESPnet tutorial1 (Recipe))
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture6 - ESPnet tutorial1 (Recipe))
1:11:15
Fall2022-SpeechRecognition&Understanding (Lecture5 - Feature Extraction)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture5 - Feature Extraction)
1:13:57
Fall2022-SpeechRecognition&Understanding (Lecture4 - Speech Recognition Formulation)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture4 - Speech Recognition Formulation)
1:09:17
Fall2022-SpeechRecognition&Understanding (Lecture2 - Introduction of Speech Recognition)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture2 - Introduction of Speech Recognition)
1:03:10
Fall2022-SpeechRecognition&Understanding (Lecture1 - Course-overview)
WAVLab
Fall2022-SpeechRecognition&Understanding (Lecture1 - Course-overview)
1:11:59
Interspeech2022-Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
WAVLab
Interspeech2022-Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
14:46
Interspeech2022-Combining Spectral and SSL Features for Low Resource ASR and ST
WAVLab
Interspeech2022-Combining Spectral and SSL Features for Low Resource ASR and ST
14:28
Interspeech2022-Two-Pass Low Latency End-to-End Spoken Language Understanding
WAVLab
Interspeech2022-Two-Pass Low Latency End-to-End Spoken Language Understanding
13:46
Interspeech2022-End-to-End Integration of ASR, SE, and SSL
WAVLab
Interspeech2022-End-to-End Integration of ASR, SE, and SSL
10:33
Interspeech2022-VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
WAVLab
Interspeech2022-VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
14:38
ICASSP2022-ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
WAVLab
ICASSP2022-ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
10:01
ICASSP2022-Joint Speech Recognition and Audio Captioning
WAVLab
ICASSP2022-Joint Speech Recognition and Audio Captioning
15:08
[ACL-IWSLT 2022] CMU's Dialect Speech Translation System
WAVLab
[ACL-IWSLT 2022] CMU's Dialect Speech Translation System
14:38
[CMU Lecture: Speech Recognition and Understanding  (Fall 2021)] ESPnet Tutorial by Shinji Watanabe
WAVLab
[CMU Lecture: Speech Recognition and Understanding (Fall 2021)] ESPnet Tutorial by Shinji Watanabe
1:20:02
[音学シンポジウム2021] エンドツーエンドニューラルネットワークによる音声処理の一体化 by 渡部晋治
WAVLab
[音学シンポジウム2021] エンドツーエンドニューラルネットワークによる音声処理の一体化 by 渡部晋治
1:00:37
Interspeech2021-Differentiable Allophone Graphs for Language-Universal Speech Recognition
WAVLab
Interspeech2021-Differentiable Allophone Graphs for Language-Universal Speech Recognition
3:01
Interspeech2021-Rethinking End-to-End Evaluation of Decomposable Tasks
WAVLab
Interspeech2021-Rethinking End-to-End Evaluation of Decomposable Tasks
3:00
Interspeech2021-Streaming End-to-End ASR based on Block-wise Non-Autoregressive Models
WAVLab
Interspeech2021-Streaming End-to-End ASR based on Block-wise Non-Autoregressive Models
2:25