WAVLab

VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music

1:13

[Fall 2023] Speech Recognition and Understanding (Lecture2: Introduction of Speech Recognition)

40:30

[Fall 2023] Speech Recognition and Understanding (Lecture 22: Advanced topics on end-to-end ASR II)

35:50

[Fall 2023] Speech Recognition and Understanding (Lecture 4: Feature extraction)

53:27

[Fall 2023] Speech Recognition and Understanding (Lecture 5: Alignments)

45:38

[Fall 2023] Speech Recognition and Understanding (Lecture 9: Hidden Markov Models part III)

48:36

[Fall 2023] Speech Recognition and Understanding (Lecture 10: Forward-backward algorithm for CTC)

34:58

[Fall 2023] Speech Recognition and Understanding (Lecture 13: Search)

50:09

[Fall 2023] Speech Recognition and Understanding (Lecture 7: Hidden Markov Models)

53:43

[Fall 2023] Speech Recognition and Understanding (Lecture 19: End-to-End ASR: CTC)

1:11:43

[Fall 2023] Speech Recognition and Understanding (Lecture 21: Advanced topics on end-to-end ASR)

46:56

[Fall 2023] Speech Recognition and Understanding (Lecture 18: End-to-End ASR: Attention)

58:27

[Fall 2023] Speech Recognition and Understanding (Lecture 8: Hidden Markov Models part II)

56:38

[Fall 2023] Speech Recognition and Understanding (Lecture 11: N-gram Language Models)

1:06:09

[Fall 2023] Speech Recognition and Understanding (Lecture 20: End-to-End ASR: RNN Transducer)

46:45

[Fall 2023] Speech Recognition and Understanding (Lecture 3: Speech Recognition Formulation)

46:43

[Fall 2023] Speech Recognition and Understanding (Lecture 16: DNN for Acoustic Modeling)

52:04

[Fall 2023] Speech Recognition and Understanding (Lecture 17: Neural Network Language Model)

59:10

Fall2022-SpeechRecognition&Understanding (Lecture 22 - Search)

1:04:16

Fall2022-SpeechRecognition&Understanding (Lecture 21 - Advanced topics on end-to-end ASR)

1:11:21

Fall2022-SpeechRecognition&Understanding (Lecture19 - End-to-End ASR: CTC)

1:06:51

Fall2022-SpeechRecognition&Understanding (Lecture20 - End-to-End ASR: RNN Transducer)

1:00:43

Fall2022-SpeechRecognition&Understanding (Lecture16 - Neural Networks for Acoustic Modeling)

1:02:13

Fall2022-SpeechRecognition&Understanding (Lecture17 - Neural Network Language Model)

53:54

Fall2022-SpeechRecognition&Understanding (Lecture18 - End-to-End ASR - Attention)

59:15

Fall2022-SpeechRecognition&Understanding (Lecture13 - N-gram Language Model)

46:18

Fall2022-SpeechRecognition&Understanding (Lecture14 - Intro to Deep Learning for Speech Recognition)

44:09

Fall2022-SpeechRecognition&Understanding (Lecture12 - Advanced Acoustic Modeling)

1:09:50

Fall2022-SpeechRecognition&Understanding (Lecture8 - Alignment Paths)

1:19:22

Fall2022-SpeechRecognition&Understanding (Lecture11 - Hidden Markov Model III)

1:00:56

Fall2022-SpeechRecognition&Understanding (Lecture10 - Hidden Markov Model II)

1:01:19

Fall2022-SpeechRecognition&Understanding (Lecture9 - Hidden Markov Models I)

1:07:49

Fall2022-SpeechRecognition&Understanding (Lecture7 - ESPnet tutorial2 (New Task))

33:42

Fall2022-SpeechRecognition&Understanding (Lecture6 - ESPnet tutorial1 (Recipe))

1:11:15

Fall2022-SpeechRecognition&Understanding (Lecture5 - Feature Extraction)

1:13:57

Fall2022-SpeechRecognition&Understanding (Lecture4 - Speech Recognition Formulation)

1:09:17

Fall2022-SpeechRecognition&Understanding (Lecture2 - Introduction of Speech Recognition)

1:03:10

Fall2022-SpeechRecognition&Understanding (Lecture1 - Course-overview)

1:11:59

Interspeech2022-Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis

14:46

Interspeech2022-Combining Spectral and SSL Features for Low Resource ASR and ST

14:28

Interspeech2022-Two-Pass Low Latency End-to-End Spoken Language Understanding

13:46

Interspeech2022-End-to-End Integration of ASR, SE, and SSL

10:33

Interspeech2022-VQ-T: RNN Transducers using Vector-Quantized Prediction Network States

14:38

ICASSP2022-ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet

10:01

ICASSP2022-Joint Speech Recognition and Audio Captioning

15:08

[ACL-IWSLT 2022] CMU's Dialect Speech Translation System

14:38

[CMU Lecture: Speech Recognition and Understanding (Fall 2021)] ESPnet Tutorial by Shinji Watanabe

1:20:02

[Ongaku Symposium 2021] Integration of Speech Processing via End-to-End Neural Networks by Shinji Watanabe

1:00:37

Interspeech2021-Differentiable Allophone Graphs for Language-Universal Speech Recognition

3:01

Interspeech2021-Rethinking End-to-End Evaluation of Decomposable Tasks

3:00

Interspeech2021-Streaming End-to-End ASR based on Block-wise Non-Autoregressive Models

2:25
