Olivier Sigaud
Advanced Reinforcement Learning: Reinforcement Learning with Prior Data (RLPD)
12:52
Olivier Sigaud
TD-MPC
19:05
Olivier Sigaud
Combining direct policy search and reinforcement learning: population-based training
9:23
Olivier Sigaud
Combining direct policy search and reinforcement learning: optimizing diversity
15:18
Olivier Sigaud
Combining direct policy search and reinforcement learning: optimizing actions
12:02
Olivier Sigaud
Combining direct policy search and reinforcement learning: optimizing policies
13:03
Olivier Sigaud
Direct policy search and reinforcement learning: search spaces and sample reuse
9:16
Olivier Sigaud
Direct policy search and reinforcement learning: taking better steps
12:07
Olivier Sigaud
Direct policy search and reinforcement learning: details about the policy gradient
9:03
Olivier Sigaud
Direct policy search and reinforcement learning: a quick overview of direct policy search methods
14:29
Olivier Sigaud
Direct policy search and reinforcement learning: introduction
9:34
Olivier Sigaud
Goal-conditioned reinforcement learning: state-based goal reachers
12:52
Olivier Sigaud
Goal-conditioned reinforcement learning: hindsight experience replay
7:48
Olivier Sigaud
Goal-conditioned reinforcement learning: curriculum
14:05
Olivier Sigaud
Goal-conditioned reinforcement learning: skill learners
12:34
Olivier Sigaud
Goal-conditioned reinforcement learning: typology of setters
11:12
Olivier Sigaud
Goal-conditioned reinforcement learning: frameworks and core concepts
15:37
Olivier Sigaud
Goal-conditioned reinforcement learning: Introduction
10:47
Olivier Sigaud
IMOL 2023 presentation: Towards Inferential Social Learning in Teachable Autotelic Agents
39:30
Olivier Sigaud
Data collection in SB3
28:17
Olivier Sigaud
Advantage Actor Critic
9:29
Olivier Sigaud
From Policy Gradient to Actor-Critic: Introduction (RLVS 2021 version)
5:57
Olivier Sigaud
Policy Gradient and Actor-Critic: wrap-up (RLVS 2021 version)
4:46
Olivier Sigaud
Policy Gradient and Reward Weighted Regression (RLVS 2021 version)
4:23
Olivier Sigaud
SAC and TQC (RLVS 2021 version)
14:17
Olivier Sigaud
DDPG and TD3 (RLVS 2021 version)
16:53
Olivier Sigaud
Proximal Policy Optimization (RLVS 2021 version)
8:43
Olivier Sigaud
TRPO and ACKTR (RLVS 2021 version)
11:05
Olivier Sigaud
On-Policy versus Off-Policy (RLVS 2021 version)
12:50
Olivier Sigaud
The bias-variance trade-off in Reinforcement Learning (RLVS 2021 version)
9:44
Olivier Sigaud
From Policy Gradient with baseline to Actor-Critic (RLVS 2021 version)
9:42
Olivier Sigaud
Policy Gradient Derivation (part 3/3) (RLVS 2021 version)
6:56
Olivier Sigaud
Policy Gradient Derivation (part 2/3) (RLVS 2021 version)
9:43
Olivier Sigaud
Policy Gradient derivation (part 1/3) (RLVS 2021 version)
12:18
Olivier Sigaud
The Policy Search Problem (RLVS 2021 version)
7:53
Olivier Sigaud
Coding tips for the Basic Policy Gradient lab
41:09
Olivier Sigaud
Radial Basis Function Networks: useful tips for labs
12:58
Olivier Sigaud
Hindsight Experience Replay
14:46
Olivier Sigaud
Deep Reinforcement Learning Class: Conclusion
17:23
Olivier Sigaud
Soft Actor Critic
19:04
Olivier Sigaud
Deep Policy Search Class: TRPO and PPO
13:18
Olivier Sigaud
Deep Policy Search Class: Direct Policy Search versus Policy Gradient
26:18
Olivier Sigaud
Deep Policy Search Class: Introduction
3:52
Olivier Sigaud
Reinforcement Learning Class: Off-policy and Replay Buffer
19:58
Olivier Sigaud
Dynamic Programming
12:33