Clearly one of the best videos on the topic, the use of examples was really good.
فوق العاده بود آقای امینی
You are one of the best teacher i have ever seen..
Fantastic, very clear and concise. Great work!
I love you guys!
very good way of explaining
Great video. The whole series is very good
Outstanding. Thank you.
It was very clear and helpful.
in 34:35 how do I calculate the log-likelihood of the action given the state?
Thank you!
Very well explained. How to get the slides? The link in the bio mentions coming soon!
Is there a typo at 10:01? Intuitively, it seems like the exponent of γ should (i - t) since, in current formulation, the reward terms will quickly go to 0 when t becomes large.
This is really good. Thank you!
Dude it's awesome T^T
wow, thanks.
Thank you all for these great videos. One thing I want to mention is that the audio volume is a little bit too low
Excellent tutorial indeed
What is max Q (s' , a' ) ? When i have a lot of future states and they are unknown , how can I destinate the max Q ( s' , a' ) ? 24:00
@mmattb