Reinforcement Pre-Training (RPT) By Microsoft Explained
AI Papers Academy
Reinforcement Pre-Training (RPT) By Microsoft Explained
8:30
Darwin Gödel Machine Explained: Self-Improving AI Agents
AI Papers Academy
Darwin Gödel Machine Explained: Self-Improving AI Agents
8:07
Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers?
AI Papers Academy
Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers?
9:36
Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM
AI Papers Academy
Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM
8:35
GRPO Reinforcement Learning Explained (DeepSeekMath Paper)
AI Papers Academy
GRPO Reinforcement Learning Explained (DeepSeekMath Paper)
14:38
GRPO 2.0? DAPO LLM Reinforcement Learning Explained
AI Papers Academy
GRPO 2.0? DAPO LLM Reinforcement Learning Explained
13:42
Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained
AI Papers Academy
Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained
8:21
START by Alibaba: Teaching LLMs to Debug Their Thinking with Python
AI Papers Academy
START by Alibaba: Teaching LLMs to Debug Their Thinking with Python
8:04
SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs
AI Papers Academy
SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs
8:31
Large Language Diffusion Models - The Era Of Diffusion LLMs?
AI Papers Academy
Large Language Diffusion Models - The Era Of Diffusion LLMs?
9:29
CoCoMix by Meta AI - The Future of LLMs Pretraining?
AI Papers Academy
CoCoMix by Meta AI - The Future of LLMs Pretraining?
9:33
s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview?
AI Papers Academy
s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview?
8:49
DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI?
AI Papers Academy
DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI?
9:01
DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?
AI Papers Academy
DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?
9:09
Titans by Google: The Era of AI After Transformers?
AI Papers Academy
Titans by Google: The Era of AI After Transformers?
10:53
rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?
AI Papers Academy
rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?
10:23
Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?
AI Papers Academy
Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?
10:23
Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM
AI Papers Academy
Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM
10:07
Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought
AI Papers Academy
Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought
9:41
Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM
AI Papers Academy
Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM
8:57
LLaMA-Mesh by Nvidia: LLM for 3D Mesh Generation
AI Papers Academy
LLaMA-Mesh by Nvidia: LLM for 3D Mesh Generation
5:11
Tokenformer: The Next Generation of Transformers?
AI Papers Academy
Tokenformer: The Next Generation of Transformers?
6:53
Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI
AI Papers Academy
Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI
7:51
Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval
AI Papers Academy
Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval
4:51
Sapiens by Meta AI: Foundation for Human Vision Models
AI Papers Academy
Sapiens by Meta AI: Foundation for Human Vision Models
4:33
Mixture of Nested Experts by Google: Efficient Alternative To MoE?
AI Papers Academy
Mixture of Nested Experts by Google: Efficient Alternative To MoE?
7:37
Introduction to Mixture-of-Experts | Original MoE Paper Explained
AI Papers Academy
Introduction to Mixture-of-Experts | Original MoE Paper Explained
4:41
Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities
AI Papers Academy
Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities
3:54
Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained
AI Papers Academy
Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained
4:52
CLLMs: Consistency Large Language Models | AI Paper Explained
AI Papers Academy
CLLMs: Consistency Large Language Models | AI Paper Explained
7:26
ReFT: Representation Finetuning for Language Models | AI Paper Explained
AI Papers Academy
ReFT: Representation Finetuning for Language Models | AI Paper Explained
7:30
Stealing Part of a Production Language Model | AI Paper Explained
AI Papers Academy
Stealing Part of a Production Language Model | AI Paper Explained
9:21
The Era of 1-bit LLMs by Microsoft | AI Paper Explained
AI Papers Academy
The Era of 1-bit LLMs by Microsoft | AI Paper Explained
6:10
V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model
AI Papers Academy
V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model
11:35
Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI?
AI Papers Academy
Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI?
6:50
Fast Inference of Mixture-of-Experts Language Models with Offloading
AI Papers Academy
Fast Inference of Mixture-of-Experts Language Models with Offloading
11:59
TinyGPT-V: Small but Mighty Multimodal Large Language Model
AI Papers Academy
TinyGPT-V: Small but Mighty Multimodal Large Language Model
5:23
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
AI Papers Academy
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
6:28
Vision Transformers Explained | The ViT Paper
AI Papers Academy
Vision Transformers Explained | The ViT Paper
4:32
Orca 2 by Microsoft: Teaching Small Language Models How to Reason
AI Papers Academy
Orca 2 by Microsoft: Teaching Small Language Models How to Reason
6:21
LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models
AI Papers Academy
LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models
5:50
CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation
AI Papers Academy
CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation
5:27
Table-GPT by Microsoft: Empower LLMs To Understand Tables
AI Papers Academy
Table-GPT by Microsoft: Empower LLMs To Understand Tables
9:31
Vision Transformers Need Registers - Fixing a Bug in DINOv2?
AI Papers Academy
Vision Transformers Need Registers - Fixing a Bug in DINOv2?
9:20
Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
AI Papers Academy
Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
6:01
NExT-GPT: Any-to-Any Multimodal LLM
AI Papers Academy
NExT-GPT: Any-to-Any Multimodal LLM
9:14
Large Language Models As Optimizers - OPRO by Google DeepMind
AI Papers Academy
Large Language Models As Optimizers - OPRO by Google DeepMind
6:28
FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark
AI Papers Academy
FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark
6:46
Code Llama Paper Explained
AI Papers Academy
Code Llama Paper Explained
8:32
WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct
AI Papers Academy
WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct
8:26
Shepherd by Meta AI - A Critic for Large Language Models
AI Papers Academy
Shepherd by Meta AI - A Critic for Large Language Models
7:36
Soft Mixture of Experts - An Efficient Sparse Transformer
AI Papers Academy
Soft Mixture of Experts - An Efficient Sparse Transformer
7:31
Universal and Transferable LLM Attacks - A New Threat to AI Safety
AI Papers Academy
Universal and Transferable LLM Attacks - A New Threat to AI Safety
6:43
Meta-Transformer: A Unified Framework for Multimodal Learning
AI Papers Academy
Meta-Transformer: A Unified Framework for Multimodal Learning
6:36
Google HyperDreamBooth - HyperNetworks for Fast Personalization of Text-to-Image Models
AI Papers Academy
Google HyperDreamBooth - HyperNetworks for Fast Personalization of Text-to-Image Models
7:36
LongNet from Microsoft - 1B Tokens Transformer with Dilated Attention
AI Papers Academy
LongNet from Microsoft - 1B Tokens Transformer with Dilated Attention
5:12
DreamDiffusion - Thought to Image Generation | Paper Summary
AI Papers Academy
DreamDiffusion - Thought to Image Generation | Paper Summary
6:51
Wanda Network Pruning - Prune LLMs Efficiently
AI Papers Academy
Wanda Network Pruning - Prune LLMs Efficiently
8:30
I-JEPA from Meta AI - A Human-Like Computer Vision Model | Paper Summary
AI Papers Academy
I-JEPA from Meta AI - A Human-Like Computer Vision Model | Paper Summary
8:17
Orca from Microsoft - The Future of Imitation Learning?
AI Papers Academy
Orca from Microsoft - The Future of Imitation Learning?
5:55
StyleDrop from Google AI - Text-to-Image Generation in Any Style!
AI Papers Academy
StyleDrop from Google AI - Text-to-Image Generation in Any Style!
6:37
LIMA from Meta AI - Less Is More for Alignment of LLMs
AI Papers Academy
LIMA from Meta AI - Less Is More for Alignment of LLMs
6:09
ImageBind from Meta AI - One Embedding Space To Bind Them All
AI Papers Academy
ImageBind from Meta AI - One Embedding Space To Bind Them All
6:02
MPT Model - Extrapolate LLM Context with ALiBi
AI Papers Academy
MPT Model - Extrapolate LLM Context with ALiBi
6:02
YOLO-NAS - A New Best Object Detection Model!
AI Papers Academy
YOLO-NAS - A New Best Object Detection Model!
4:58
Will we need a stronger phone for AI?
AI Papers Academy
Will we need a stronger phone for AI?
0:53
How WizardLM Got Better Results Than ChatGPT For Complex Instructions?
AI Papers Academy
How WizardLM Got Better Results Than ChatGPT For Complex Instructions?
4:38
DINOv2 from Meta AI - Finally a Foundational Model in Computer Vision?
AI Papers Academy
DINOv2 from Meta AI - Finally a Foundational Model in Computer Vision?
7:31
Introduction to Consistency Models
AI Papers Academy
Introduction to Consistency Models
5:20
What is a Topological Sort of a Graph and how to find it using Kahn's Algorithm
AI Papers Academy
What is a Topological Sort of a Graph and how to find it using Kahn's Algorithm
17:58
DFS Algorithm | Depth First Search Algorithm for Graph Search With Animated Example
AI Papers Academy
DFS Algorithm | Depth First Search Algorithm for Graph Search With Animated Example
22:16
BFS Algorithm | Breadth First Search Algorithm for Graph Search
AI Papers Academy
BFS Algorithm | Breadth First Search Algorithm for Graph Search
14:04
Graphs Representations - Adjacency Lists vs Adjacency Matrix
AI Papers Academy
Graphs Representations - Adjacency Lists vs Adjacency Matrix
16:10
Introduction to Computer Science | From Algorithm to Running a Program
AI Papers Academy
Introduction to Computer Science | From Algorithm to Running a Program
11:25
HTML Tables Tutorial | How To Create and Customize Tables with HTML
AI Papers Academy
HTML Tables Tutorial | How To Create and Customize Tables with HTML
10:43
Introduction to HTML - What are Tags, Elements and Attributes | HTML Document Structure
AI Papers Academy
Introduction to HTML - What are Tags, Elements and Attributes | HTML Document Structure
10:56