AI Papers Academy
Reinforcement Pre-Training (RPT) By Microsoft Explained
8:30
AI Papers Academy
Darwin Gödel Machine Explained: Self-Improving AI Agents
8:07
AI Papers Academy
Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers?
9:36
AI Papers Academy
Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM
8:35
AI Papers Academy
GRPO Reinforcement Learning Explained (DeepSeekMath Paper)
14:38
AI Papers Academy
GRPO 2.0? DAPO LLM Reinforcement Learning Explained
13:42
AI Papers Academy
Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained
8:21
AI Papers Academy
START by Alibaba: Teaching LLMs to Debug Their Thinking with Python
8:04
AI Papers Academy
SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs
8:31
AI Papers Academy
Large Language Diffusion Models - The Era Of Diffusion LLMs?
9:29
AI Papers Academy
CoCoMix by Meta AI - The Future of LLMs Pretraining?
9:33
AI Papers Academy
s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview?
8:49
AI Papers Academy
DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI?
9:01
AI Papers Academy
DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?
9:09
AI Papers Academy
Titans by Google: The Era of AI After Transformers?
10:53
AI Papers Academy
rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?
10:23
AI Papers Academy
Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?
10:23
AI Papers Academy
Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM
10:07
AI Papers Academy
Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought
9:41
AI Papers Academy
Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM
8:57
AI Papers Academy
LLaMA-Mesh by Nvidia: LLM for 3D Mesh Generation
5:11
AI Papers Academy
Tokenformer: The Next Generation of Transformers?
6:53
AI Papers Academy
Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI
7:51
AI Papers Academy
Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval
4:51
AI Papers Academy
Sapiens by Meta AI: Foundation for Human Vision Models
4:33
AI Papers Academy
Mixture of Nested Experts by Google: Efficient Alternative To MoE?
7:37
AI Papers Academy
Introduction to Mixture-of-Experts | Original MoE Paper Explained
4:41
AI Papers Academy
Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities
3:54
AI Papers Academy
Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained
4:52
AI Papers Academy
CLLMs: Consistency Large Language Models | AI Paper Explained
7:26
AI Papers Academy
ReFT: Representation Finetuning for Language Models | AI Paper Explained
7:30
AI Papers Academy
Stealing Part of a Production Language Model | AI Paper Explained
9:21
AI Papers Academy
The Era of 1-bit LLMs by Microsoft | AI Paper Explained
6:10
AI Papers Academy
V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model
11:35
AI Papers Academy
Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI?
6:50
AI Papers Academy
Fast Inference of Mixture-of-Experts Language Models with Offloading
11:59
AI Papers Academy
TinyGPT-V: Small but Mighty Multimodal Large Language Model
5:23
AI Papers Academy
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
6:28
AI Papers Academy
Vision Transformers Explained | The ViT Paper
4:32
AI Papers Academy
Orca 2 by Microsoft: Teaching Small Language Models How to Reason
6:21
AI Papers Academy
LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models
5:50
AI Papers Academy
CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation
5:27
AI Papers Academy
Table-GPT by Microsoft: Empower LLMs To Understand Tables
9:31
AI Papers Academy
Vision Transformers Need Registers - Fixing a Bug in DINOv2?
9:20
AI Papers Academy
Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
6:01
AI Papers Academy
NExT-GPT: Any-to-Any Multimodal LLM
9:14
AI Papers Academy
Large Language Models As Optimizers - OPRO by Google DeepMind
6:28
AI Papers Academy
FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark
6:46
AI Papers Academy
Code Llama Paper Explained
8:32
AI Papers Academy
WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct
8:26
AI Papers Academy
Shepherd by Meta AI - A Critic for Large Language Models
7:36
AI Papers Academy
Soft Mixture of Experts - An Efficient Sparse Transformer
7:31
AI Papers Academy
Universal and Transferable LLM Attacks - A New Threat to AI Safety
6:43
AI Papers Academy
Meta-Transformer: A Unified Framework for Multimodal Learning
6:36
AI Papers Academy
Google HyperDreamBooth - HyperNetworks for Fast Personalization of Text-to-Image Models
7:36
AI Papers Academy
LongNet from Microsoft - 1B Tokens Transformer with Dilated Attention
5:12
AI Papers Academy
DreamDiffusion - Thought to Image Generation | Paper Summary
6:51
AI Papers Academy
Wanda Network Pruning - Prune LLMs Efficiently
8:30
AI Papers Academy
I-JEPA from Meta AI - A Human-Like Computer Vision Model | Paper Summary
8:17
AI Papers Academy
Orca from Microsoft - The Future of Imitation Learning?
5:55
AI Papers Academy
StyleDrop from Google AI - Text-to-Image Generation in Any Style!
6:37
AI Papers Academy
LIMA from Meta AI - Less Is More for Alignment of LLMs
6:09
AI Papers Academy
ImageBind from Meta AI - One Embedding Space To Bind Them All
6:02
AI Papers Academy
MPT Model - Extrapolate LLM Context with ALiBi
6:02
AI Papers Academy
YOLO-NAS - A New Best Object Detection Model!
4:58
AI Papers Academy
Will we need a stronger phone for AI?
0:53
AI Papers Academy
How WizardLM Got Better Results Than ChatGPT For Complex Instructions?
4:38
AI Papers Academy
DINOv2 from Meta AI - Finally a Foundational Model in Computer Vision?
7:31
AI Papers Academy
Introduction to Consistency Models
5:20
AI Papers Academy
What is a Topological Sort of a Graph and how to find it using Kahn's Algorithm
17:58
AI Papers Academy
DFS Algorithm | Depth First Search Algorithm for Graph Search With Animated Example
22:16
AI Papers Academy
BFS Algorithm | Breadth First Search Algorithm for Graph Search
14:04
AI Papers Academy
Graphs Representations - Adjacency Lists vs Adjacency Matrix
16:10
AI Papers Academy
Introduction to Computer Science | From Algorithm to Running a Program
11:25
AI Papers Academy
HTML Tables Tutorial | How To Create and Customize Tables with HTML
10:43
AI Papers Academy
Introduction to HTML - What are Tags, Elements and Attributes | HTML Document Structure
10:56