AI Papers Academy

Reinforcement Pre-Training (RPT) By Microsoft Explained

AI Papers Academy

Reinforcement Pre-Training (RPT) By Microsoft Explained

8:30

Darwin Gödel Machine Explained: Self-Improving AI Agents

AI Papers Academy

Darwin Gödel Machine Explained: Self-Improving AI Agents

8:07

Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers?

AI Papers Academy

Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers?

9:36

Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM

AI Papers Academy

Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM

8:35

GRPO Reinforcement Learning Explained (DeepSeekMath Paper)

AI Papers Academy

GRPO Reinforcement Learning Explained (DeepSeekMath Paper)

14:38

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

AI Papers Academy

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

13:42

Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained

AI Papers Academy

Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained

8:21

START by Alibaba: Teaching LLMs to Debug Their Thinking with Python

AI Papers Academy

START by Alibaba: Teaching LLMs to Debug Their Thinking with Python

8:04

SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs

AI Papers Academy

SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs

8:31

Large Language Diffusion Models - The Era Of Diffusion LLMs?

AI Papers Academy

Large Language Diffusion Models - The Era Of Diffusion LLMs?

9:29

CoCoMix by Meta AI - The Future of LLMs Pretraining?

AI Papers Academy

CoCoMix by Meta AI - The Future of LLMs Pretraining?

9:33

s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview?

AI Papers Academy

s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview?

8:49

DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI?

AI Papers Academy

DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI?

9:01

DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?

AI Papers Academy

DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?

9:09

Titans by Google: The Era of AI After Transformers?

AI Papers Academy

Titans by Google: The Era of AI After Transformers?

10:53

rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?

AI Papers Academy

rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?

10:23

Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?

AI Papers Academy

Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?

10:23

Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM

AI Papers Academy

Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM

10:07

Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought

AI Papers Academy

Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought

9:41

Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM

AI Papers Academy

Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM

8:57

LLaMA-Mesh by Nvidia: LLM for 3D Mesh Generation

AI Papers Academy

LLaMA-Mesh by Nvidia: LLM for 3D Mesh Generation

5:11

Tokenformer: The Next Generation of Transformers?

AI Papers Academy

Tokenformer: The Next Generation of Transformers?

6:53

Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI

AI Papers Academy

Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI

7:51

Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval

AI Papers Academy

Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval

4:51

Sapiens by Meta AI: Foundation for Human Vision Models

AI Papers Academy

Sapiens by Meta AI: Foundation for Human Vision Models

4:33

Mixture of Nested Experts by Google: Efficient Alternative To MoE?

AI Papers Academy

Mixture of Nested Experts by Google: Efficient Alternative To MoE?

7:37

Introduction to Mixture-of-Experts | Original MoE Paper Explained

AI Papers Academy

Introduction to Mixture-of-Experts | Original MoE Paper Explained

4:41

Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities

AI Papers Academy

Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities

3:54

Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained

AI Papers Academy

Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained

4:52

CLLMs: Consistency Large Language Models | AI Paper Explained

AI Papers Academy

CLLMs: Consistency Large Language Models | AI Paper Explained

7:26

ReFT: Representation Finetuning for Language Models | AI Paper Explained

AI Papers Academy

ReFT: Representation Finetuning for Language Models | AI Paper Explained

7:30

Stealing Part of a Production Language Model | AI Paper Explained

AI Papers Academy

Stealing Part of a Production Language Model | AI Paper Explained

9:21

The Era of 1-bit LLMs by Microsoft | AI Paper Explained

AI Papers Academy

The Era of 1-bit LLMs by Microsoft | AI Paper Explained

6:10

V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model

AI Papers Academy

V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model

11:35

Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI?

AI Papers Academy

Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI?

6:50

Fast Inference of Mixture-of-Experts Language Models with Offloading

AI Papers Academy

Fast Inference of Mixture-of-Experts Language Models with Offloading

11:59

TinyGPT-V: Small but Mighty Multimodal Large Language Model

AI Papers Academy

TinyGPT-V: Small but Mighty Multimodal Large Language Model

5:23

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

AI Papers Academy

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

6:28

Vision Transformers Explained | The ViT Paper

AI Papers Academy

Vision Transformers Explained | The ViT Paper

4:32

Orca 2 by Microsoft: Teaching Small Language Models How to Reason

AI Papers Academy

Orca 2 by Microsoft: Teaching Small Language Models How to Reason

6:21

LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models

AI Papers Academy

LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models

5:50

CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation

AI Papers Academy

CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation

5:27

Table-GPT by Microsoft: Empower LLMs To Understand Tables

AI Papers Academy

Table-GPT by Microsoft: Empower LLMs To Understand Tables

9:31

Vision Transformers Need Registers - Fixing a Bug in DINOv2?

AI Papers Academy

Vision Transformers Need Registers - Fixing a Bug in DINOv2?

9:20

Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

AI Papers Academy

Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

6:01

NExT-GPT: Any-to-Any Multimodal LLM

AI Papers Academy

NExT-GPT: Any-to-Any Multimodal LLM

9:14

Large Language Models As Optimizers - OPRO by Google DeepMind

AI Papers Academy

Large Language Models As Optimizers - OPRO by Google DeepMind

6:28

FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark

AI Papers Academy

FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark

6:46

Code Llama Paper Explained

AI Papers Academy

Code Llama Paper Explained

8:32

WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct

AI Papers Academy

WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct

8:26

Shepherd by Meta AI - A Critic for Large Language Models

AI Papers Academy

Shepherd by Meta AI - A Critic for Large Language Models

7:36

Soft Mixture of Experts - An Efficient Sparse Transformer

AI Papers Academy

Soft Mixture of Experts - An Efficient Sparse Transformer

7:31

Universal and Transferable LLM Attacks - A New Threat to AI Safety

AI Papers Academy

Universal and Transferable LLM Attacks - A New Threat to AI Safety

6:43

Meta-Transformer: A Unified Framework for Multimodal Learning

AI Papers Academy

Meta-Transformer: A Unified Framework for Multimodal Learning

6:36

Google HyperDreamBooth - HyperNetworks for Fast Personalization of Text-to-Image Models

AI Papers Academy

Google HyperDreamBooth - HyperNetworks for Fast Personalization of Text-to-Image Models

7:36

LongNet from Microsoft - 1B Tokens Transformer with Dilated Attention

AI Papers Academy

LongNet from Microsoft - 1B Tokens Transformer with Dilated Attention

5:12

DreamDiffusion - Thought to Image Generation | Paper Summary

AI Papers Academy

DreamDiffusion - Thought to Image Generation | Paper Summary

6:51

Wanda Network Pruning - Prune LLMs Efficiently

AI Papers Academy

Wanda Network Pruning - Prune LLMs Efficiently

8:30

I-JEPA from Meta AI - A Human-Like Computer Vision Model | Paper Summary

AI Papers Academy

I-JEPA from Meta AI - A Human-Like Computer Vision Model | Paper Summary

8:17

Orca from Microsoft - The Future of Imitation Learning?

AI Papers Academy

Orca from Microsoft - The Future of Imitation Learning?

5:55

StyleDrop from Google AI - Text-to-Image Generation in Any Style!

AI Papers Academy

StyleDrop from Google AI - Text-to-Image Generation in Any Style!

6:37

LIMA from Meta AI - Less Is More for Alignment of LLMs

AI Papers Academy

LIMA from Meta AI - Less Is More for Alignment of LLMs

6:09

ImageBind from Meta AI - One Embedding Space To Bind Them All

AI Papers Academy

ImageBind from Meta AI - One Embedding Space To Bind Them All

6:02

MPT Model - Extrapolate LLM Context with ALiBi

AI Papers Academy

MPT Model - Extrapolate LLM Context with ALiBi

6:02

YOLO-NAS - A New Best Object Detection Model!

AI Papers Academy

YOLO-NAS - A New Best Object Detection Model!

4:58

Will we need a stronger phone for AI?

AI Papers Academy

Will we need a stronger phone for AI?

0:53

How WizardLM Got Better Results Than ChatGPT For Complex Instructions?

AI Papers Academy

How WizardLM Got Better Results Than ChatGPT For Complex Instructions?

4:38

DINOv2 from Meta AI - Finally a Foundational Model in Computer Vision?

AI Papers Academy

DINOv2 from Meta AI - Finally a Foundational Model in Computer Vision?

7:31

Introduction to Consistency Models

AI Papers Academy

Introduction to Consistency Models

5:20

What is a Topological Sort of a Graph and how to find it using Kahn's Algorithm

AI Papers Academy

What is a Topological Sort of a Graph and how to find it using Kahn's Algorithm

17:58

DFS Algorithm | Depth First Search Algorithm for Graph Search With Animated Example

AI Papers Academy

DFS Algorithm | Depth First Search Algorithm for Graph Search With Animated Example

22:16

BFS Algorithm | Breadth First Search Algorithm for Graph Search

AI Papers Academy

BFS Algorithm | Breadth First Search Algorithm for Graph Search

14:04

Graphs Representations - Adjacency Lists vs Adjacency Matrix

AI Papers Academy

Graphs Representations - Adjacency Lists vs Adjacency Matrix

16:10

Introduction to Computer Science | From Algorithm to Running a Program

AI Papers Academy

Introduction to Computer Science | From Algorithm to Running a Program

11:25

HTML Tables Tutorial | How To Create and Customize Tables with HTML

AI Papers Academy

HTML Tables Tutorial | How To Create and Customize Tables with HTML

10:43

Introduction to HTML - What are Tags, Elements and Attributes | HTML Document Structure

AI Papers Academy

Introduction to HTML - What are Tags, Elements and Attributes | HTML Document Structure

10:56

次のページ