Introduction to Large Language Models

Andrej Karpathy

YouTube

A deep dive into the Large Language Model (LLM) technology that powers ChatGPT and related products. This course covers the full training stack used to develop these models, mental models for thinking about their "psychology", and how to get the most out of them in practical applications. Taught by Andrej Karpathy, a founding member of OpenAI and former Sr. Director of AI at Tesla.

Instructor

Andrej Karpathy

AI researcher, educator, and former Sr. Director of AI at Tesla. Known for his influential work in deep learning and neural networks, and for his educational content, including the popular "Neural Networks: Zero to Hero" course series.

Course details

Duration

3 hours 21 minutes

Format

Video

Certificate

Not included

Pricing

Free

What you'll learn

Understand how LLMs are pretrained on internet data

Learn about tokenization and neural network architecture

Explore GPT and Llama model internals

Understand post-training and supervised finetuning

Prerequisites

Basic understanding of programming concepts

Familiarity with Python (helpful but not required)

Interest in AI and machine learning