4 recommendations
4

Introduction to Large Language Models

Andrej Karpathy
Andrej Karpathy
YouTube
YouTube

A comprehensive deep dive into Large Language Model (LLM) AI technology that powers ChatGPT and related products. This course covers the full training stack of how models are developed, mental models of how to think about their "psychology", and how to get the best use of them in practical applications. Taught by Andrej Karpathy, founding member at OpenAI and former Sr. Director of AI at Tesla.

AI

Instructor

Andrej Karpathy

Andrej Karpathy

AI researcher, educator, and former Director of AI at Tesla. Known for his influential work in deep learning and neural networks, and his educational content including the popular "Neural Networks: Zero to Hero" course series.

Course details

Duration

3 hours 21 minutes

Format

video

Certificate

Not included

Pricing

Free

What you'll learn

Understand how LLMs are pretrained on internet data

Learn about tokenization and neural network architecture

Explore GPT and Llama model internals

Understand post-training and supervised finetuning

Prerequisites

Basic understanding of programming concepts

Familiarity with Python (helpful but not required)

Interest in AI and machine learning

Curriculum

Intro into the growing LLM ecosystem

00:00:00

ChatGPT interaction under the hood

00:02:54

Basic LLM interactions examples

00:13:12

Be aware of the model you're using, pricing tiers

00:18:03

Thinking models and when to use them

00:22:54

Tool use: internet search

00:31:00

Tool use: deep research

00:42:04

File uploads, adding documents to context

00:50:57

Tool use: python interpreter, messiness of the ecosystem

00:59:00

ChatGPT Advanced Data Analysis, figures, plots

01:04:35

Claude Artifacts, apps, diagrams

01:09:00

Cursor: Composer, writing code

01:14:02

Audio (Speech) Input/Output

01:22:28

Advanced Voice Mode aka true audio inside the model

01:27:37

NotebookLM, podcast generation

01:37:09

Image input, OCR

01:40:20

Image output, DALL-E, Ideogram, etc.

01:47:02

Video input, point and talk on app

01:49:14

Video output, Sora, Veo 2, etc etc.

01:52:23

ChatGPT memory, custom instructions

01:53:29

Custom GPTs

01:58:38

Summary

02:06:30

Notice something missing?

Help us improve this course information for the community

Suggest an edit
Loading reviews...