A comprehensive deep dive into Large Language Model (LLM) AI technology that powers ChatGPT and related products. This course covers the full training stack of how models are developed, mental models of how to think about their "psychology", and how to get the best use of them in practical applications. Taught by Andrej Karpathy, founding member at OpenAI and former Sr. Director of AI at Tesla.

AI researcher, educator, and former Director of AI at Tesla. Known for his influential work in deep learning and neural networks, and his educational content including the popular "Neural Networks: Zero to Hero" course series.
3 hours 21 minutes
video
Not included
Free
Understand how LLMs are pretrained on internet data
Learn about tokenization and neural network architecture
Explore GPT and Llama model internals
Understand post-training and supervised finetuning
Basic understanding of programming concepts
Familiarity with Python (helpful but not required)
Interest in AI and machine learning
Notice something missing?
Help us improve this course information for the community