Intro to Large Language Models - Andrej Karpathy

Andrej Karpathy who has worked on AI projects at Google, Open AI and Tesla has the gift of explaining complex topics in a simple & accessible way. It is great that he shares his knowledge though his YouTube channel.

Large Language Models are the core technical component behind systems like ChatGPT, Claude, and Bard. In his recent 1-hour long talk "Intro to Large Language Models", he covers a wide range of LLM topics including:

  •  LLM Inference
  •  LLM Training
  •  LLM dreams
  •  How do they work?
  •  Finetuning into an Assistant
  •  Appendix: Comparisons, Labeling docs, RLHF, Synthetic data, Leaderboard
  •  LLM Scaling Laws
  •  Tool Use (Browser, Calculator, Interpreter, DALL-E)
  •  Multimodality (Vision, Audio)
  •  Thinking, System 1/2
  •  Self-improvement, LLM AlphaGo
  •  LLM Customization, GPTs store
  •  LLM OS
  •  LLM Security Intro
  •  Jailbreaks
  •  Prompt Injection
  •  Data poisoning
  •  LLM Security conclusions

He has also shared the slide deck (links in the YouTube video description).

Also see - A 15-day email course on Generative AI 

Comments