Intro to Large Language Models

Intro to Large Language Models - Andrej Karpathy

Andrej Karpathy who has worked on AI projects at Google, Open AI and Tesla has the gift of explaining complex topics in a simple & accessible way. It is great that he shares his knowledge though his YouTube channel.

Large Language Models are the core technical component behind systems like ChatGPT, Claude, and Bard. In his recent 1-hour long talk "Intro to Large Language Models", he covers a wide range of LLM topics including:

LLM Inference
LLM Training
LLM dreams
How do they work?
Finetuning into an Assistant
Appendix: Comparisons, Labeling docs, RLHF, Synthetic data, Leaderboard
LLM Scaling Laws
Tool Use (Browser, Calculator, Interpreter, DALL-E)
Multimodality (Vision, Audio)
Thinking, System 1/2
Self-improvement, LLM AlphaGo
LLM Customization, GPTs store
LLM OS
LLM Security Intro
Jailbreaks
Prompt Injection
Data poisoning
LLM Security conclusions

He has also shared the slide deck (links in the YouTube video description).

Also see - A 15-day email course on Generative AI

Search This Blog

Tech Tips, Tricks & Trivia

Intro to Large Language Models - Andrej Karpathy

Comments

Post a Comment

Popular posts from this blog

HOW TO dynamically generate a Word document with custom header & footer

HOW TO add a header or footer to a dynamically generated Word document

The Mercurial Grok AI Assistant Understands & Speaks Indian Languages