LLM tutorials, books and thoughts
LLM Tutorials and Courses
- llm-course - Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- UWaterloo CS 886 - Recent Advances on Foundation Models.
- CS25 - Transformers United
- ChatGPT Prompt Engineering
- Princeton: Understanding Large Language Models
- CS324 - Large Language Models
- State of GPT
- A Visual Guide to Mamba and State Space Models
- Let’s build GPT: from scratch, in code, spelled out.
- minbpe - Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
- femtoGPT - Pure Rust implementation of a minimal Generative Pretrained Transformer.
- NeurIPS 2022 - Foundational Robustness of Foundation Models
- ICML 2022 - Welcome to the “Big Model” Era: Techniques and Systems to Train and Serve Bigger Models
- GPT in 60 Lines of NumPy
LLM Books
- Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT, and other LLMs - Comes with a GitHub repository that showcases much of the functionality.
- Build a Large Language Model (From Scratch) - A guide to building your own working LLM.
- BUILD GPT: HOW AI WORKS - Explains how to code a Generative Pre-trained Transformer (GPT) from scratch.
Great thoughts about LLMs
- Why did all of the public reproduction of GPT-3 fail?
- A Stage Review of Instruction Tuning
- LLM Powered Autonomous Agents
- Why you should work on AI AGENTS!
- Google: “We Have No Moat, And Neither Does OpenAI”
- AI competition statement
- Prompt Engineering
- Noam Chomsky: The False Promise of ChatGPT
- Is ChatGPT 175 Billion Parameters? Technical Analysis
- The Next Generation Of Large Language Models
- Large Language Model Training in 2023
- How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources
- Open Pretrained Transformers
- Scaling, emergence, and reasoning in large language models