Skip to content

Commit c7655d5

Browse files
committed
GPT-2 from scratch
1 parent 60a804a commit c7655d5

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

README.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
This repository contains my detailed implementations of Andrej Karpathy's [Neural Networks: Zero to Hero youtube](https://www.youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ) lecture series & exercises in Jupyter Notebook. The notebooks go into extreme details to ensure a proper, robust fundamental understanding of the deep learning concepts being covered. This lecture series covers neural networks and how to build them from scratch in code. The course starts with the basics of backpropagation, then proceeds to multi-layer perceptrons (MLPs), convolutional neural networks (CNNs), and finally builds up to modern deep neural networks like generative pre-trained transformers (GPTs). The course also introduces and covers diagnostic tools for understanding neural networks dynamics and performance. Finally, this course covers the basics of tokenization, byte-pair encoding (BPE), and how to build a tokenizer from scratch. The field of focus in this course is language modeling (LM) because not only are language models a superb place to learn deep learning but also most of the skills learned here are immediately transferable to other fields of deep learning such as computer vision (CV).
44

55
```python
6-
[NN] ---> [MLP] ---> [CNN] ---> [GPT] ---> [BPE]
6+
[NN] ---> [MLP] ---> [CNN] ---> [GPT] ---> [BPE] ---> [GPT-2]
77
```
88

99

@@ -17,6 +17,7 @@ This repository contains my detailed implementations of Andrej Karpathy's [Neura
1717
| Makemore 5 | [Wavenet](006_makemore_WaveNet/makemore_WaveNet.ipynb) | [Wavenet Exercises](006_makemore_WaveNet/WaveNet_Exercises.ipynb)
1818
| GPT | [GPT](007_GPT/gpt.ipynb) | [1, 2a](007_GPT/ex1-2a.ipynb), [2b](007_GPT/ex2b.ipynb), [3](007_GPT/ex3.ipynb)
1919
| minBPE | [minBPE](008_minBPE/minbpe.ipynb) | [minBPE Exercises](008_minBPE/minbpe-exercises.ipynb)
20+
| GPT-2 | [GPT-2](009_GPT-2/gpt-2.ipynb) | ---
2021

2122
Still Ongoing...
2223

@@ -28,6 +29,8 @@ Still Ongoing...
2829
- [makemore Repository](https://github.com/karpathy/makemore)
2930
- [GPT Repository](https://github.com/karpathy/ng-video-lecture)
3031
- [minBPE Repository](https://github.com/karpathy/minBPE)
32+
- [nanoGPT Repository](https://github.com/karpathy/nanoGPT)
33+
3134
## Acknowledgments
3235

3336
Thank you Andrej Karpathy for creating and sharing the **"Neural Networks: Zero to Hero"** tutorial lecture series on [Youtube](https://www.youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ) & [Github](https://github.com/karpathy/nn-zero-to-hero/tree/master). Your dedication and expertise has made this resource possible. Find more of Andrej's work on his [GitHub](https://github.com/karpathy), [Twitter](https://x.com/karpathy), [Website](https://karpathy.ai) and [AI native school, Eureka Labs](https://github.com/EurekaLabsAI).

0 commit comments

Comments
 (0)