Micro GPT from Scratch

Introduction

This project implements a simplified version of the GPT (Generative Pre-trained Transformer) model from scratch in PyTorch. It can generate text from a given context. The project is meant for self-learning: the goal is to understand the inner workings of the GPT model.
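
To make "generate text from a given context" concrete, here is a minimal, framework-free sketch of greedy autoregressive decoding. Everything in it is an assumption made for illustration: the character-level vocabulary, the `encode`/`decode` helpers, the `toy_model` stand-in, and the `generate` signature are hypothetical and are not taken from this repository's code, which uses PyTorch and a trained model.

```python
# Toy character-level vocabulary (hypothetical; not the repository's tokenizer).
VOCAB = sorted(set("hello world"))
STOI = {ch: i for i, ch in enumerate(VOCAB)}
ITOS = {i: ch for ch, i in STOI.items()}


def encode(text):
    """Convert text to a list of token ids."""
    return [STOI[ch] for ch in text]


def decode(ids):
    """Convert a list of token ids back to text."""
    return "".join(ITOS[i] for i in ids)


def toy_model(ids):
    """Stand-in for a trained GPT: returns one score (logit) per vocabulary entry.

    It favours the token that follows the last input token in VOCAB order,
    so the behaviour is deterministic. A real model would compute these
    scores from learned weights.
    """
    target = (ids[-1] + 1) % len(VOCAB)
    return [1.0 if i == target else 0.0 for i in range(len(VOCAB))]


def generate(model, ids, max_new_tokens, context_size):
    """Greedy decoding: repeatedly append the highest-scoring next token."""
    ids = list(ids)
    for _ in range(max_new_tokens):
        window = ids[-context_size:]  # crop to the model's context length
        logits = model(window)
        next_id = max(range(len(logits)), key=logits.__getitem__)
        ids.append(next_id)
    return ids


context = encode("he")
generated = generate(toy_model, context, max_new_tokens=3, context_size=4)
print(decode(generated))
```

The loop is the part that carries over to a real model: encode the context, ask the model for next-token scores, pick a token, append it, and repeat. Sampling strategies such as temperature or top-k would replace the `max` call.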

Setup

To set up the project, follow these steps:

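Clone the repository

Create a virtual environment

python3 -m venv venv

Activate the virtual environment (Linux/macOS shown; on Windows run venv\Scripts\activate)

source venv/bin/activate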

Install the dependencies

pip install -r requirements.txt

Run the project

python train.py

Code Structure

The project consists of several Python scripts:

train.py: The main script, which trains the GPT model.
gpt.py: Implements the GPT model and its components, including the TransformerBlock and FeedForward classes.
utils.py: Utility functions for text generation, loss calculation, and text-to-token and token-to-text conversion.
mha.py: Implements the MultiHeadAttention class, which is used by the TransformerBlock class.
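
As a rough illustration of what an attention layer like the one in mha.py computes, the following is a framework-free sketch of single-head causal scaled dot-product attention. It is not the repository's implementation: the function names are hypothetical, it handles one head only, and it feeds the same vectors in as queries, keys, and values, whereas a real MultiHeadAttention layer would first apply learned linear projections and split the result across several heads.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def causal_attention(queries, keys, values):
    """Single-head scaled dot-product attention with a causal mask.

    Each argument is a list of per-token vectors of equal length.
    Token t may only attend to tokens 0..t.
    """
    d_k = len(keys[0])
    outputs, weights = [], []
    for t, q in enumerate(queries):
        # Score the query against the current and earlier tokens only;
        # skipping later tokens is what the causal mask achieves.
        scores = [
            sum(qi * ki for qi, ki in zip(q, keys[s])) / math.sqrt(d_k)
            for s in range(t + 1)
        ]
        w = softmax(scores)
        # Output is the attention-weighted sum of the visible value vectors.
        out = [
            sum(w[s] * values[s][j] for s in range(t + 1))
            for j in range(len(values[0]))
        ]
        # Pad with zeros so every weight row has one entry per token.
        weights.append(w + [0.0] * (len(queries) - t - 1))
        outputs.append(out)
    return outputs, weights


# Three toy token embeddings of dimension 2 (made-up values).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = causal_attention(x, x, x)
for row in weights:
    print([round(w, 3) for w in row])
```

Each printed row sums to 1 and is zero to the right of the diagonal, which shows that a token never attends to positions that come after it. In a Transformer block, this attention output would typically be followed by a feed-forward layer, with residual connections and normalization around both.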

Reference

https://github.com/rasbt/LLMs-from-scratch
