minLLM is a minimal transformer-based language model implemented in PyTorch and in Keras (with a PyTorch backend). It features causal multi-head attention, feed-forward layers, and basic word-level tokenization, and it trains on the Tiny Shakespeare dataset.

To try it out, clone the repository (`git clone https://github.com/gustavz/minLLM.git`), install the dependencies (`pip install -r requirements.txt`), and run `python min_llm_keras.py`. The script downloads the data, trains the model with Weights & Biases integration and checkpointing, and generates sample text.
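As a copy-pasteable sequence (the `cd minLLM` step is assumed here; the other commands come straight from the steps above):

```bash
git clone https://github.com/gustavz/minLLM.git
cd minLLM
pip install -r requirements.txt
python min_llm_keras.py
```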
All model and training hyperparameters (e.g., `MAX_SEQ_LEN`, `EMBED_DIM`, `NUM_HEADS`, `NUM_LAYERS`, `BATCH_SIZE`, `EPOCHS`, `TEMPERATURE`) are configurable at the top of the script.
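For orientation, the configuration block at the top of `min_llm_keras.py` looks roughly like this; the constant names are the ones listed above, but the values shown are illustrative placeholders rather than the script's actual defaults:

```python
# Model and training hyperparameters (values are illustrative, not the repo's defaults).
MAX_SEQ_LEN = 128   # context window, in tokens
EMBED_DIM = 256     # width of token embeddings and transformer layers
NUM_HEADS = 8       # attention heads per layer (must divide EMBED_DIM)
NUM_LAYERS = 4      # number of stacked transformer blocks
BATCH_SIZE = 64     # sequences per training step
EPOCHS = 10         # passes over the Tiny Shakespeare corpus
TEMPERATURE = 0.8   # sampling temperature for text generation
```

Lower `TEMPERATURE` values make generation greedier and more repetitive; higher values make it more varied but less coherent.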
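Tokenization is deliberately basic: word-level, as noted in the description. A minimal sketch of that approach, assuming a locally downloaded copy of the corpus (the filename here is hypothetical, not the repo's actual code):

```python
# Word-level tokenization sketch (illustrative): split the corpus on
# whitespace and map each distinct word to an integer id.
text = open("tiny_shakespeare.txt").read()  # hypothetical filename
words = text.split()
vocab = sorted(set(words))
word_to_id = {word: i for i, word in enumerate(vocab)}
id_to_word = {i: word for word, i in word_to_id.items()}

def encode(s: str) -> list[int]:
    return [word_to_id[w] for w in s.split()]

def decode(ids: list[int]) -> str:
    return " ".join(id_to_word[i] for i in ids)
```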
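The model itself follows the standard decoder-only pattern named in the description: causal multi-head attention followed by a feed-forward layer, stacked `NUM_LAYERS` times. A minimal sketch of one such block in plain PyTorch (again illustrative, not the repository's actual classes):

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """One decoder block: causal multi-head self-attention + feed-forward.
    Illustrative sketch only, not minLLM's actual implementation."""

    def __init__(self, embed_dim: int, num_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(embed_dim)
        self.norm2 = nn.LayerNorm(embed_dim)
        self.ff = nn.Sequential(
            nn.Linear(embed_dim, 4 * embed_dim),
            nn.ReLU(),
            nn.Linear(4 * embed_dim, embed_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n = x.size(1)
        # Causal mask: True entries are blocked, so each position can only
        # attend to itself and earlier positions.
        mask = torch.triu(torch.ones(n, n, dtype=torch.bool, device=x.device), diagonal=1)
        attn_out, _ = self.attn(x, x, x, attn_mask=mask, need_weights=False)
        x = self.norm1(x + attn_out)
        x = self.norm2(x + self.ff(x))
        return x
```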
Licensed under the MIT License.