ByteLlama is a tiny Llama 3.2 model (~101M parameters) that uses octet tokenization, i.e. every byte of the UTF-8 input is one token. Its primary purpose is to serve as a modernized alternative to ByT5 (in particular the small version) in my vision experiments, but it should work for any application where a small LM without the drawbacks of subword tokenization is needed.
ByteLlama's hyperparameters were shamelessly pilfered from SmolLM-135M. The difference in parameter count (~34M) comes from the drastically smaller embeddings: octet tokenization needs a vocabulary of only a few hundred entries (256 bytes plus special tokens) instead of tens of thousands of subwords.
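For illustration, here is a minimal sketch of the idea in Python (the function names are hypothetical, not the repository's actual tokenizer API): every UTF-8 byte of the input maps to one token id in the range 0-255.

def octet_encode(text: str) -> list[int]:
    # Each UTF-8 byte becomes one token id in [0, 255].
    return list(text.encode("utf-8"))

def octet_decode(ids: list[int]) -> str:
    # Reassemble the bytes and decode; replace any invalid sequences.
    return bytes(ids).decode("utf-8", errors="replace")

ids = octet_encode("Bite me!")
print(ids)                # [66, 105, 116, 101, 32, 109, 101, 33]
print(octet_decode(ids))  # round-trips to "Bite me!"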
This repository contains configuration and tokenization code to train a ByteLlama using the torchtune framework.
First install the package from the repository:
$ pip install .
Then run the script that creates the randomly initialized weights:
$ bytellama ~/bitey_llamas/model.pt
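To sanity-check the freshly created checkpoint, you can load it back and count the parameters. This is a sketch assuming model.pt is a plain PyTorch state dict; adjust if the file nests the weights under another key:

from pathlib import Path
import torch

# torch.load does not expand "~", so resolve the path first.
path = Path("~/bitey_llamas/model.pt").expanduser()
state_dict = torch.load(path, map_location="cpu")

n_params = sum(t.numel() for t in state_dict.values() if isinstance(t, torch.Tensor))
print(f"{len(state_dict)} tensors, {n_params / 1e6:.1f}M parameters")  # expect roughly 101M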
To start training on 4 GPUs on a single node with torchtune (this includes automatic download of the dataset), run from the root directory of the git repository:
tune run --nproc_per_node 4 full_finetune_distributed \
--config configs/bytellama.yaml \
checkpointer.checkpoint_dir=~/bitey_llamas \
checkpointer.output_dir=~/bitey_llamas
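Here checkpointer.checkpoint_dir tells torchtune where to find the initial model.pt, and checkpointer.output_dir is where the trained checkpoints will be written; the example keeps both in the same directory.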
ByteLlama is tiny. It fits a batch size of 32 on an A40 GPU when using bf16 precision. The batch size can be adjusted to your resources like so:
tune run --nproc_per_node 4 full_finetune_distributed \
--config configs/bytellama.yaml \
checkpointer.checkpoint_dir=~/bitey_llamas \
checkpointer.output_dir=~/bitey_llamas \
batch_size=8
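Note that batch_size is the per-device batch size, so with 4 processes the command above yields an effective global batch of 4 × 8 = 32 samples per step.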