Horizen® is an LLM project that aims to help me train on LLMs and transformers.
Note: you need to have python version >= 3.10 . Execute
python3 --versionto check the version
To start using/developping Horizen®, start by setting up a python virtual environment:
python3 -m venv .venv
source .venv/bin/activateAnd then install the dependencies:
python3 -m pip install torch numpy regex datasets tiktokenor simply
python3 -m pip install -r requirements.txtNote: in the notebooks, don't forget to choose your virtual environment.
Quickly create a folder for logs :
mkdir logsHere are some usefull commands:
make train: executes the training script inscripts/bin, will use the hyperparameters specified, otherwise reverts to saved file insavesdirectory.make test: executes the testing script inscripts/bin, especially usefull for text generation.make plot: generates a PNG file containing the plots of the training. Don't forget to put the nave of the model inplot.pyfile inscripts/bin.