Training a 125M-parameter decoder-only model on 70B tokens of Nigerian languages.

Demo

Article