A 125M parameter decoder-only language model pretrained from scratch on 70B tokens across 11 Nigerian languages. Best performing model on these languages at AfricaNLP, ACL 2025.
<span title='2025-05-01 04:14:46 +0100 +0100'>May 1, 2025</span> · 0 min · 0 words · Damilola John