Keuro Lab

OribAI

OribAI
AI & NLP

Technological Sovereignty for African Languages: A 14B-parameter LLM fine-tuned on Hausa and Yoruba.

ClientInternal Research
Date2025
ServiceAfrican Language AI
CategoryAI & NLP

The Challenge

The vast majority of large language models are trained on English and Western-centric data. Hausa and Yoruba — spoken by over 150 million people combined — remain severely underrepresented, leaving hundreds of millions without AI systems that understand their language, culture, or context.

Our Solution

OribAI is an instruction-tuned 14B language model built on Qwen2.5-14B-Instruct and fine-tuned on 27,498 curated Hausa and Yoruba conversational pairs using LoRA adapters (r=32, α=64) via Unsloth + TRL. A 4-bit GGUF export enables local inference on consumer hardware with ~10 GB RAM via llama.cpp or Ollama — no cloud required.

Key Impact

  • First high-quality instruction-tuned LLM for Hausa and Yoruba
  • Runs offline on consumer hardware — no GPU or cloud dependency
  • Open-sourced on Hugging Face for the global research community
  • Preserves cultural nuance and linguistic integrity at scale

Have a similar project in mind?

Let's collaborate to build resilient, verifiable infrastructure for your community.

Visit WebsiteStart a Project