Gherbal: A Multilingual Classifier for Low-Resource Languages
January 1, 2024 · University Hassan II, Casablanca, Morocco
Omar Kamali
Abstract
Conference presentation at TIM'24, introducing Gherbal — a multilingual classifier designed for low-resource languages.
Conference Presentation
Presented at TIM'24, University Hassan II, Casablanca, Morocco, 2024.
This talk introduces Gherbal, a multilingual classifier built specifically for low-resource languages, addressing the gap in NLP tooling for underrepresented language communities.
Citation
Kamali, O. (2024). Gherbal: A Multilingual Classifier for Low-Resource Languages. TIM'24, University Hassan II, Casablanca, Morocco.
NLP · Low-Resource Languages · Cultural AI
Related Research
Sawtone: A Universal Framework for Phonetic Similarity and Alignment Across Languages and Scripts
Introduces a cross-script phonetic alignment framework with modular language-specific adapters. Demonstrates 88% BLEU on transliteration and 87–95% phonetic alignment accuracy across language/script pairs. Includes a case study on preprocessing Moroccan Arabic (Darija) for LLM training.
GenAI for Moroccan Darija: Challenges and Early Results
Conference presentation at the 7th International Congress for Moroccan Arabic, discussing challenges and early results in applying generative AI to Moroccan Darija.