Publications
Research
Peer-reviewed papers, conference presentations, and technical reports.
01 202602 202403 2024
paper
Sawtone: A Universal Framework for Phonetic Similarity and Alignment Across Languages and Scripts
Lingua Posnaniensis, Vol. 67(1)
Introduces a cross-script phonetic alignment framework with modular language-specific adapters. Demonstrates 88% BLEU transliteration and 87–95% phonetic alignment accuracy across language/script pairs. Includes a case study on preprocessing Moroccan Arabic (Darija) for LLM training.
PhoneticsTransliterationCross-ScriptNLP
report
GenAI for Moroccan Darija: Challenges and Early Results
University of Navarra, Spain
Conference presentation at the 7th International Congress for Moroccan Arabic, discussing challenges and early results in applying generative AI to Moroccan Darija.
LLMMoroccan DarijaLow-Resource LanguagesNLP
report
Gherbal: A Multilingual Classifier for Low-Resource Languages
University Hassan II, Casablanca, Morocco
Conference presentation at TIM'24, introducing Gherbal — a multilingual classifier designed for low-resource languages.
NLPLow-Resource LanguagesCultural AI