Omneity Labs Omneity
AboutResearchProjectsArticlesTeamContact

Publications

Research

Peer-reviewed papers, conference presentations, and technical reports.

01
paper

Sawtone: A Universal Framework for Phonetic Similarity and Alignment Across Languages and Scripts

Lingua Posnaniensis, Vol. 67(1)

Introduces a cross-script phonetic alignment framework with modular language-specific adapters. Demonstrates 88% BLEU transliteration and 87–95% phonetic alignment accuracy across language/script pairs. Includes a case study on preprocessing Moroccan Arabic (Darija) for LLM training.

PhoneticsTransliterationCross-ScriptNLP
2026
02
report

GenAI for Moroccan Darija: Challenges and Early Results

University of Navarra, Spain

Conference presentation at the 7th International Congress for Moroccan Arabic, discussing challenges and early results in applying generative AI to Moroccan Darija.

LLMMoroccan DarijaLow-Resource LanguagesNLP
2024
03
report

Gherbal: A Multilingual Classifier for Low-Resource Languages

University Hassan II, Casablanca, Morocco

Conference presentation at TIM'24, introducing Gherbal — a multilingual classifier designed for low-resource languages.

NLPLow-Resource LanguagesCultural AI
2024
Omneity Labs Omneity Labs

An independent GenAI R&D lab building AI for the languages and cultures the industry ignores – starting with North Africa.

Navigate

  • About
  • Research
  • Projects
  • Articles
  • Team

Get in touch

  • Contact form
  • info@omneitylabs.com

Locations

Berlin, Germany

Salé, Morocco

© 2026 Omneity Labs. All rights reserved.