Code-switched & African Language Sentiment Classification

Status: Completed | Funding: Howard University

This set of projects explored sentiment classification in linguistically complex, low-resource settings, targeting code-switched text and African languages. The first project classified sentiment in code-switched text using multilingual embeddings; it led to a publication at NATL and a presentation at a national conference. A follow-up study introduced a translation-based preprocessing step to improve transformer model performance on code-switched data and was published at CCSCE 2023.

Further research was carried out through the AfriSenti-SemEval challenge, benchmarking transformer models for sentiment analysis across multiple African languages. This work produced competitive results and led to a Language-Specific Model Fusion System that combines the outputs of language-specific transformers. The system is described in a SemEval-2023 publication and contributes to sentiment analysis in low-resource, multilingual settings.
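To make the idea of combining model outputs concrete, here is a minimal sketch of one common fusion strategy: weighted soft voting over per-model class probabilities. It is an illustration only, not the published system. The function name `fuse_predictions`, the label set, the model names, and the weights are all hypothetical, and the actual fusion method may differ.

```python
from typing import Dict, List

# Assumed three-way label set; the real task labels may be named differently.
LABELS = ["negative", "neutral", "positive"]


def fuse_predictions(
    model_probs: Dict[str, List[float]],
    weights: Dict[str, float],
) -> str:
    """Return the label with the highest weighted-average probability.

    model_probs maps a (hypothetical) model name to its class probabilities,
    ordered as in LABELS. weights maps the same names to non-negative weights.
    """
    total = sum(weights[name] for name in model_probs)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    fused = [0.0] * len(LABELS)
    for name, probs in model_probs.items():
        share = weights[name] / total  # normalize so the shares sum to 1
        for i, p in enumerate(probs):
            fused[i] += share * p

    best = max(range(len(LABELS)), key=lambda i: fused[i])
    return LABELS[best]


if __name__ == "__main__":
    # Illustrative numbers only, not results from the project.
    probs = {
        "language_specific_model": [0.2, 0.3, 0.5],
        "multilingual_model": [0.5, 0.3, 0.2],
    }
    weights = {"language_specific_model": 2.0, "multilingual_model": 1.0}
    print(fuse_predictions(probs, weights))
```

In this sketch, giving the language-specific model a larger weight lets its prediction dominate when the two models disagree. In practice such weights would typically be tuned on held-out data for each language.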

Contributors

Howard Prioleau

Dr. Saurav Aryal

Related Publications

Progress Timeline

2024-2025
