Join our research team! Recruiting volunteer/paid positions for interested undergraduate students.
This set of projects explored sentiment classification in linguistically complex and low-resource contexts, specifically targeting code-switched and African languages. One project focused on classifying sentiment in code-switched text using multilingual embeddings, leading to a publication at NATL and a national conference presentation. A follow-up study introduced a novel translation-based preprocessing approach to enhance transformer model performance on code-switched data, culminating in a publication at CCSCE 2023.
Additional research was conducted through participation in the AfriSenti-SemEval challenge, benchmarking transformer models across multiple African languages for sentiment analysis. This work produced competitive results and led to the development of a Language-Specific Model Fusion System that intelligently combines outputs from language-specific transformers. The system was detailed in a publication at SemEval-2023, contributing to the advancement of sentiment analysis in low-resource, multilingual settings.
Sentiment analysis across multiple African languages: A current benchmark
Zero-shot classification reveals potential positive sentiment bias in African languages translations
Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis
Baselining performance for multilingual codeswitching sentiment classification
2024-2025