Join our research team! Recruiting volunteer/paid positions for interested undergraduate students.
Our work presents a novel three-step GPT pipeline designed to improve the translation of named entities. The process involves three phases: extracting entities from the source text, refining their translations using Wikidata context through GPT, and integrating these pre-processed names into the final translation. This approach addresses common mistranslation issues—such as confusing “Kwame Nkrumah” with “forest”—and results in more accurate, culturally sensitive translations. Evaluations across multiple languages demonstrate consistent performance gains over baseline models, particularly in languages with complex scripts or low resource availability.
2024-2025