Black Press Automation - Automating Archival Processing with LLMs
Black Press Automation - Automating Archival Processing with LLMs
Join our research team! Recruiting volunteer/paid positions for interested undergraduate students.
Developing a pipeline to process newspaper images from microfilm reel by page segmentation and metadata extraction using masking and segmentation models. The system will be able to crop pages out, detect the publication name, date of issue and page numbers and apply color presets and batch OCR on them following FADGI standard. The pdf of every single issue will be uploaded to cloud for final storage with searchability features.
2024-Present