Logo image
About VERSO Report an Issue
Sign in
ocr_scanner
Code

ocr_scanner

Andrew Weymouth
Autumn 2025
Appears in  Data Repository

Abstract

optical character recognition accessibility Auditing Database Management

Python tool that scans specific folders and identifies all of the PDF files lacking a layer of optical character recognition. Identified PDF files are generated in a CSV file alongside their parent folder. Created to have U of I digital collections meet accessibility standards for patrons using screen readers.

url
GitHub RepoView
ocr_scanner

Metrics

1 Record Views

Details

Logo image