Abstract
A Python tool that vets the OCR text of batches of PDF files for sensitive information, such as Social Security numbers, addresses, and phone numbers, using regular expressions. It outputs CSV files that list the pertinent filenames for reference, without reproducing the sensitive information itself.
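
A minimal sketch of this workflow is given here, assuming the OCR text of each PDF has already been extracted into a string. The regular expressions (US-style formats), the function names `find_categories` and `write_report`, and the two-column CSV layout are illustrative assumptions, not the tool's actual implementation.

```python
import csv
import io
import re

# Illustrative patterns only (assumed US-style formats); a real tool
# would likely need broader and more carefully tuned expressions.
PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "phone": re.compile(
        r"(?<!\d)(?:\(\d{3}\)\s*|\d{3}[-.\s])\d{3}[-.\s]\d{4}(?!\d)"
    ),
    "address": re.compile(
        r"\b\d{1,5}\s+(?:[A-Za-z]+\s+){1,3}"
        r"(?:Street|St|Avenue|Ave|Road|Rd|Boulevard|Blvd|Drive|Dr|Lane|Ln)\b",
        re.IGNORECASE,
    ),
}


def find_categories(text):
    """Return the sorted names of the patterns that match the OCR text."""
    return sorted(
        name for name, pattern in PATTERNS.items() if pattern.search(text)
    )


def write_report(ocr_texts, out):
    """Write one CSV row per flagged file: filename and matched categories.

    Only the category names are written, never the matched text, so the
    report does not reproduce the sensitive values.
    """
    writer = csv.writer(out)
    writer.writerow(["filename", "categories"])
    for filename, text in sorted(ocr_texts.items()):
        categories = find_categories(text)
        if categories:
            writer.writerow([filename, ";".join(categories)])


# Hypothetical OCR output keyed by filename.
sample = {
    "a.pdf": "SSN 123-45-6789, call (555) 123-4567",
    "b.pdf": "Nothing sensitive here.",
    "c.pdf": "Ship to 42 Elm Street",
}
buffer = io.StringIO()
write_report(sample, buffer)
print(buffer.getvalue())
```

Files with no matches are left out of the report, and flagged files are listed with the kinds of information found rather than the values themselves.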