A Python asyncio wrapper for Tesseract-OCR.
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features