Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London collaboration to standardise text and table data extracted from full text publications. See Open Access publication at: https://doi.org/10.3389/fdgth.2022.788124.