Fast and memory-efficient Python PDF parser based on the xpdf sources
Using an LLM to parse PDFs and produce better chunks for retrieval
Yet another PDF text and table extractor