PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
PaddlePaddle
paddleocr

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

2M 77K 10K
microsoft
mattersim

MatterSim: A deep learning atomistic model across elements, temperatures and pressures.

418K 536 80
opendatalab
mineru

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

282K 62K 5K
terrastackai
terratorch

A Python toolkit for fine-tuning Geospatial Foundation Models (GFMs).

132K 787 150
Future-House
fhaviary

A language agent gym with challenging scientific tasks

115K 260 32
opendatalab
magic-pdf

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

77K 62K 5K
bytedance
protenix

Toward High-Accuracy Open-Source Biomolecular Structure Prediction.

13K 2K 267
chaobrain
brainunit

Physical units and unit-aware mathematical system for general-purpose brain dynamics modeling.

7K 15 3
chaobrain
saiunit

Unit-aware Computations for AI-driven Scientific Computing.

6K 17 1
EvoScientist
evoscientist

🔬 Harness Vibe Research with Self-evolving AI Scientists

5K 3K 175
DLS5-Omics
multimolecule

Accelerate Molecular Biology Research with Machine Learning

4K 50 8
opendatalab
mineru-selfhosted-mcp

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

3K 62K 5K
TrinitroCat
buctoolkit

Batch-upscaled Catalysis Toolkit

2K 5 1
adosar
aidsorb

Python package for deep learning on molecular point clouds.

575 6 2
ryannduma
crystalyse-ai

Intelligent Scientific Agent for Materials Design

548 23 4
PaddlePaddle
fadoudou2

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

535 77K 10K
iowarp
iowarp-mcps

Bringing AI practically to science!

453 24 25
iowarp
clio-kit

Bringing AI practically to science!

424 24 25
IntelliGen-AI
intellifold

IntelliFold: A Controllable Foundation Model for General and Specialized Biomolecular Structure Prediction.

394 219 24
iowarp
iowarp-agent-toolkit

Agent Toolkit - MCP Servers, Clients, and Tools for AI Agents

307 24 25
x66ccff
psrn

[Nature Computational Science] ⚡️ PSE/PSRN: Fast and efficient symbolic expression discovery through parallelized symbolic enumeration. Evaluates millions of expressions simultaneously on GPU with automated subtree reuse.

303 22 3
LucaOne
lucagplm

The resources of LucaOne, including: the model code, training scripts, embedding inference code, and trained checkpoints.

281 361 34
PaddlePaddle
je-paddleocr

Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model; supports training and deployment across server, mobile, embedded, and IoT devices)

214 77K 10K
PaddlePaddle
langchain-paddleocr

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

192 77K 10K
Data from PyPI, GitHub, ClickHouse, and BigQuery