PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
JohnSnowLabs
spark-nlp

State of the Art Natural Language Processing

1.1M 4K 743
deepset-ai
haystack-ai

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

763K 25K 3K
ThilinaRajapakse
simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

73K 4K 717
deepset-ai
farm-haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

67K 25K 3K
PaddlePaddle
paddlenlp

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

36K 13K 3K
SylphAI-Inc
adalflow

AdalFlow: The library to build & auto-optimize LLM applications.

22K 4K 370
nlpcloud
nlpcloud

NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, code generation, and more...

16K 86 8
PaddlePaddle
tool-helpers

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

10K 13K 3K
PaddlePaddle
fast-dataindex

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

8K 13K 3K
vectorlessflow
vectorless

Knowing by reasoning, not vectors. ⭐ Star this repo if you find it useful.

6K 29 2
nanonets
nanoindex

Agentic RAG Harness for long documents, Tree and Graph based reasoning. Cited answers down to the pixel

5K 49 5
SylphAI-Inc
lightrag

AdalFlow: The library to build & auto-optimize LLM applications.

5K 4K 370
deeppavlov
deeppavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

5K 7K 1K
allenai
ai2-scholar-qa

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

4K 277 57
thiswillbeyourgithub
wdoc

A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, oe any combination!)

3K 517 41
deepset-ai
farm

Framework for finetuning and evaluating transformer based language models

3K 2K 247
EricFillion
happytransformer

Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.

2K 546 69
Mohan-Zhang-u
mzutils

Mohan Zhang's toolkit

2K 104 9
texttron
tevatron

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

2K 735 129
thiswillbeyourgithub
doctoolsllm

Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc

2K 520 41
luozhouyang
transformers-keras

Transformer-based models implemented in tensorflow 2.x(Keras)

1K 76 13
PaddlePaddle
fast-tokenizer-python

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

1K 13K 3K
kiri-ai
kiri

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

1K 241 11
Ki6an
fastt5

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

1K 588 75
    • Data from PyPI, GitHub, ClickHouse, and BigQuery