Dependents of pymultirole-plugins

79 dependents

Package	Description	Downloads/month
pyconverters-mistralocr	Convert PDF to structured text using MistralOCR	11K
pyprocessors-openai-completion	OpenAICompletion processor	9K
pyconverters-mineru	Convert PDF to structured text using MinerU	9K
pyprocessors-chunk-sentences	Sherpa sentence chunking processor	9K
pyannotators-acronyms	Annotator based on Facebook's Acronyms	8K
pyconverters-openai-vision	OpenAIVision converter	8K
pyprocessors-consolidate	Sherpa Consolidation processor	7K
pyconverters-inscriptis	Convert HTML to text using inscriptis	7K
pyformatters-xml-rf	Groupe RF XML formatter	6K
pyconverters-grobid	Convert PDF to structured text using Grobid	6K
pyannotators-duckling	Annotator based on Facebook Duckling	6K
pyconverters-pypowerpoint	Convert PPTX to text using python-pptx	6K
pyconverters-newsml	NewsML converter (AFP news)	6K
pyformatters-tabular	Tabular formatter for Sherpa	6K
pysegmenters-pysdb	Rule-based segmenter	5K
pyannotators-spacyner	Annotator based on spaCy NER	5K
pyformatters-afp-quality	Sherpa AFP Quality formatter	5K
pyprocessors-afp-entities	AFPEntities annotations coming from different annotators	5K
pyconverters-pyword	Convert DOCX to Markdown using [mammoth](https://github.com/mwilliamson/python-m...	5K
pysegmenters-blingfire	Segmenter based on BlingFire	5K
pyconverters-pubmedfetcher	Fetch and convert PubMed articles	5K
pysegmenters-syntok	syntok segmenter	5K
pysegmenters-rules-segmenter	Rule-based segmenter	5K
pyprocessors-tag2segment	Create segments from annotations	5K
pyprocessors-deepl	DeepL processor plugin for pymultirole	5K
pyprocessors-document-fingerprint	Sherpa Consolidation processor	5K
pyannotators-patterns	Annotator based on Presidio pattern recognizer	4K
pyconverters-cairn-xml	Cairn.info XML converter	4K
pyprocessors-pseudonimizer	Processor based on Presidio anonymizer	4K
pyconverters-whisperx	WhisperX converter for audio transcription with speaker diarization support	4K
pyconverters-openai-audio	OpenAIAudio converter	4K
pyconverters-pyexcel	Convert XLSX to a document with one segment per row	4K
pyprocessors-segment-renseignor	Create segments from annotations based on Renseignor document structure	4K
pyannotators-spacymatcher	SpacyMatcher annotator using the spaCy rule-matching engine	4K
pyprocessors-reconciliation	Sherpa reconciliation processor	4K
pyconverters-paddleocr	Convert PDF to structured text using PaddleOCR	4K
pyprocessors-categories-from-annotations	Sherpa transform annotations to categories processor	4K
pyprocessors-capitalizer	Replace document text with capitalized annotations	4K
pysegmenters-md-splitter	Markdown splitter segmenter	4K
pyprocessors-iptc-mapper	Sherpa IPTC category mapper	4K
pyprocessors-nameparser	Processor based on Nameparser	4K
pyprocessors-afp-keywords	Processor based on AFP keywords extraction	4K
pyprocessors-rf-consolidate	RFConsolidate annotations coming from different annotators	4K
pyprocessors-standoff2inline	Sherpa transform annotations to categories processor	4K
pyannotators-entityfishing	Annotator based on entity-fishing	3K
pyformatters-json	JSON formatter for Sherpa	3K
pyannotators-zeroshotclassifier	Annotator based on Huggingface transformers zero-shot classification pipeline	2K
pyprocessors-opennre	Processor based on Huggingface transformers zero-shot classification pipeline	2K
pyformatters-summarizer	Formatter based on Huggingface transformers summarization pipeline	2K
pyannotators-trankitner	Annotator based on Trankit NER	2K