PyPI Stats

Digital Preservation Python Packages

Python packages with the GitHub topic digital-preservation. Sorted by relevance, with stars and monthly downloads.
bagit-profiles
bagit-profile

A simple Python module for validating BagIt Profiles.

101K 12 6
bagit-profiles
ocrd-fork-bagit-profile

A simple Python module for validating BagIt Profiles.

11K 12 6
artefactual-labs
amclient

Archivematica API client module

7K 1 4
artefactual-labs
pygfried

Siegfried as a Python extension

4K 7 1
artefactual-labs
a3m

Lightweight Archivematica — 8 less than a11m.

2K 12 6
Irish-Film-Institute
ifiscripts

Scripts for processing moving image material in the Irish Film Institute/Irish Film Archive

2K 31 11
ffdev-info
jsonid

Identification of JSON (JSONL, YAML, and TOML) objects: JSONID

2K 10 0
GeiserX
wayback-archive

A comprehensive tool for downloading and archiving websites from the Wayback Machine

1K 8 3
tw4l
brunnhilde

Siegfried-based characterization tool for directories and disk images

998 92 11
gdamdam
iagitup

Archive GitHub, GitLab, Bitbucket & any git repo to the Internet Archive as portable bundles with rich metadata.

660 99 9
davidfstr
crystal-web

Downloads websites for long-term archival.

513 90 7
yaniv-golan
offlickr

Convert your Flickr export into a self-contained static site

403 1 1
wellcomecollection
wellcome-storage-service

A client for the Wellcome Storage Service

365 35 5
xy-liao
jp2forge

A comprehensive JPEG2000 processing tool with BnF compatibility

346 2 0
GeiserX
wayback-diff

Intelligent web page comparison tool with Wayback Machine support and visual regression testing

283 1 0
exponential-decay
demystify-digipres

Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes: to break the export into its components and store them in a SQLite database; to create additional columns that augment the output where useful; and to query the SQLite database, outputting results in a readable form for researchers and archivists in digital preservation departments at memory institutions. The tool finds duplicates, unidentified files, blacklisted objects, character encoding issues, and more.

267 33 5
ffdev-info
pronom-tools

Tools and an API for working with PRONOM releases

255 2 0
nks1990
pdf2pdfa

Converts PDF to PDF/A-1b, embeds fonts, sets sRGB profile, syncs metadata. CLI & Python library.

216 1 0
ross-spencer
sumfolder1

Checksums for folders.

197 9 0
ruarxive
wparc

WordPress API data and files archival command-line tool

183 10 1
exponential-decay
sqlitefid

Library and executable for converting format identification reports, such as those from DROID and Siegfried, to a SQLite database

94 4 0
kieranjol
bitc

Detailed documentation is available here: http://ifiscripts.readthedocs.io/en/latest/index.html

65 52 33
exponential-decay
pathlesstaken

Profile strings, e.g. file paths, for digital preservation considerations, such as characters that you want to preserve or characters that you don't.

64 0 0
Data from PyPI, GitHub, ClickHouse, and BigQuery