edspdf-mupdf provides a MuPdf-based PDF parser component for EDS-PDF
Beware, MuPdf is AGPL-licensed: edspdf-mupdf is therefore also AGPL-licensed, and any model depending on this component must be too.
Install the library with pip:
$ pip install edspdf-mupdf
from edspdf import Pipeline
model = Pipeline()
model.add_pipe('mupdf-extractor')
model(pdf_bytes)
Visit the documentation for more information!
We would like to thank Assistance Publique – Hôpitaux de Paris and AP-HP Foundation for funding this project.