πŸͺ„ Automatically convert a PDF into a fillable form.

πŸ’» Hosted Models (detect.semanticdocs.org) | πŸ“„ CommonForms Paper | πŸ€— Dataset | 🦾 Models

Pipeline

FFDNet-L

FFDNet-L is the 25 million parameter object detector trained on the dataset from the paper CommonForms: A Large, Diverse Dataset for Form Field Detection. The model detects widgets from among three classes: TextBoxes, ChoiceButtons (checkboxes), and Signature fields.

Results

Model Text Choice Signature AP (↑)
FFDNet-S (1216px) 61.5 71.3 84.2 72.3
FFDNet-L (1216px) 71.4 78.1 93.5 81.0

Installation

The commonforms package can be installed with either uv or pip, feel free to choose your package manager flavor. The uv command:

uv pip install commonforms

The pip command:

pip install commonforms

Once it's installed, you should be able to run the CLI command on ~any PDF. Refer to the commonforms documentation for the latest information.

CLI

The simplest usage will run inference on your CPU using the default suggested settings:

commonforms <input.pdf> <output.pdf>
Input Output
Input PDF Output PDF

Command Line Arguments

Argument Type Default Description
input Path Required Path to the input PDF file
output Path Required Path to save the output PDF file
--model str FFDNet-L Model name (FFDNet-L/FFDNet-S) or path to custom .pt file
--keep-existing-fields flag False Keep existing form fields in the PDF
--use-signature-fields flag False Use signature fields instead of text fields for detected signatures
--device str cpu Device for inference (e.g., cpu, cuda, 0)
--image-size int 1600 Image size for inference
--confidence float 0.3 Confidence threshold for detection
--fast flag False If running on a CPU, you can trade off accuracy for speed and run in about half the time

CommonForms API

In addition to the CLI, you can use

from commonforms import prepare_form

prepare_form(
    "path/to/input.pdf",
    "path/to/output.pdf"
)

All of the above arguments are keyword arguments to the prepare_form function. E.g. if you want to prepare a with signature fields and keep existing fields at 1216 resolution, you would run:

from commonforms import prepare_form

prepare_form(
    "path/to/input.pdf",
    "path/to/output.pdf",
    keep_existing_fields=True,
    use_signature_fields=True,
    image_size=1216
)

References

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Dataset used to train jbarrow/FFDNet-L

Space using jbarrow/FFDNet-L 1