[Feat] Add CLI support for OCR by simont2k · Pull Request #2058 · mindee/doctr

simont2k · 2026-05-05T13:11:22Z

This PR:

Add CLI to perform OCR
options:

--input_path INPUT_PATH
                      path to input image or PDF file (default: None)
--det_arch DET_ARCH   name of the detection architecture or the model itself to use (default: db_resnet50)
--reco_arch RECO_ARCH
                      name of the recognition architecture or the model itself to use (default: crnn_vgg16_bn)
--assume_straight_pages, --no-assume_straight_pages
                      assume only straight pages without rotated textual elements (default: True)
--straighten_pages    attempt to straighten skewed pages before analysis (default: False)
--preserve_aspect_ratio, --no-preserve_aspect_ratio
                      preserve aspect ratio when resizing pages (default: True)
--symmetric_pad       apply symmetric padding (default: False)
--det_bs DET_BS       batch size for detection (default: 2)
--reco_bs RECO_BS     batch size for recognition (default: 128)
--detect_orientation  automatically detect page orientation (default: False)
--detect_language     detect language of the text (default: False)
--output OUTPUT       path to output results in JSON format (default: results.json)

usage example: doctr-cli --input_path sample.png

Add CLI support for OCR

eb84ef5

simont2k mentioned this pull request May 5, 2026

[docTR] CLI support text2knowledge/docTR-Labeler#29

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feat] Add CLI support for OCR#2058

[Feat] Add CLI support for OCR#2058
simont2k wants to merge 1 commit intomindee:mainfrom
simont2k:feat/CLI-support

simont2k commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

simont2k commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant