s3-ocr fetch command to fetch OCR results #7

@simonw

s3-ocr fetch name-of-bucket path/to/file.pdf

This will download the relevant OCR result files to the current directory.

It will first look up the job ID associated with the file, then save textract-output/a806e67e504fc15f8b9d61d9e8e99f2b329a93410d1859a6fb4c7ba37a48314e/1 as a806e67e504fc15f8b9d61d9e8e99f2b329a93410d1859a6fb4c7ba37a48314e-1.json (and -2.json and so on).
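A rough sketch of the download step, to make the naming scheme concrete. It assumes the job ID has already been looked up (how that lookup works is not covered here) and that `s3` is something like `boto3.client("s3")`. The function names `local_filename` and `fetch_results` are made up for illustration and are not part of s3-ocr.

```python
from pathlib import Path

# Prefix taken from the example path in this issue.
PREFIX = "textract-output/"


def local_filename(key: str) -> str:
    """Map 'textract-output/<job_id>/<n>' to '<job_id>-<n>.json'."""
    job_id, part = key[len(PREFIX):].split("/", 1)
    return f"{job_id}-{part}.json"


def fetch_results(s3, bucket: str, job_id: str, directory: str = ".") -> list:
    """Download every numbered result object for job_id into directory.

    `s3` is assumed to behave like boto3.client("s3").
    """
    saved = []
    paginator = s3.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=f"{PREFIX}{job_id}/"):
        for obj in page.get("Contents", []):
            key = obj["Key"]
            # Skip anything under the prefix that is not a numbered part.
            if not key.rsplit("/", 1)[-1].isdigit():
                continue
            target = Path(directory) / local_filename(key)
            s3.download_file(bucket, key, str(target))
            saved.append(str(target))
    return saved
```

Passing the client in as an argument keeps the naming logic easy to test without touching S3.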

A -c or --combine output.json option will combine them into a single file on disk instead.
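One possible shape for the combine step, as a sketch only. It assumes each downloaded part is a JSON object with a top-level "Blocks" list and that the combined file should be a single object holding all of those blocks in part order. The exact layout of the combined file is not decided in this issue, and `combine_parts` is a hypothetical helper name.

```python
import json
from pathlib import Path


def combine_parts(part_paths, output_path):
    """Merge the 'Blocks' lists of several result files into one JSON file.

    Returns the total number of blocks written.
    """
    def part_number(path):
        # '<job_id>-<n>.json' -> n, so that part 10 sorts after part 2.
        return int(Path(path).stem.rsplit("-", 1)[1])

    blocks = []
    for path in sorted(part_paths, key=part_number):
        with open(path) as f:
            # Assumption: each part holds its blocks under a "Blocks" key.
            blocks.extend(json.load(f).get("Blocks", []))

    with open(output_path, "w") as f:
        json.dump({"Blocks": blocks}, f)
    return len(blocks)
```

Sorting on the numeric suffix rather than the filename string keeps the parts in order once there are ten or more of them.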

Labels: enhancement (New feature or request)
