Skip to content

/usr/bin/python: No module named olmocr.pipeline when using Docker Β #231

@mjgaughan

Description

@mjgaughan

πŸ› Describe the bug

Hello! Thank you for your work on olmocr, it's really exciting!

I am currently trying to use OlmoCR through running the Docker image through Singularity. I've pulled and built the SIF file from your container on Docker Hub but I'm running into an issue once I'm inside the image.

Checking through pip list it seems as though olmocr v.0.1.71 is installed. However, when checking through pip freeze it seems as though it's referencing an editable install of olmocr @ file:///root. When I try to run python -m olmocr.pipeline I get the error that /usr/bin/python: No module named olmocr.pipeline When I try to run python -m olmocr it recognizes the package as it returns the error /usr/bin/python: No module named olmocr.__main__; 'olmocr' is a package and cannot be directly executed

I'm not sure if this is an issue with how I'm running the Docker image through the SIF file or if I've messed up the install somehow but I would appreciate any help with this! Thank you.

Versions

Python 3.11.12
aiohappyeyeballs==2.6.1
aiohttp==3.12.4
aiosignal==1.3.2
annotated-types==0.7.0
anthropic==0.52.1
anyio==4.9.0
asttokens==3.0.0
attrs==25.3.0
beaker-py==2.4.2
beautifulsoup4==4.13.4
bleach==6.2.0
blinker==1.9.0
boto3==1.38.26
botocore==1.38.26
cached_path==1.7.3
cachetools==5.5.2
certifi==2025.4.26
cffi==1.17.1
chardet==3.0.4
charset-normalizer==3.4.2
click==8.2.1
cloudpickle==3.1.1
compressed-tensors==0.8.0
cryptography==45.0.3
cuda-bindings==12.9.0
cuda-python==12.9.0
datasets==3.6.0
dbus-python==1.2.16
decorator==5.2.1
decord==0.6.0
defusedxml==0.7.1
Deprecated==1.2.18
dill==0.3.8
diskcache==5.6.3
distro==1.9.0
distro-info==0.23+ubuntu1.1
einops==0.8.1
eval_type_backport==0.2.2
executing==2.2.0
fastapi==0.115.12
fastjsonschema==2.21.1
filelock==3.18.0
flashinfer==0.1.6+cu124torch2.4
Flask==3.1.1
frozenlist==1.6.0
fsspec==2025.3.0
ftfy==6.3.1
fuzzysearch==0.8.0
gguf==0.10.0
google-api-core==2.24.2
google-auth==2.40.2
google-cloud-core==2.4.3
google-cloud-storage==2.19.0
google-crc32c==1.7.1
google-genai==1.17.0
google-resumable-media==2.7.2
googleapis-common-protos==1.70.0
greenlet==3.2.2
grpcio==1.71.0
h11==0.16.0
hf-xet==1.1.2
hf_transfer==0.1.9
httpcore==1.0.9
httptools==0.6.4
httpx==0.28.1
huggingface-hub==0.32.3
idna==3.10
img2pdf==0.6.1
importlib_metadata==8.7.0
interegular==0.3.3
ipython==9.2.0
ipython_pygments_lexers==1.1.1
itsdangerous==2.2.0
jedi==0.19.2
Jinja2==3.1.6
jiter==0.10.0
jmespath==1.0.1
jsonschema==4.24.0
jsonschema-specifications==2025.4.1
jupyter_client==8.6.3
jupyter_core==5.8.1
jupyterlab_pygments==0.3.0
lark==1.2.2
lingua-language-detector==2.1.1
litellm==1.71.2
llvmlite==0.44.0
lm-format-enforcer==0.10.11
lxml==5.4.0
markdown-it-py==3.0.0
markdown2==2.5.3
MarkupSafe==3.0.2
matplotlib-inline==0.1.7
mdurl==0.1.2
mistral_common==1.5.6
mistralai==1.8.1
mistune==3.1.3
modelscope==1.26.0
mpmath==1.3.0
msgpack==1.1.0
msgspec==0.19.0
multidict==6.4.4
multiprocess==0.70.16
nbclient==0.10.2
nbconvert==7.16.6
nbformat==5.10.4
nest-asyncio==1.6.0
networkx==3.5
ninja==1.11.1.4
numba==0.61.2
numpy==1.26.4
nvidia-cublas-cu12==12.4.5.8
nvidia-cuda-cupti-cu12==12.4.127
nvidia-cuda-nvrtc-cu12==12.4.127
nvidia-cuda-runtime-cu12==12.4.127
nvidia-cudnn-cu12==9.1.0.70
nvidia-cufft-cu12==11.2.1.3
nvidia-cufile-cu12==1.11.1.6
nvidia-curand-cu12==10.3.5.147
nvidia-cusolver-cu12==11.6.1.9
nvidia-cusparse-cu12==12.3.1.170
nvidia-cusparselt-cu12==0.6.3
nvidia-ml-py==12.575.51
nvidia-nccl-cu12==2.21.5
nvidia-nvjitlink-cu12==12.4.127
nvidia-nvtx-cu12==12.4.127
olmocr @ file:///root
openai==1.82.1
opencv-python-headless==4.11.0.86
orjson==3.10.18
outlines==0.0.46
packaging==25.0
pandas==2.2.3
pandocfilters==1.5.1
parso==0.8.4
partial-json-parser==0.2.1.1.post5
pexpect==4.9.0
pikepdf==9.8.1
pillow==11.2.1
platformdirs==4.3.8
playwright==1.52.0
prometheus-fastapi-instrumentator==7.1.0
prometheus_client==0.22.0
prompt_toolkit==3.0.51
propcache==0.3.1
proto-plus==1.26.1
protobuf==5.29.5
psutil==7.0.0
ptyprocess==0.7.0
pure_eval==0.2.3
py-cpuinfo==9.0.0
pyairports==2.1.1
pyarrow==20.0.0
pyasn1==0.6.1
pyasn1_modules==0.4.2
pycountry==24.6.1
pycparser==2.22
pydantic==2.11.5
pydantic_core==2.33.2
pyee==13.0.0
Pygments==2.19.1
PyGObject==3.36.0
pypdf==5.5.0
pypdfium2==4.30.1
python-apt==2.0.1+ubuntu0.20.4.1
python-dateutil==2.9.0.post0
python-debian==0.1.36+ubuntu1.1
python-dotenv==1.1.0
python-magic==0.4.27
python-multipart==0.0.20
pytz==2025.2
PyYAML==6.0.2
pyzmq==26.4.0
RapidFuzz==3.13.0
ray==2.46.0
referencing==0.36.2
regex==2024.11.6
requests==2.32.3
requests-unixsocket==0.2.0
rich==13.9.4
rpds-py==0.25.1
rsa==4.9.1
s3transfer==0.13.0
safetensors==0.5.3
sentencepiece==0.2.0
sequence_align==0.3.0
setproctitle==1.3.6
sgl-kernel==0.0.3.post1
sglang==0.4.2
six==1.17.0
smart-open==7.1.0
sniffio==1.3.1
soupsieve==2.7
stack-data==0.6.3
starlette==0.46.2
sympy==1.13.1
syntok==1.4.4
tiktoken==0.9.0
tinycss2==1.4.0
tinyhost==0.4.18
tokenizers==0.20.3
torch==2.5.1
torchao==0.11.0
torchvision==0.20.1
tornado==6.5.1
tqdm==4.67.1
traitlets==5.14.3
transformers==4.46.2
triton==3.1.0
typing-inspection==0.4.1
typing_extensions==4.13.2
tzdata==2025.2
ubuntu-pro-client==8001
unattended-upgrades==0.1
urllib3==2.4.0
uvicorn==0.34.2
uvloop==0.21.0
vllm==0.6.4.post1
watchfiles==1.0.5
wcwidth==0.2.13
webencodings==0.5.1
websockets==15.0.1
Werkzeug==3.1.3
wrapt==1.17.2
xformers==0.0.28.post3
xgrammar==0.1.19
xxhash==3.5.0
yarl==1.20.0
zipp==3.22.0
zstandard==0.23.0

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions