| 12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788 |
- FAQ
- ==========================
- 1. When using the command ``pip install magic-pdf[full]`` on newer versions of macOS, the error ``zsh: no matches found: magic-pdf[full]`` occurs.
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- On macOS, the default shell has switched from Bash to Z shell, which has
- special handling logic for certain types of string matching. This can
- lead to the “no matches found” error. You can try disabling the globbing
- feature in the command line and then run the installation command again.
- .. code:: bash
- setopt no_nomatch
- pip install magic-pdf[full]
- 2. Encountering the error ``pickle.UnpicklingError: invalid load key, 'v'.`` during use
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- This might be due to an incomplete download of the model file. You can
- try re-downloading the model file and then try again. Reference:
- https://github.com/opendatalab/MinerU/issues/143
- 3. Where should the model files be downloaded and how should the ``/models-dir`` configuration be set?
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- The path for the model files is configured in “magic-pdf.json”. just
- like:
- .. code:: json
- {
- "models-dir": "/tmp/models"
- }
- This path is an absolute path, not a relative path. You can obtain the
- absolute path in the models directory using the “pwd” command.
- Reference:
- https://github.com/opendatalab/MinerU/issues/155#issuecomment-2230216874
- 4. Encountered the error ``ImportError: libGL.so.1: cannot open shared object file: No such file or directory`` in Ubuntu 22.04 on WSL2
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- The ``libgl`` library is missing in Ubuntu 22.04 on WSL2. You can
- install the ``libgl`` library with the following command to resolve the
- issue:
- .. code:: bash
- sudo apt-get install libgl1-mesa-glx
- Reference: https://github.com/opendatalab/MinerU/issues/388
- 5. Encountered error ``ModuleNotFoundError: No module named 'fairscale'``
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- You need to uninstall the module and reinstall it:
- .. code:: bash
- pip uninstall fairscale
- pip install fairscale
- Reference: https://github.com/opendatalab/MinerU/issues/411
- 6. On some newer devices like the H100, the text parsed during OCR using CUDA acceleration is garbled.
- ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- The compatibility of cuda11 with new graphics cards is poor, and the
- CUDA version used by Paddle needs to be upgraded.
- .. code:: bash
- pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu123/
- Reference: https://github.com/opendatalab/MinerU/issues/558
- 7. On some Linux servers, the program immediately reports an error ``Illegal instruction (core dumped)``
- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
- This might be because the server's CPU does not support the AVX/AVX2
- instruction set, or the CPU itself supports it but has been disabled by
- the system administrator. You can try contacting the system
- administrator to remove the restriction or change to a different server.
- References: https://github.com/opendatalab/MinerU/issues/591 ,
- https://github.com/opendatalab/MinerU/issues/736
|