faq.rst 3.5 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788
  1. FAQ
  2. ==========================
  3. 1. When using the command ``pip install magic-pdf[full]`` on newer versions of macOS, the error ``zsh: no matches found: magic-pdf[full]`` occurs.
  4. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  5. On macOS, the default shell has switched from Bash to Z shell, which has
  6. special handling logic for certain types of string matching. This can
  7. lead to the “no matches found” error. You can try disabling the globbing
  8. feature in the command line and then run the installation command again.
  9. .. code:: bash
  10. setopt no_nomatch
  11. pip install magic-pdf[full]
  12. 2. Encountering the error ``pickle.UnpicklingError: invalid load key, 'v'.`` during use
  13. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  14. This might be due to an incomplete download of the model file. You can
  15. try re-downloading the model file and then try again. Reference:
  16. https://github.com/opendatalab/MinerU/issues/143
  17. 3. Where should the model files be downloaded and how should the ``/models-dir`` configuration be set?
  18. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  19. The path for the model files is configured in “magic-pdf.json”. just
  20. like:
  21. .. code:: json
  22. {
  23. "models-dir": "/tmp/models"
  24. }
  25. This path is an absolute path, not a relative path. You can obtain the
  26. absolute path in the models directory using the “pwd” command.
  27. Reference:
  28. https://github.com/opendatalab/MinerU/issues/155#issuecomment-2230216874
  29. 4. Encountered the error ``ImportError: libGL.so.1: cannot open shared object file: No such file or directory`` in Ubuntu 22.04 on WSL2
  30. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  31. The ``libgl`` library is missing in Ubuntu 22.04 on WSL2. You can
  32. install the ``libgl`` library with the following command to resolve the
  33. issue:
  34. .. code:: bash
  35. sudo apt-get install libgl1-mesa-glx
  36. Reference: https://github.com/opendatalab/MinerU/issues/388
  37. 5. Encountered error ``ModuleNotFoundError: No module named 'fairscale'``
  38. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  39. You need to uninstall the module and reinstall it:
  40. .. code:: bash
  41. pip uninstall fairscale
  42. pip install fairscale
  43. Reference: https://github.com/opendatalab/MinerU/issues/411
  44. 6. On some newer devices like the H100, the text parsed during OCR using CUDA acceleration is garbled.
  45. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  46. The compatibility of cuda11 with new graphics cards is poor, and the
  47. CUDA version used by Paddle needs to be upgraded.
  48. .. code:: bash
  49. pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu123/
  50. Reference: https://github.com/opendatalab/MinerU/issues/558
  51. 7. On some Linux servers, the program immediately reports an error ``Illegal instruction (core dumped)``
  52. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  53. This might be because the server's CPU does not support the AVX/AVX2
  54. instruction set, or the CPU itself supports it but has been disabled by
  55. the system administrator. You can try contacting the system
  56. administrator to remove the restriction or change to a different server.
  57. References: https://github.com/opendatalab/MinerU/issues/591 ,
  58. https://github.com/opendatalab/MinerU/issues/736