install.rst 4.5 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139
  1. Install
  2. ===============================================================
  3. If you encounter any installation issues, please first consult the :doc:`../../additional_notes/faq`.
  4. If the parsing results are not as expected, refer to the :doc:`../../additional_notes/known_issues`.
  5. Also you can try `online demo <https://www.modelscope.cn/studios/OpenDataLab/MinerU>`_ without installation.
  6. .. admonition:: Warning
  7. :class: tip
  8. **Pre-installation Notice—Hardware and Software Environment Support**
  9. To ensure the stability and reliability of the project, we only optimize
  10. and test for specific hardware and software environments during
  11. development. This ensures that users deploying and running the project
  12. on recommended system configurations will get the best performance with
  13. the fewest compatibility issues.
  14. By focusing resources on the mainline environment, our team can more
  15. efficiently resolve potential bugs and develop new features.
  16. In non-mainline environments, due to the diversity of hardware and
  17. software configurations, as well as third-party dependency compatibility
  18. issues, we cannot guarantee 100% project availability. Therefore, for
  19. users who wish to use this project in non-recommended environments, we
  20. suggest carefully reading the documentation and FAQ first. Most issues
  21. already have corresponding solutions in the FAQ. We also encourage
  22. community feedback to help us gradually expand support.
  23. .. raw:: html
  24. <style>
  25. table, th, td {
  26. border: 1px solid black;
  27. border-collapse: collapse;
  28. }
  29. </style>
  30. <table>
  31. <tr>
  32. <td colspan="3" rowspan="2">Operating System</td>
  33. </tr>
  34. <tr>
  35. <td>Ubuntu 22.04 LTS</td>
  36. <td>Windows 10 / 11</td>
  37. <td>macOS 11+</td>
  38. </tr>
  39. <tr>
  40. <td colspan="3">CPU</td>
  41. <td>x86_64(unsupported ARM Linux)</td>
  42. <td>x86_64(unsupported ARM Windows)</td>
  43. <td>x86_64 / arm64</td>
  44. </tr>
  45. <tr>
  46. <td colspan="3">Memory</td>
  47. <td colspan="3">16GB or more, recommended 32GB+</td>
  48. </tr>
  49. <tr>
  50. <td colspan="3">Python Version</td>
  51. <td colspan="3">3.10(Please make sure to create a Python 3.10 virtual environment using conda)</td>
  52. </tr>
  53. <tr>
  54. <td colspan="3">Nvidia Driver Version</td>
  55. <td>latest (Proprietary Driver)</td>
  56. <td>latest</td>
  57. <td>None</td>
  58. </tr>
  59. <tr>
  60. <td colspan="3">CUDA Environment</td>
  61. <td>Automatic installation [12.1 (pytorch) + 11.8 (paddle)]</td>
  62. <td>11.8 (manual installation) + cuDNN v8.7.0 (manual installation)</td>
  63. <td>None</td>
  64. </tr>
  65. <tr>
  66. <td rowspan="2">GPU Hardware Support List</td>
  67. <td colspan="2">Minimum Requirement 8G+ VRAM</td>
  68. <td colspan="2">3060ti/3070/4060<br>
  69. 8G VRAM enables layout, formula recognition acceleration and OCR acceleration</td>
  70. <td rowspan="2">None</td>
  71. </tr>
  72. <tr>
  73. <td colspan="2">Recommended Configuration 10G+ VRAM</td>
  74. <td colspan="2">3080/3080ti/3090/3090ti/4070/4070ti/4070tisuper/4080/4090<br>
  75. 10G VRAM or more can enable layout, formula recognition, OCR acceleration and table recognition acceleration simultaneously
  76. </td>
  77. </tr>
  78. </table>
  79. Create an environment
  80. ---------------------------
  81. .. code-block:: shell
  82. conda create -n MinerU python=3.10
  83. conda activate MinerU
  84. pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
  85. Download model weight files
  86. ------------------------------
  87. .. code-block:: shell
  88. pip install huggingface_hub
  89. wget https://github.com/opendatalab/MinerU/raw/master/scripts/download_models_hf.py -O download_models_hf.py
  90. python download_models_hf.py
  91. Install LibreOffice[Optional]
  92. ----------------------------------
  93. This section is required for handle **doc**, **docx**, **ppt**, **pptx** filetype, You can **skip** this section if no need for those filetype processing.
  94. Linux/Macos Platform
  95. """"""""""""""""""""""
  96. .. code::
  97. apt-get/yum/brew install libreoffice
  98. Windows Platform
  99. """"""""""""""""""""
  100. .. code::
  101. install libreoffice
  102. append "install_dir\LibreOffice\program" to ENVIRONMENT PATH
  103. .. tip::
  104. The MinerU is installed, Check out :doc:`../usage/command_line` to convert your first pdf **or** reading the following sections for more details about install