boost_with_cuda.rst 7.7 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301
  1. Boost With Cuda
  2. ================
  3. If your device supports CUDA and meets the GPU requirements of the
  4. mainline environment, you can use GPU acceleration. Please select the
  5. appropriate guide based on your system:
  6. - :ref:`ubuntu_22_04_lts_section`
  7. - :ref:`windows_10_or_11_section`
  8. .. _ubuntu_22_04_lts_section:
  9. Ubuntu 22.04 LTS
  10. -----------------
  11. 1. Check if NVIDIA Drivers Are Installed
  12. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  13. .. code:: sh
  14. nvidia-smi
  15. If you see information similar to the following, it means that the
  16. NVIDIA drivers are already installed, and you can skip Step 2.
  17. .. note::
  18. ``CUDA Version`` should be >= 12.1, If the displayed version number is less than 12.1, please upgrade the driver.
  19. .. code:: text
  20. +---------------------------------------------------------------------------------------+
  21. | NVIDIA-SMI 537.34 Driver Version: 537.34 CUDA Version: 12.2 |
  22. |-----------------------------------------+----------------------+----------------------+
  23. | GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
  24. | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
  25. | | | MIG M. |
  26. |=========================================+======================+======================|
  27. | 0 NVIDIA GeForce RTX 3060 Ti WDDM | 00000000:01:00.0 On | N/A |
  28. | 0% 51C P8 12W / 200W | 1489MiB / 8192MiB | 5% Default |
  29. | | | N/A |
  30. +-----------------------------------------+----------------------+----------------------+
  31. 2. Install the Driver
  32. ~~~~~~~~~~~~~~~~~~~~~
  33. If no driver is installed, use the following command:
  34. .. code:: sh
  35. sudo apt-get update
  36. sudo apt-get install nvidia-driver-545
  37. Install the proprietary driver and restart your computer after
  38. installation.
  39. .. code:: sh
  40. reboot
  41. 3. Install Anaconda
  42. ~~~~~~~~~~~~~~~~~~~
  43. If Anaconda is already installed, skip this step.
  44. .. code:: sh
  45. wget https://repo.anaconda.com/archive/Anaconda3-2024.06-1-Linux-x86_64.sh
  46. bash Anaconda3-2024.06-1-Linux-x86_64.sh
  47. In the final step, enter ``yes``, close the terminal, and reopen it.
  48. 4. Create an Environment Using Conda
  49. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  50. Specify Python version 3.10.
  51. .. code:: sh
  52. conda create -n MinerU python=3.10
  53. conda activate MinerU
  54. 5. Install Applications
  55. ~~~~~~~~~~~~~~~~~~~~~~~
  56. .. code:: sh
  57. pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
  58. .. admonition:: Important
  59. :class: tip
  60. ❗ After installation, make sure to check the version of ``magic-pdf`` using the following command:
  61. .. code:: sh
  62. magic-pdf --version
  63. If the version number is less than 0.7.0, please report the issue.
  64. 6. Download Models
  65. ~~~~~~~~~~~~~~~~~~
  66. Refer to detailed instructions on :doc:`download_model_weight_files`
  67. 7. Understand the Location of the Configuration File
  68. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  69. After completing the `6. Download Models <#6-download-models>`__ step,
  70. the script will automatically generate a ``magic-pdf.json`` file in the
  71. user directory and configure the default model path. You can find the
  72. ``magic-pdf.json`` file in your user directory.
  73. .. admonition:: TIP
  74. :class: tip
  75. The user directory for Linux is “/home/username”.
  76. 8. First Run
  77. ~~~~~~~~~~~~
  78. Download a sample file from the repository and test it.
  79. .. code:: sh
  80. wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf
  81. magic-pdf -p small_ocr.pdf -o ./output
  82. 9. Test CUDA Acceleration
  83. ~~~~~~~~~~~~~~~~~~~~~~~~~
  84. If your graphics card has at least **8GB** of VRAM, follow these steps
  85. to test CUDA acceleration:
  86. 1. Modify the value of ``"device-mode"`` in the ``magic-pdf.json``
  87. configuration file located in your home directory.
  88. .. code:: json
  89. {
  90. "device-mode": "cuda"
  91. }
  92. 2. Test CUDA acceleration with the following command:
  93. .. code:: sh
  94. magic-pdf -p small_ocr.pdf -o ./output
  95. 10. Enable CUDA Acceleration for OCR
  96. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  97. 1. Download ``paddlepaddle-gpu``. Installation will automatically enable
  98. OCR acceleration.
  99. .. code:: sh
  100. python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
  101. 2. Test OCR acceleration with the following command:
  102. .. code:: sh
  103. magic-pdf -p small_ocr.pdf -o ./output
  104. .. _windows_10_or_11_section:
  105. Windows 10/11
  106. --------------
  107. 1. Install CUDA and cuDNN
  108. ~~~~~~~~~~~~~~~~~~~~~~~~~
  109. Required versions: CUDA 11.8 + cuDNN 8.7.0
  110. - CUDA 11.8: https://developer.nvidia.com/cuda-11-8-0-download-archive
  111. - cuDNN v8.7.0 (November 28th, 2022), for CUDA 11.x:
  112. https://developer.nvidia.com/rdp/cudnn-archive
  113. 2. Install Anaconda
  114. ~~~~~~~~~~~~~~~~~~~
  115. If Anaconda is already installed, you can skip this step.
  116. Download link: https://repo.anaconda.com/archive/Anaconda3-2024.06-1-Windows-x86_64.exe
  117. 3. Create an Environment Using Conda
  118. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  119. Python version must be 3.10.
  120. ::
  121. conda create -n MinerU python=3.10
  122. conda activate MinerU
  123. 4. Install Applications
  124. ~~~~~~~~~~~~~~~~~~~~~~~
  125. ::
  126. pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
  127. .. admonition:: Important
  128. :class: tip
  129. ❗️After installation, verify the version of ``magic-pdf``:
  130. .. code:: bash
  131. magic-pdf --version
  132. If the version number is less than 0.7.0, please report it in the issues section.
  133. 5. Download Models
  134. ~~~~~~~~~~~~~~~~~~
  135. Refer to detailed instructions on :doc:`download_model_weight_files`
  136. 6. Understand the Location of the Configuration File
  137. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  138. After completing the `5. Download Models <#5-download-models>`__ step,
  139. the script will automatically generate a ``magic-pdf.json`` file in the
  140. user directory and configure the default model path. You can find the
  141. ``magic-pdf.json`` file in your 【user directory】 .
  142. .. admonition:: Tip
  143. :class: tip
  144. The user directory for Windows is “C:/Users/username”.
  145. 7. First Run
  146. ~~~~~~~~~~~~
  147. Download a sample file from the repository and test it.
  148. .. code:: powershell
  149. wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf -O small_ocr.pdf
  150. magic-pdf -p small_ocr.pdf -o ./output
  151. 8. Test CUDA Acceleration
  152. ~~~~~~~~~~~~~~~~~~~~~~~~~
  153. If your graphics card has at least 8GB of VRAM, follow these steps to
  154. test CUDA-accelerated parsing performance.
  155. 1. **Overwrite the installation of torch and torchvision** supporting CUDA.
  156. .. code:: sh
  157. pip install --force-reinstall torch==2.3.1 torchvision==0.18.1 --index-url https://download.pytorch.org/whl/cu118
  158. .. admonition:: Important
  159. :class: tip
  160. ❗️Ensure the following versions are specified in the command:
  161. .. code:: sh
  162. torch==2.3.1 torchvision==0.18.1
  163. These are the highest versions we support. Installing higher versions without specifying them will cause the program to fail.
  164. 2. **Modify the value of ``"device-mode"``** in the ``magic-pdf.json``
  165. configuration file located in your user directory.
  166. .. code:: json
  167. {
  168. "device-mode": "cuda"
  169. }
  170. 3. **Run the following command to test CUDA acceleration**:
  171. ::
  172. magic-pdf -p small_ocr.pdf -o ./output
  173. 9. Enable CUDA Acceleration for OCR
  174. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  175. 1. **Download paddlepaddle-gpu**, which will automatically enable OCR
  176. acceleration upon installation.
  177. ::
  178. pip install paddlepaddle-gpu==2.6.1
  179. 2. **Run the following command to test OCR acceleration**:
  180. ::
  181. magic-pdf -p small_ocr.pdf -o ./output