|
|
@@ -28,12 +28,12 @@ NVIDIA drivers are already installed, and you can skip Step 2.
|
|
|
|
|
|
.. note::
|
|
|
|
|
|
- ``CUDA Version`` should be >= 12.1, If the displayed version number is less than 12.1, please upgrade the driver.
|
|
|
+ ``CUDA Version`` should be >= 12.4, If the displayed version number is less than 12.4, please upgrade the driver.
|
|
|
|
|
|
.. code:: text
|
|
|
|
|
|
+---------------------------------------------------------------------------------------+
|
|
|
- | NVIDIA-SMI 537.34 Driver Version: 537.34 CUDA Version: 12.2 |
|
|
|
+ | NVIDIA-SMI 570.133.07 Driver Version: 572.83 CUDA Version: 12.8 |
|
|
|
|-----------------------------------------+----------------------+----------------------+
|
|
|
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
|
|
|
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
|
|
|
@@ -52,7 +52,7 @@ If no driver is installed, use the following command:
|
|
|
.. code:: sh
|
|
|
|
|
|
sudo apt-get update
|
|
|
- sudo apt-get install nvidia-driver-545
|
|
|
+ sudo apt-get install nvidia-driver-570-server
|
|
|
|
|
|
Install the proprietary driver and restart your computer after
|
|
|
installation.
|
|
|
@@ -80,15 +80,15 @@ Specify Python version 3.10.
|
|
|
|
|
|
.. code:: sh
|
|
|
|
|
|
- conda create -n MinerU python=3.10
|
|
|
- conda activate MinerU
|
|
|
+ conda create -n mineru 'python<3.13' -y
|
|
|
+ conda activate mineru
|
|
|
|
|
|
5. Install Applications
|
|
|
~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
|
|
|
.. code:: sh
|
|
|
|
|
|
- pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
|
|
|
+ pip install -U magic-pdf[full]
|
|
|
|
|
|
.. admonition:: Important
|
|
|
:class: tip
|
|
|
@@ -99,7 +99,7 @@ Specify Python version 3.10.
|
|
|
|
|
|
magic-pdf --version
|
|
|
|
|
|
-If the version number is less than 0.7.0, please report the issue.
|
|
|
+If the version number is less than 1.3.0, please report the issue.
|
|
|
|
|
|
6. Download Models
|
|
|
~~~~~~~~~~~~~~~~~~
|
|
|
@@ -126,7 +126,7 @@ Download a sample file from the repository and test it.
|
|
|
|
|
|
.. code:: sh
|
|
|
|
|
|
- wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf
|
|
|
+ wget https://github.com/opendatalab/MinerU/raw/master/demo/pdfs/small_ocr.pdf
|
|
|
magic-pdf -p small_ocr.pdf -o ./output
|
|
|
|
|
|
9. Test CUDA Acceleration
|
|
|
@@ -150,23 +150,6 @@ to test CUDA acceleration:
|
|
|
|
|
|
magic-pdf -p small_ocr.pdf -o ./output
|
|
|
|
|
|
-10. Enable CUDA Acceleration for OCR
|
|
|
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
-
|
|
|
-1. Download ``paddlepaddle-gpu``. Installation will automatically enable
|
|
|
- OCR acceleration.
|
|
|
-
|
|
|
- .. code:: sh
|
|
|
-
|
|
|
- python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
|
|
|
-
|
|
|
-2. Test OCR acceleration with the following command:
|
|
|
-
|
|
|
- .. code:: sh
|
|
|
-
|
|
|
- magic-pdf -p small_ocr.pdf -o ./output
|
|
|
-
|
|
|
-
|
|
|
|
|
|
.. _windows_10_or_11_section:
|
|
|
|
|
|
@@ -176,11 +159,12 @@ Windows 10/11
|
|
|
1. Install CUDA and cuDNN
|
|
|
~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
|
|
|
-Required versions: CUDA 11.8 + cuDNN 8.7.0
|
|
|
+You need to install a CUDA version that is compatible with torch's requirements. Currently, torch supports CUDA 11.8/12.4/12.6.
|
|
|
+
|
|
|
+- CUDA 11.8 https://developer.nvidia.com/cuda-11-8-0-download-archive
|
|
|
+- CUDA 12.4 https://developer.nvidia.com/cuda-12-4-0-download-archive
|
|
|
+- CUDA 12.6 https://developer.nvidia.com/cuda-12-6-0-download-archive
|
|
|
|
|
|
-- CUDA 11.8: https://developer.nvidia.com/cuda-11-8-0-download-archive
|
|
|
-- cuDNN v8.7.0 (November 28th, 2022), for CUDA 11.x:
|
|
|
- https://developer.nvidia.com/rdp/cudnn-archive
|
|
|
|
|
|
2. Install Anaconda
|
|
|
~~~~~~~~~~~~~~~~~~~
|
|
|
@@ -192,19 +176,17 @@ Download link: https://repo.anaconda.com/archive/Anaconda3-2024.06-1-Windows-x86
|
|
|
3. Create an Environment Using Conda
|
|
|
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
|
|
|
-Python version must be 3.10.
|
|
|
-
|
|
|
::
|
|
|
|
|
|
- conda create -n MinerU python=3.10
|
|
|
- conda activate MinerU
|
|
|
+ conda create -n mineru 'python<3.13' -y
|
|
|
+ conda activate mineru
|
|
|
|
|
|
4. Install Applications
|
|
|
~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
|
|
|
::
|
|
|
|
|
|
- pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
|
|
|
+ pip install -U magic-pdf[full]
|
|
|
|
|
|
.. admonition:: Important
|
|
|
:class: tip
|
|
|
@@ -215,7 +197,7 @@ Python version must be 3.10.
|
|
|
|
|
|
magic-pdf --version
|
|
|
|
|
|
- If the version number is less than 0.7.0, please report it in the issues section.
|
|
|
+ If the version number is less than 1.3.0, please report it in the issues section.
|
|
|
|
|
|
5. Download Models
|
|
|
~~~~~~~~~~~~~~~~~~
|
|
|
@@ -242,7 +224,7 @@ Download a sample file from the repository and test it.
|
|
|
|
|
|
.. code:: powershell
|
|
|
|
|
|
- wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf -O small_ocr.pdf
|
|
|
+ wget https://github.com/opendatalab/MinerU/raw/master/demo/pdfs/small_ocr.pdf -O small_ocr.pdf
|
|
|
magic-pdf -p small_ocr.pdf -o ./output
|
|
|
|
|
|
8. Test CUDA Acceleration
|
|
|
@@ -251,23 +233,12 @@ Download a sample file from the repository and test it.
|
|
|
If your graphics card has at least 8GB of VRAM, follow these steps to
|
|
|
test CUDA-accelerated parsing performance.
|
|
|
|
|
|
-1. **Overwrite the installation of torch and torchvision** supporting CUDA.
|
|
|
+1. **Overwrite the installation of torch and torchvision** supporting CUDA.(Please select the appropriate index-url based on your CUDA version. For more details, refer to the [PyTorch official website](https://pytorch.org/get-started/locally/).)
|
|
|
|
|
|
.. code:: sh
|
|
|
|
|
|
- pip install --force-reinstall torch==2.3.1 torchvision==0.18.1 --index-url https://download.pytorch.org/whl/cu118
|
|
|
-
|
|
|
-.. admonition:: Important
|
|
|
- :class: tip
|
|
|
-
|
|
|
- ❗️Ensure the following versions are specified in the command:
|
|
|
-
|
|
|
-
|
|
|
- .. code:: sh
|
|
|
+ pip install --force-reinstall torch==2.6.0 torchvision==0.21.1 "numpy<2.0.0" --index-url https://download.pytorch.org/whl/cu124
|
|
|
|
|
|
- torch==2.3.1 torchvision==0.18.1
|
|
|
-
|
|
|
- These are the highest versions we support. Installing higher versions without specifying them will cause the program to fail.
|
|
|
|
|
|
2. **Modify the value of ``"device-mode"``** in the ``magic-pdf.json``
|
|
|
configuration file located in your user directory.
|
|
|
@@ -283,19 +254,3 @@ test CUDA-accelerated parsing performance.
|
|
|
::
|
|
|
|
|
|
magic-pdf -p small_ocr.pdf -o ./output
|
|
|
-
|
|
|
-9. Enable CUDA Acceleration for OCR
|
|
|
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
|
-
|
|
|
-1. **Download paddlepaddle-gpu**, which will automatically enable OCR
|
|
|
- acceleration upon installation.
|
|
|
-
|
|
|
- ::
|
|
|
-
|
|
|
- pip install paddlepaddle-gpu==2.6.1
|
|
|
-
|
|
|
-2. **Run the following command to test OCR acceleration**:
|
|
|
-
|
|
|
- ::
|
|
|
-
|
|
|
- magic-pdf -p small_ocr.pdf -o ./output
|