|
- Config
- =========
- The **magic-pdf.json** file is typically located in the **${HOME}** directory on Linux or in the **C:\Users\{username}** directory on Windows.
- .. admonition:: Tip
- :class: tip
- You can override the default location of the config file by setting the following environment variable:
-
- export MINERU_TOOLS_CONFIG_JSON=new_magic_pdf.json
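The lookup described above can be sketched in Python. This is a minimal illustration, not MinerU's own loader: the helper name ``resolve_config_path`` is hypothetical, and the sketch returns the override exactly as given, since this page does not say how a relative value is interpreted.

```python
import os
from pathlib import Path

ENV_VAR = "MINERU_TOOLS_CONFIG_JSON"   # variable named in the tip above
DEFAULT_NAME = "magic-pdf.json"        # default file name from this page


def resolve_config_path(environ=None, home=None):
    """Return the override path if the variable is set, else <home>/magic-pdf.json.

    Hypothetical helper: `environ` and `home` are injectable so the lookup
    can be exercised without touching the real environment.
    """
    environ = os.environ if environ is None else environ
    home = Path.home() if home is None else Path(home)
    override = environ.get(ENV_VAR)
    if override:
        # Assumption: the value is used as given (no extra path resolution).
        return Path(override)
    return home / DEFAULT_NAME


if __name__ == "__main__":
    print(resolve_config_path())
```

Passing ``environ`` and ``home`` explicitly keeps the function easy to check in isolation.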
- magic-pdf.json
- ----------------
- .. code:: json
-
-     {
-         "bucket_info": {
-             "bucket-name-1": ["ak", "sk", "endpoint"],
-             "bucket-name-2": ["ak", "sk", "endpoint"]
-         },
-         "models-dir": "/tmp/models",
-         "layoutreader-model-dir": "/tmp/layoutreader",
-         "device-mode": "cpu",
-         "layout-config": {
-             "model": "doclayout_yolo"
-         },
-         "formula-config": {
-             "mfd_model": "yolo_v8_mfd",
-             "mfr_model": "unimernet_small",
-             "enable": true
-         },
-         "table-config": {
-             "model": "rapid_table",
-             "enable": true,
-             "max_time": 400
-         },
-         "config_version": "1.0.0"
-     }
- bucket_info
- ^^^^^^^^^^^^^^
- Stores the access key, secret key, and endpoint for each bucket on AWS S3-compatible storage. Each key is a bucket name and each value is a three-element list.
- Example:
- .. code:: text
-
-     {
-         "image_bucket": [{access_key}, {secret_key}, {endpoint}],
-         "video_bucket": [{access_key}, {secret_key}, {endpoint}]
-     }
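As a hedged illustration of this layout, the sketch below turns ``bucket_info`` into named credentials. The ``BucketCredentials`` type and ``parse_bucket_info`` function are hypothetical names introduced for this example; they are not part of MinerU.

```python
import json
from typing import NamedTuple


class BucketCredentials(NamedTuple):
    """Hypothetical container for one bucket's three-element entry."""
    access_key: str
    secret_key: str
    endpoint: str


def parse_bucket_info(config):
    """Map each bucket name to its (access_key, secret_key, endpoint) triple."""
    buckets = {}
    for name, values in config.get("bucket_info", {}).items():
        if len(values) != 3:
            raise ValueError(
                f"bucket {name!r} needs [access_key, secret_key, endpoint]"
            )
        buckets[name] = BucketCredentials(*values)
    return buckets


# Placeholder values mirroring the example config on this page.
sample = json.loads('{"bucket_info": {"image_bucket": ["ak", "sk", "endpoint"]}}')
creds = parse_bucket_info(sample)["image_bucket"]
print(creds.endpoint)
```

Naming the three positions makes it harder to swap the access key and secret key by accident.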
- models-dir
- ^^^^^^^^^^^^
- Directory that stores the models downloaded from **Hugging Face** or **ModelScope**. You do not need to modify this field if you download the models using the scripts shipped with **MinerU**.
- layoutreader-model-dir
- ^^^^^^^^^^^^^^^^^^^^^^^
- Directory that stores the layoutreader model downloaded from **Hugging Face** or **ModelScope**. You do not need to modify this field if you download the model using the scripts shipped with **MinerU**.
- device-mode
- ^^^^^^^^^^^^^^
- This field has two options, **cpu** or **cuda**.
- **cpu**: run inference on the CPU
- **cuda**: use CUDA to accelerate inference
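A small validation sketch for this field follows. The function name ``read_device_mode`` is hypothetical, and falling back to ``cpu`` when the key is missing is an assumption of this example, not documented behavior.

```python
VALID_DEVICE_MODES = ("cpu", "cuda")  # the two options listed above


def read_device_mode(config):
    """Return the configured device mode, rejecting unknown values."""
    # Assumption: a missing key falls back to "cpu".
    mode = config.get("device-mode", "cpu")
    if mode not in VALID_DEVICE_MODES:
        raise ValueError(
            f"device-mode must be one of {VALID_DEVICE_MODES}, got {mode!r}"
        )
    return mode


print(read_device_mode({"device-mode": "cuda"}))
```

Checking the value up front also catches a misspelled key such as ``devide-mode``, which would otherwise be ignored silently.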
- layout-config
- ^^^^^^^^^^^^^^^
- .. code:: json
-
-     {
-         "model": "doclayout_yolo"
-     }
- The layout model cannot currently be disabled.
- formula-config
- ^^^^^^^^^^^^^^^^
- .. code:: json
-
-     {
-         "mfd_model": "yolo_v8_mfd",
-         "mfr_model": "unimernet_small",
-         "enable": true
-     }
- mfd_model
- """"""""""
- Specify the formula detection model, options are ['yolo_v8_mfd']
- mfr_model
- """"""""""
- Specify the formula recognition model, options are ['unimernet_small']
- See `UniMERNet <https://github.com/opendatalab/UniMERNet>`_ for more details.
- enable
- """"""""
- On/off flag; options are [true, false]. **true** enables formula inference and **false** disables it.
- table-config
- ^^^^^^^^^^^^^^^^
- .. code:: json
-
-     {
-         "model": "rapid_table",
-         "enable": true,
-         "max_time": 400
-     }
- model
- """"""""
- Specify the table inference model, options are ['rapid_table']
- max_time
- """""""""
- Since table recognition is a time-consuming process, we set a timeout period. If the process exceeds this time, the table recognition will be terminated.
- enable
- """""""
- On/off flag; options are [true, false]. **true** enables table inference and **false** disables it.
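To show how the ``enable`` flags in ``formula-config`` and ``table-config`` combine, here is a minimal sketch that lists the stages a given config would run. ``enabled_stages`` is a hypothetical helper, and treating a missing section or flag as disabled is an assumption of this example.

```python
def enabled_stages(config):
    """List the inference stages that a config dict switches on."""
    stages = ["layout"]  # the layout model cannot be disabled
    # Assumption: a missing section or missing "enable" key means disabled.
    if config.get("formula-config", {}).get("enable", False):
        stages.append("formula")
    if config.get("table-config", {}).get("enable", False):
        stages.append("table")
    return stages


example = {
    "formula-config": {"mfd_model": "yolo_v8_mfd", "enable": True},
    "table-config": {"model": "rapid_table", "enable": False, "max_time": 400},
}
print(enabled_stages(example))
```

With this example config the table stage is skipped while layout and formula inference stay on.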
- config_version
- ^^^^^^^^^^^^^^^^
- The version of config schema.
- .. admonition:: Tip
- :class: tip
-
- Check `Config Schema <https://github.com/opendatalab/MinerU/blob/master/magic-pdf.template.json>`_ for the latest details
|