Selaa lähdekoodia

feat(新增YUSYS本地OCR配置): 在processor_configs.yaml中新增yusys_mineruocr_local配置,支持本地文档解析,包含输入输出参数、额外参数及日志目录设置,提升OCR处理的灵活性与可用性。

zhch158_admin 1 kuukausi sitten
vanhempi
commit
b599507513
1 muutettua tiedostoa jossa 16 lisäystä ja 0 poistoa
  1. 16 0
      ocr_tools/ocr_batch/processor_configs.yaml

+ 16 - 0
ocr_tools/ocr_batch/processor_configs.yaml

@@ -110,6 +110,22 @@ processors:
     venv: "conda activate mineru"
     description: "YUSYS(local) Wired UNET OCR PaddleOCR-VL"
 
+  yusys_mineruocr_local:
+    script: "/Users/zhch158/workspace/repository.git/ocr_platform/ocr_tools/universal_doc_parser/main_v2.py"
+    input_arg: "--input"
+    output_arg: "--output_dir"
+    scene_arg: "--scene"
+    extra_args:
+      - "--config=/Users/zhch158/workspace/repository.git/ocr_platform/ocr_tools/universal_doc_parser/config/bank_statement_mineru_vl_local.yaml"
+      - "--pages=1-35"
+      - "--streaming"
+      - "--debug"
+      - "--log_level=DEBUG"
+    output_subdir: "bank_statement_yusys_mineruocr_local"
+    log_subdir: "logs/bank_statement_yusys_mineruocr_local"
+    venv: "conda activate mineru"
+    description: "YUSYS(local) Wired UNET OCR MinerU-VL"
+
   # -------------------------------------------------------------------------
   # PaddleOCR-VL 处理器
   # -------------------------------------------------------------------------