Browse Source

docs: update OCR language support to reflect recognition of 109 languages

myhloli 2 tuần trước cách đây
mục cha
commit
2238c49352
2 tập tin đã thay đổi với 2 bổ sung2 xóa
  1. 1 1
      docs/en/index.md
  2. 1 1
      docs/zh/index.md

+ 1 - 1
docs/en/index.md

@@ -57,7 +57,7 @@ Compared to well-known commercial products domestically and internationally, Min
 - Automatically identify and convert formulas in documents to LaTeX format
 - Automatically identify and convert tables in documents to HTML format
 - Automatically detect scanned PDFs and garbled PDFs, and enable OCR functionality
-- OCR supports detection and recognition of 84 languages
+- OCR supports detection and recognition of 109 languages
 - Support multiple output formats, such as multimodal and NLP Markdown, reading-order-sorted JSON, and information-rich intermediate formats
 - Support multiple visualization results, including layout visualization, span visualization, etc., for efficient confirmation of output effects and quality inspection
 - Support pure CPU environment operation, and support GPU(CUDA)/NPU(CANN)/MPS acceleration

+ 1 - 1
docs/zh/index.md

@@ -56,7 +56,7 @@ MinerU诞生于[书生-浦语](https://github.com/InternLM/InternLM)的预训练
 - 自动识别并转换文档中的公式为LaTeX格式
 - 自动识别并转换文档中的表格为HTML格式
 - 自动检测扫描版PDF和乱码PDF,并启用OCR功能
-- OCR支持84种语言的检测与识别
+- OCR支持109种语言的检测与识别
 - 支持多种输出格式,如多模态与NLP的Markdown、按阅读顺序排序的JSON、含有丰富信息的中间格式等
 - 支持多种可视化结果,包括layout可视化、span可视化等,便于高效确认输出效果与质检
 - 支持纯CPU环境运行,并支持 GPU(CUDA)/NPU(CANN)/MPS 加速