Commit History

Author SHA1 Message Date
  myhloli 8d178b2b7e feat: enhance file type detection by using guess_suffix_by_path for document parsing 2 months ago
  myhloli 1083476a02 fix: typo 2 months ago
  myhloli da29782a26 feat: add contrast calculation for span images to improve OCR accuracy 2 months ago
  myhloli 75797a3b7c feat: update header title to MinerU 2.5 and add model link in header.html; add Dingo tool link in README_zh-CN.md 2 months ago
  myhloli 5b73b89ceb fix: add handling for reference text blocks in draw_bbox.py 2 months ago
  myhloli c5b2926c7b fix: extend text block handling to include reference text in draw_bbox.py 2 months ago
  myhloli 3ca520a3fe feat: implement dynamic batch size calculation based on GPU memory in vlm_analyze.py 2 months ago
  myhloli ba36a94aa0 fix: streamline model argument handling in server.py 2 months ago
  myhloli 11ebb47891 fix: remove redundant model_path checks for vllm backends in vlm_analyze.py 2 months ago
  myhloli dd8dd5197b fix: correct variable usage for language guessing in code block formatting 2 months ago
  myhloli 7a71cfe288 feat: add support for vllm-async-engine backend in vlm_analyze.py 2 months ago
  myhloli bba31191a4 fix: update backend handling to enforce correct usage of vlm engines in sync and async modes 2 months ago
  Xiaomeng Zhao 9041f04588 Merge pull request #18 from myhloli/vlm_2.5 2 months ago
  Xiaomeng Zhao 69a9d11b0b Merge pull request #3489 from e06084/dev 2 months ago
  chupei 36e7267ce1 docs: README add dingo link 2 months ago
  myhloli 14f347d613 feat: add code_content_clean function to sanitize Markdown code blocks 2 months ago
  myhloli 6ea2cfeb21 fix: update MinerU version references in enum_class.py and header.html 2 months ago
  myhloli 078099f19d feat: enhance language guessing for code blocks by integrating guess_lang into line structure 2 months ago
  myhloli 25d4a4588a fix: specify version range for Magika dependency in pyproject.toml 2 months ago
  myhloli 679dad3aac fix: streamline temporary file handling for image and PDF processing in fast_api.py 2 months ago
  myhloli e60da65cca feat: enhance file type detection using Magika for improved suffix guessing 2 months ago
  myhloli f081d36a3a feat: implement language guessing for code blocks using Magika 2 months ago
  myhloli c74e712918 fix: correct language guessing in code block formatting in vlm_middle_json_mkcontent.py 2 months ago
  myhloli f2b944ab06 fix: enhance language guessing for code blocks in VLM processing 2 months ago
  myhloli 2e945adcc0 docs: update output_files.md to reflect significant changes in VLM backend output for version 2.5 2 months ago
  myhloli 39eaf31fb9 docs: update output_files.md to reflect significant changes in VLM backend output for version 2.5 2 months ago
  myhloli 7717534ea7 fix: remove unused import of list_iterator from draw_bbox.py 2 months ago
  Xiaomeng Zhao 6166b98cd4 Merge pull request #17 from myhloli/dev 2 months ago
  Xiaomeng Zhao a02ab97ea0 Merge pull request #3473 from myhloli/dev 2 months ago
  myhloli beadb7a689 fix: adjust overlap area ratio for image and table spans in span_block_fix 2 months ago