myhloli
|
8d178b2b7e
feat: enhance file type detection by using guess_suffix_by_path for document parsing
|
пре 2 месеци |
myhloli
|
1083476a02
fix: typo
|
пре 2 месеци |
myhloli
|
da29782a26
feat: add contrast calculation for span images to improve OCR accuracy
|
пре 2 месеци |
myhloli
|
75797a3b7c
feat: update header title to MinerU 2.5 and add model link in header.html; add Dingo tool link in README_zh-CN.md
|
пре 2 месеци |
myhloli
|
5b73b89ceb
fix: add handling for reference text blocks in draw_bbox.py
|
пре 2 месеци |
myhloli
|
c5b2926c7b
fix: extend text block handling to include reference text in draw_bbox.py
|
пре 2 месеци |
myhloli
|
3ca520a3fe
feat: implement dynamic batch size calculation based on GPU memory in vlm_analyze.py
|
пре 2 месеци |
myhloli
|
ba36a94aa0
fix: streamline model argument handling in server.py
|
пре 2 месеци |
myhloli
|
11ebb47891
fix: remove redundant model_path checks for vllm backends in vlm_analyze.py
|
пре 2 месеци |
myhloli
|
dd8dd5197b
fix: correct variable usage for language guessing in code block formatting
|
пре 2 месеци |
myhloli
|
7a71cfe288
feat: add support for vllm-async-engine backend in vlm_analyze.py
|
пре 2 месеци |
myhloli
|
bba31191a4
fix: update backend handling to enforce correct usage of vlm engines in sync and async modes
|
пре 2 месеци |
Xiaomeng Zhao
|
9041f04588
Merge pull request #18 from myhloli/vlm_2.5
|
пре 2 месеци |
Xiaomeng Zhao
|
69a9d11b0b
Merge pull request #3489 from e06084/dev
|
пре 2 месеци |
chupei
|
36e7267ce1
docs: README add dingo link
|
пре 2 месеци |
myhloli
|
14f347d613
feat: add code_content_clean function to sanitize Markdown code blocks
|
пре 2 месеци |
myhloli
|
6ea2cfeb21
fix: update MinerU version references in enum_class.py and header.html
|
пре 2 месеци |
myhloli
|
078099f19d
feat: enhance language guessing for code blocks by integrating guess_lang into line structure
|
пре 2 месеци |
myhloli
|
25d4a4588a
fix: specify version range for Magika dependency in pyproject.toml
|
пре 2 месеци |
myhloli
|
679dad3aac
fix: streamline temporary file handling for image and PDF processing in fast_api.py
|
пре 2 месеци |
myhloli
|
e60da65cca
feat: enhance file type detection using Magika for improved suffix guessing
|
пре 2 месеци |
myhloli
|
f081d36a3a
feat: implement language guessing for code blocks using Magika
|
пре 2 месеци |
myhloli
|
c74e712918
fix: correct language guessing in code block formatting in vlm_middle_json_mkcontent.py
|
пре 2 месеци |
myhloli
|
f2b944ab06
fix: enhance language guessing for code blocks in VLM processing
|
пре 2 месеци |
myhloli
|
2e945adcc0
docs: update output_files.md to reflect significant changes in VLM backend output for version 2.5
|
пре 2 месеци |
myhloli
|
39eaf31fb9
docs: update output_files.md to reflect significant changes in VLM backend output for version 2.5
|
пре 2 месеци |
myhloli
|
7717534ea7
fix: remove unused import of list_iterator from draw_bbox.py
|
пре 2 месеци |
Xiaomeng Zhao
|
6166b98cd4
Merge pull request #17 from myhloli/dev
|
пре 2 месеци |
Xiaomeng Zhao
|
a02ab97ea0
Merge pull request #3473 from myhloli/dev
|
пре 2 месеци |
myhloli
|
beadb7a689
fix: adjust overlap area ratio for image and table spans in span_block_fix
|
пре 2 месеци |