Xiaomeng Zhao b3ac3ac148 Merge branch 'master' into release-1.3.2 il y a 7 mois
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model il y a 11 mois
data e36a083dc3 fix: image dataset add lang field il y a 7 mois
dict2md c46d3373de refactor(ocr_mkcontent): improve title level handling and formatting il y a 8 mois
filter a5342950f6 fix(filter): toggle invalid character detection method il y a 9 mois
integrations b492c19c4c refactor: move some constants or enums defs to config folder il y a 1 an
libs 79feb926b7 Update version.py with new version il y a 7 mois
model ea730ae2e9 refactor(ocr): improve OCR score precision to three decimal places il y a 7 mois
operators 52efe94da8 feat(api): simplify markdown and content list generation il y a 10 mois
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies il y a 8 mois
pre_proc be505a958c fix(pre_proc): improve character overlap handling in OCR processing il y a 8 mois
resources e327e9bad5 fix(table): add model path for slanet-plus to resolve RapidTableError il y a 7 mois
spark b492c19c4c refactor: move some constants or enums defs to config folder il y a 1 an
tools 3e8ee23eed fix: convert image with pymupdf il y a 7 mois
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx il y a 11 mois
__init__.py d5dbed7325 目录重构 il y a 1 an
pdf_parse_union_core_v2.py ea730ae2e9 refactor(ocr): improve OCR score precision to three decimal places il y a 7 mois