myhloli 9b3339f1e4 Merge remote-tracking branch 'origin/dev' into dev há 8 meses atrás
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model há 11 meses atrás
data adbf492111 fix: support auto method and auto lang há 8 meses atrás
dict2md c46d3373de refactor(ocr_mkcontent): improve title level handling and formatting há 8 meses atrás
filter a5342950f6 fix(filter): toggle invalid character detection method há 10 meses atrás
integrations b492c19c4c refactor: move some constants or enums defs to config folder há 1 ano atrás
libs 978ef41cdd feat(performance_stats): improve function identification in execution time logging há 8 meses atrás
model 41f1fb8ad6 refactor(ocr): remove unused OCR dictionaries and update model configurations há 8 meses atrás
operators 52efe94da8 feat(api): simplify markdown and content list generation há 11 meses atrás
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies há 9 meses atrás
pre_proc be505a958c fix(pre_proc): improve character overlap handling in OCR processing há 8 meses atrás
resources af27c0cc81 refactor(magic_pdf): support mps device and optimize image processing há 8 meses atrás
spark b492c19c4c refactor: move some constants or enums defs to config folder há 1 ano atrás
tools bbba2a120c feat: batch inference with ocr and lang flag há 8 meses atrás
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx há 1 ano atrás
__init__.py d5dbed7325 目录重构 há 1 ano atrás
pdf_parse_union_core_v2.py a024c30fc4 feat(ocr): implement dynamic OCR processing for text spans with low contrast há 8 meses atrás