myhloli ddf5a8781a fix(batch): refactor OCR detection integration and area ratio calculation 5 달 전
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 달 전
data 1e01ffcf78 fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion 5 달 전
dict2md 002333a8d7 fix(ocr_mkcontent): improve image handling and footnote integration in markdown output 6 달 전
filter a5342950f6 fix(filter): toggle invalid character detection method 9 달 전
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 년 전
libs 580193bae0 Update version.py with new version 5 달 전
model ddf5a8781a fix(batch): refactor OCR detection integration and area ratio calculation 5 달 전
operators 52efe94da8 feat(api): simplify markdown and content list generation 10 달 전
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies 8 달 전
pre_proc 058d318491 feat(pdf_parse): add footnote block handling in layout split 7 달 전
resources e327e9bad5 fix(table): add model path for slanet-plus to resolve RapidTableError 7 달 전
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 년 전
tools 54ce594bf6 refactor(tools): improve code readability and maintainability 7 달 전
utils 2e5e55cfe2 refactor(office_to_pdf): simplify font checking and add logging 7 달 전
__init__.py d5dbed7325 目录重构 1 년 전
pdf_parse_union_core_v2.py 058d318491 feat(pdf_parse): add footnote block handling in layout split 7 달 전