myhloli 7a8568045d fix(pre_proc): add Discarded block type to span block type compatibility 8 сар өмнө
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 сар өмнө
data 3271cf75d3 refactor(langdetect): simplify language detection model and improve logging 10 сар өмнө
dict2md df1b8f598f refactor(ocr_mkcontent): optimize full-width character handling 9 сар өмнө
filter a5342950f6 fix(filter): toggle invalid character detection method 9 сар өмнө
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 жил өмнө
libs 4da3c0f5c0 Update version.py with new version 9 сар өмнө
model 0b05dff74f perf(inference): adjust batch ratio for high GPU memory 9 сар өмнө
operators 52efe94da8 feat(api): simplify markdown and content list generation 11 сар өмнө
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies 9 сар өмнө
pre_proc 7a8568045d fix(pre_proc): add Discarded block type to span block type compatibility 8 сар өмнө
resources 2a3a006f4d fix(models): update unimernet_small model path 10 сар өмнө
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 жил өмнө
tools f911a102ab feat(tools): add character bounding box drawing functionality 11 сар өмнө
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 11 сар өмнө
__init__.py d5dbed7325 目录重构 1 жил өмнө
pdf_parse_union_core_v2.py 6bfc17119d refactor(pdf_parse): comment out performance measurement and logging 9 сар өмнө