icecraft 15cd97ff17 fix: match multiple captions 9 months ago
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
data 3271cf75d3 refactor(langdetect): simplify language detection model and improve logging 10 months ago
dict2md 315adbce38 feat(ocr_mkcontent): add full-width to half-width character conversion 9 months ago
filter a5342950f6 fix(filter): toggle invalid character detection method 9 months ago
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
libs e4e4eef1f8 perf(language_detection): optimize batch size for language detection model 9 months ago
model 15cd97ff17 fix: match multiple captions 9 months ago
operators 52efe94da8 feat(api): simplify markdown and content list generation 10 months ago
post_proc 9e332f068a fix(llm_aided): update prompt 9 months ago
pre_proc 19916856e7 feat(pre_proc): add block type compatibility check for span allocation 9 months ago
resources 2a3a006f4d fix(models): update unimernet_small model path 10 months ago
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
tools f911a102ab feat(tools): add character bounding box drawing functionality 11 months ago
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 11 months ago
__init__.py d5dbed7325 directory restructuring 1 year ago
pdf_parse_union_core_v2.py 30bd3a83c7 fix(pdf_parse): Fixed the issue where some headings were missing in certain complex layouts. 9 months ago