myhloli c20e9a1e84 feat(layout): improve title block handling and layout detection 10 hónapja
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 hónapja
data 3271cf75d3 refactor(langdetect): simplify language detection model and improve logging 10 hónapja
dict2md 0a468eca6e feat(llm_aided): add title optimization feature 11 hónapja
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection 11 hónapja
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 éve
libs c20e9a1e84 feat(layout): improve title block handling and layout detection 10 hónapja
model c20e9a1e84 feat(layout): improve title block handling and layout detection 10 hónapja
operators 52efe94da8 feat(api): simplify markdown and content list generation 10 hónapja
post_proc 512adb6701 feat(model): add onnxruntime support for paddleocr on cpu 10 hónapja
pre_proc 15e876677d refactor(pre_proc): improve character overlap handling in spans 11 hónapja
resources c20e9a1e84 feat(layout): improve title block handling and layout detection 10 hónapja
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 éve
tools f911a102ab feat(tools): add character bounding box drawing functionality 10 hónapja
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 11 hónapja
__init__.py d5dbed7325 目录重构 1 éve
pdf_parse_union_core_v2.py c20e9a1e84 feat(layout): improve title block handling and layout detection 10 hónapja