myhloli 869cf0a609 Merge remote-tracking branch 'origin/dev' into dev hai 10 meses
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model hai 11 meses
data 3271cf75d3 refactor(langdetect): simplify language detection model and improve logging hai 10 meses
dict2md 0a468eca6e feat(llm_aided): add title optimization feature hai 11 meses
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection hai 11 meses
integrations b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
libs 29681c4f79 fix(language): enhance language detection and text processing hai 10 meses
model bd1b76771e refactor(magic_pdf): update OCR engine selection in RapidTableModel hai 10 meses
operators 52efe94da8 feat(api): simplify markdown and content list generation hai 11 meses
post_proc 512adb6701 feat(model): add onnxruntime support for paddleocr on cpu hai 11 meses
pre_proc 15e876677d refactor(pre_proc): improve character overlap handling in spans hai 11 meses
resources a80ff05150 refactor(model): remove unused YOLO v11 language detection model hai 10 meses
spark b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
tools f911a102ab feat(tools): add character bounding box drawing functionality hai 11 meses
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx hai 11 meses
__init__.py d5dbed7325 目录重构 hai 1 ano
pdf_parse_union_core_v2.py 3f93b895bc feat(pdf_parse): add internal block sorting for images and tables hai 10 meses