myhloli 52efe94da8 feat(api): simplify markdown and content list generation hai 11 meses
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model hai 11 meses
data d637dab354 fix: s3 path join method hai 11 meses
dict2md 0a468eca6e feat(llm_aided): add title optimization feature hai 11 meses
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection hai 1 ano
integrations b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
libs f911a102ab feat(tools): add character bounding box drawing functionality hai 11 meses
model 12caa7845d fix(table): handle empty OCR result in rapidtable hai 11 meses
operators 52efe94da8 feat(api): simplify markdown and content list generation hai 11 meses
post_proc 512adb6701 feat(model): add onnxruntime support for paddleocr on cpu hai 11 meses
pre_proc 15e876677d refactor(pre_proc): improve character overlap handling in spans hai 11 meses
resources 20438bd2b7 feat(language-detection): add YOLOv11 language detection model hai 11 meses
spark b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
tools f911a102ab feat(tools): add character bounding box drawing functionality hai 11 meses
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx hai 1 ano
__init__.py d5dbed7325 目录重构 hai 1 ano
pdf_parse_union_core_v2.py 8a0aa7a479 Merge branch 'dev' into dev hai 11 meses