Xiaomeng Zhao 3f64c16ba9 Merge pull request #1311 from myhloli/add-auto-lang 11 months ago
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
data 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
dict2md c638fc5d1f fix(pdf): improve ligature handling and text extraction 11 months ago
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection 1 year ago
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
libs 391a99860d Update version.py with new version 1 year ago
model 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
para 41545a13c6 refactor(para): adjust line height multiplier for block splitting 1 year ago
pipe e9d36221dd feat: add get_middle_json method 11 months ago
post_proc 6a75d7dce5 perf(layout): optimize layout detection for PDF extraction 1 year ago
pre_proc 7f8dc353b0 fix(pre_proc): prevent errors when imageWriter is None 1 year ago
resources 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
rw 2db3c26374 refactor(libs): remove unused imports and functions 1 year ago
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
tools 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 1 year ago
__init__.py d5dbed7325 Directory restructuring 1 year ago
pdf_parse_by_ocr.py a3a720ea87 refactor: isolate inference and pipeline 1 year ago
pdf_parse_by_txt.py a3a720ea87 refactor: isolate inference and pipeline 1 year ago
pdf_parse_union_core_v2.py 9e4ebea939 refactor(magic_pdf): remove YOLO_VERBOSE setting and update YOLOv8 prediction verbosity 11 months ago
user_api.py 87af738ab1 fix: 1. ocr txt mode error 2. lose pdf_parse_type field 1 year ago