Xiaomeng Zhao da3257a631 Merge pull request #1352 from myhloli/add-llm-aided пре 11 месеци
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model пре 11 месеци
data cd11ddcd6b docs: make sure the generate process of docs work properly пре 11 месеци
dict2md c660fdc8f0 feat(llm): add LLM-aided formula and text correction пре 11 месеци
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection пре 1 година
integrations b492c19c4c refactor: move some constants or enums defs to config folder пре 1 година
libs c660fdc8f0 feat(llm): add LLM-aided formula and text correction пре 11 месеци
model 489f70e91d refactor(magic_pdf): move model config variables пре 11 месеци
operators 489f70e91d refactor(magic_pdf): move model config variables пре 11 месеци
pipe b2887ca0aa refactor: refactor code пре 11 месеци
post_proc c660fdc8f0 feat(llm): add LLM-aided formula and text correction пре 11 месеци
pre_proc 15e876677d refactor(pre_proc): improve character overlap handling in spans пре 11 месеци
resources 20438bd2b7 feat(language-detection): add YOLOv11 language detection model пре 11 месеци
rw 2db3c26374 refactor(libs): remove unused imports and functions пре 1 година
spark b492c19c4c refactor: move some constants or enums defs to config folder пре 1 година
tools bf2ff5a241 feat(gradio-app): improve PDF conversion and UI functionalities пре 11 месеци
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx пре 1 година
__init__.py d5dbed7325 目录重构 пре 1 година
pdf_parse_by_ocr.py a3a720ea87 refactor: isolate inference and pipeline пре 1 година
pdf_parse_by_txt.py a3a720ea87 refactor: isolate inference and pipeline пре 1 година
pdf_parse_union_core_v2.py da3257a631 Merge pull request #1352 from myhloli/add-llm-aided пре 11 месеци
user_api.py 87af738ab1 fix: 1. ocr txt mode error 2. lose pdf_parse_type field пре 1 година