myhloli d986e39313 feat(llm_aided): add reasonability check and fine-tuning guidelines há 10 meses atrás
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model há 11 meses atrás
data 3271cf75d3 refactor(langdetect): simplify language detection model and improve logging há 11 meses atrás
dict2md 0a468eca6e feat(llm_aided): add title optimization feature há 11 meses atrás
filter e1be7da644 refactor(magic_pdf): switch to pdfminer for invalid character detection há 1 ano atrás
integrations b492c19c4c refactor: move some constants or enums defs to config folder há 1 ano atrás
libs 1a549a0e4b fix(language): remove invalid UTF-16 surrogate pairs from input text há 10 meses atrás
model db8be9745d fix(magic_pdf): limit batch ratio for GPU memory há 10 meses atrás
operators 52efe94da8 feat(api): simplify markdown and content list generation há 11 meses atrás
post_proc d986e39313 feat(llm_aided): add reasonability check and fine-tuning guidelines há 10 meses atrás
pre_proc f37b14bc83 refactor(pre_proc): adjust IOU threshold for character overlap detection há 10 meses atrás
resources c20e9a1e84 feat(layout): improve title block handling and layout detection há 10 meses atrás
spark b492c19c4c refactor: move some constants or enums defs to config folder há 1 ano atrás
tools f911a102ab feat(tools): add character bounding box drawing functionality há 11 meses atrás
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx há 1 ano atrás
__init__.py d5dbed7325 目录重构 há 1 ano atrás
pdf_parse_union_core_v2.py 8570e006f8 refactor(magic_pdf): improve title block merging logic há 10 meses atrás