icecraft adbf492111 fix: support auto method and auto lang 7 bulan lalu
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 bulan lalu
data adbf492111 fix: support auto method and auto lang 7 bulan lalu
dict2md c46d3373de refactor(ocr_mkcontent): improve title level handling and formatting 8 bulan lalu
filter a5342950f6 fix(filter): toggle invalid character detection method 9 bulan lalu
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 tahun lalu
libs 1b34f7e4ff refactor(magic_pdf): replace PIL with NumPy for image processing 8 bulan lalu
model adbf492111 fix: support auto method and auto lang 7 bulan lalu
operators 52efe94da8 feat(api): simplify markdown and content list generation 10 bulan lalu
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies 8 bulan lalu
pre_proc 3f2bafa88f feat(pre_proc): add function to remove x-overlapping characters in spans 8 bulan lalu
resources af27c0cc81 refactor(magic_pdf): support mps device and optimize image processing 8 bulan lalu
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 tahun lalu
tools adbf492111 fix: support auto method and auto lang 7 bulan lalu
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 11 bulan lalu
__init__.py d5dbed7325 目录重构 1 tahun lalu
pdf_parse_union_core_v2.py 3f2bafa88f feat(pre_proc): add function to remove x-overlapping characters in spans 8 bulan lalu