myhloli cf4ea78dac refactor: remove torchtext deprecation warning handling 8 months ago
..
config 20438bd2b7 feat(language-detection): add YOLOv11 language detection model 11 months ago
data 1df26448ac Merge pull request #6 from myhloli/remove-pillow 8 months ago
dict2md c46d3373de refactor(ocr_mkcontent): improve title level handling and formatting 8 months ago
filter a5342950f6 fix(filter): toggle invalid character detection method 9 months ago
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
libs 1b34f7e4ff refactor(magic_pdf): replace PIL with NumPy for image processing 9 months ago
model cf4ea78dac refactor: remove torchtext deprecation warning handling 8 months ago
operators 52efe94da8 feat(api): simplify markdown and content list generation 11 months ago
post_proc 842483ccb3 refactor(magic_pdf): improve paragraph splitting logic and update dependencies 9 months ago
pre_proc 7a8568045d fix(pre_proc): add Discarded block type to span block type compatibility 8 months ago
resources af27c0cc81 refactor(magic_pdf): support mps device and optimize image processing 8 months ago
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
tools e9c2473913 style: remove unused code 8 months ago
utils f6af67eb11 feat: support convert ppt/pptx/doc/docx 11 months ago
__init__.py d5dbed7325 directory restructuring 1 year ago
pdf_parse_union_core_v2.py cf4ea78dac refactor: remove torchtext deprecation warning handling 8 months ago