| .. |
|
config
|
20438bd2b7
feat(language-detection): add YOLOv11 language detection model
|
11 сар өмнө |
|
data
|
adbf492111
fix: support auto method and auto lang
|
7 сар өмнө |
|
dict2md
|
c46d3373de
refactor(ocr_mkcontent): improve title level handling and formatting
|
8 сар өмнө |
|
filter
|
a5342950f6
fix(filter): toggle invalid character detection method
|
9 сар өмнө |
|
integrations
|
b492c19c4c
refactor: move some constants or enums defs to config folder
|
1 жил өмнө |
|
libs
|
1b34f7e4ff
refactor(magic_pdf): replace PIL with NumPy for image processing
|
8 сар өмнө |
|
model
|
adbf492111
fix: support auto method and auto lang
|
7 сар өмнө |
|
operators
|
52efe94da8
feat(api): simplify markdown and content list generation
|
10 сар өмнө |
|
post_proc
|
842483ccb3
refactor(magic_pdf): improve paragraph splitting logic and update dependencies
|
8 сар өмнө |
|
pre_proc
|
3f2bafa88f
feat(pre_proc): add function to remove x-overlapping characters in spans
|
8 сар өмнө |
|
resources
|
af27c0cc81
refactor(magic_pdf): support mps device and optimize image processing
|
8 сар өмнө |
|
spark
|
b492c19c4c
refactor: move some constants or enums defs to config folder
|
1 жил өмнө |
|
tools
|
adbf492111
fix: support auto method and auto lang
|
7 сар өмнө |
|
utils
|
f6af67eb11
feat: support convert ppt/pptx/doc/docx
|
11 сар өмнө |
|
__init__.py
|
d5dbed7325
目录重构
|
1 жил өмнө |
|
pdf_parse_union_core_v2.py
|
3f2bafa88f
feat(pre_proc): add function to remove x-overlapping characters in spans
|
8 сар өмнө |