myhloli d19911f113 Update version.py with new version 1 year ago
..
config b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
data b1adde8e66 fix: rewrite projects/ and demos with new data api 1 year ago
dict2md 782e6571bc fix(ocr_mkcontent): handle empty paragraphs on pages 1 year ago
filter ac88815620 refactor(pdf_check): improve character detection using PyMuPDF 1 year ago
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
libs d19911f113 Update version.py with new version 1 year ago
model 7f2f2c0f28 refactor(ocr): Fix the error of paddleocr failing to initialize in a multi-threaded environment 1 year ago
para f674b8d413 refactor(para): improve language detection and block splitting 1 year ago
pipe b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
pre_proc a46b12e967 refactor(pre_proc): clean up OCR processing code 1 year ago
resources 240fe99e3c feat(table): integrate RapidTable model for table recognition 1 year ago
rw 2db3c26374 refactor(libs): remove unused imports and functions 1 year ago
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
tools 9c8d995ed2 Merge pull request #1045 from myhloli/dev 1 year ago
utils 9cda7051c6 add init to magic_pdf.utils 1 year ago
__init__.py d5dbed7325 restructure directories 1 year ago
pdf_parse_by_ocr.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 year ago
pdf_parse_by_txt.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 year ago
pdf_parse_union_core_v2.py d4345b6e39 refactor(pdf_parse): adjust character-axis alignment algorithm 1 year ago
user_api.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 year ago