myhloli b9f3435cb7 Update version.py with new version 11 mēneši atpakaļ
..
config b492c19c4c refactor: move some constants or enums defs to config folder 1 gadu atpakaļ
data b0529b6fbd fix: reduce maximum image size 11 mēneši atpakaļ
dict2md b80befe9cf refactor(mkcontent): optimize paragraph text merging and language detection 11 mēneši atpakaļ
filter ac88815620 refactor(pdf_check): improve character detection using PyMuPDF 11 mēneši atpakaļ
integrations b492c19c4c refactor: move some constants or enums defs to config folder 1 gadu atpakaļ
libs b9f3435cb7 Update version.py with new version 11 mēneši atpakaļ
model 7f2f2c0f28 refactor(ocr): Fix the error of paddleocr failing to initialize in a multi-threaded environment 11 mēneši atpakaļ
para 41545a13c6 refactor(para): adjust line height multiplier for block splitting 11 mēneši atpakaļ
pipe b492c19c4c refactor: move some constants or enums defs to config folder 1 gadu atpakaļ
pre_proc 7f8dc353b0 fix(pre_proc): prevent errors when imageWriter is None 11 mēneši atpakaļ
resources 240fe99e3c feat(table): integrate RapidTable model for table recognition 1 gadu atpakaļ
rw 2db3c26374 refactor(libs): remove unused imports and functions 11 mēneši atpakaļ
spark b492c19c4c refactor: move some constants or enums defs to config folder 1 gadu atpakaļ
tools 9c8d995ed2 Merge pull request #1045 from myhloli/dev 1 gadu atpakaļ
utils 9cda7051c6 add init to magic_pdf.utils 1 gadu atpakaļ
__init__.py d5dbed7325 目录重构 1 gadu atpakaļ
pdf_parse_by_ocr.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 gadu atpakaļ
pdf_parse_by_txt.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 gadu atpakaļ
pdf_parse_union_core_v2.py d4345b6e39 refactor(pdf_parse): adjust character-axis alignment algorithm 11 mēneši atpakaļ
user_api.py 309be741e8 refactor(txt_parse): improve text extraction accuracy with new algorithm 1 gadu atpakaļ