myhloli 2b6e94424c refactor: comment out clean_memory function call vor 11 Monaten
..
config 87af738ab1 fix: 1. ocr txt mode error 2. lose pdf_parse_type field vor 1 Jahr
data 113448903a fix: unicode decode error vor 1 Jahr
dict2md 74ee428bbb fix(dict2md): add space for inline equations in CJK contexts vor 1 Jahr
filter a3a720ea87 refactor: isolate inference and pipeline vor 1 Jahr
integrations b492c19c4c refactor: move some constants or enums defs to config folder vor 1 Jahr
libs f6bd47de6a docs: add dataset method description vor 1 Jahr
model a296ea41f9 refactor(magic_pdf): optimize environment setup and dependencies vor 1 Jahr
para 41545a13c6 refactor(para): adjust line height multiplier for block splitting vor 1 Jahr
pipe f6bd47de6a docs: add dataset method description vor 1 Jahr
pre_proc 7f8dc353b0 fix(pre_proc): prevent errors when imageWriter is None vor 1 Jahr
resources 240fe99e3c feat(table): integrate RapidTable model for table recognition vor 1 Jahr
rw 2db3c26374 refactor(libs): remove unused imports and functions vor 1 Jahr
spark b492c19c4c refactor: move some constants or enums defs to config folder vor 1 Jahr
tools 4a82d6a07a feat: add function definitions vor 1 Jahr
utils 9cda7051c6 add init to magic_pdf.utils vor 1 Jahr
__init__.py d5dbed7325 目录重构 vor 1 Jahr
pdf_parse_by_ocr.py a3a720ea87 refactor: isolate inference and pipeline vor 1 Jahr
pdf_parse_by_txt.py a3a720ea87 refactor: isolate inference and pipeline vor 1 Jahr
pdf_parse_union_core_v2.py 2b6e94424c refactor: comment out clean_memory function call vor 11 Monaten
user_api.py 87af738ab1 fix: 1. ocr txt mode error 2. lose pdf_parse_type field vor 1 Jahr