drunkpig 18e65be489 fix: delete hyphen at end of line hai 1 ano
..
dict2md 18e65be489 fix: delete hyphen at end of line hai 1 ano
filter df14c61f6f update: Enhance the capability to detect garbled document issues hai 1 ano
integrations b72d4ebd94 Feat/support rag (#510) hai 1 ano
layout d5dbed7325 目录重构 hai 1 ano
libs c9a51491a4 feat: rename the file generated by command line tools (#401) hai 1 ano
model 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) hai 1 ano
para 65c3ac66ae <fix>(para_split_v2): index out of range issue of span_text first char (#396) hai 1 ano
pipe 0f91fcf61f feat(cli&analyze&pipeline): add start_page and end_page args for pagination (#507) hai 1 ano
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 hai 1 ano
pre_proc 9067cd31ca fix(detect_all_bboxes): remove small overlapping blocks by merging (#501) hai 1 ano
resources 37925f36d9 feat(model inference): add table recognition and conversion to LaTeX (#284) hai 1 ano
rw 40e0827e60 Feat/impl cli (#264) hai 1 ano
spark c9af3457f5 delete useless files hai 1 ano
tools b72d4ebd94 Feat/support rag (#510) hai 1 ano
__init__.py d5dbed7325 目录重构 hai 1 ano
pdf_parse_by_ocr.py 959b8d82d8 renamed pipeline file name hai 1 ano
pdf_parse_by_txt.py 959b8d82d8 renamed pipeline file name hai 1 ano
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) hai 1 ano
user_api.py 0f91fcf61f feat(cli&analyze&pipeline): add start_page and end_page args for pagination (#507) hai 1 ano