| .. |
|
__init__.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
citationmarker_remove.py
|
8411c91029
fix(magic_pdf): prevent division by zero in citationmarker removal
|
1 anno fa |
|
construct_page_dict.py
|
442f36845f
fix complicated layout logic
|
1 anno fa |
|
cut_image.py
|
c5b27057ea
切图逻辑修复
|
1 anno fa |
|
detect_equation.py
|
bc339320ab
增加了一个solve_line_alien.py,在detect_equation.py中修改了__solve_contain_bboxs函数,并在pdf_parse_by_model.py里增加了函数solve_line_alien.py的调用
|
1 anno fa |
|
detect_footer_by_model.py
|
f62d1aa781
将模型和pymu坐标的转换逻辑抽象成方法
|
1 anno fa |
|
detect_footer_header_by_statistics.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
detect_footnote.py
|
0e2d0b8b4f
parse_pdf_by_ocr 和 cut_image 重构,使用抽象类进行写出操作
|
1 anno fa |
|
detect_header.py
|
f62d1aa781
将模型和pymu坐标的转换逻辑抽象成方法
|
1 anno fa |
|
detect_images.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
detect_page_number.py
|
f62d1aa781
将模型和pymu坐标的转换逻辑抽象成方法
|
1 anno fa |
|
detect_tables.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
equations_replace.py
|
7dcf63e69c
fix:close some log output if not in debug mode
|
1 anno fa |
|
fix_image.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
fix_table.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
main_text_font.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
ocr_detect_all_bboxes.py
|
9067cd31ca
fix(detect_all_bboxes): remove small overlapping blocks by merging (#501)
|
1 anno fa |
|
ocr_detect_layout.py
|
6396910992
移动modify_y_axis在pipeline中的位置
|
1 anno fa |
|
ocr_dict_merge.py
|
03469909bb
Feat/support footnote in figure (#532)
|
1 anno fa |
|
ocr_span_list_modify.py
|
5f992de4da
fix(magic_pdf): prevent removal of low-confidence spans already dropped
|
1 anno fa |
|
pdf_pre_filter.py
|
1b9d65b3d3
1、Trace类的key增加前置下划线
|
1 anno fa |
|
post_layout_split.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
remove_bbox_overlap.py
|
d59981499d
fix: error return typo
|
1 anno fa |
|
remove_colored_strip_bbox.py
|
d5dbed7325
目录重构
|
1 anno fa |
|
remove_footer_header.py
|
d5ea44f944
按照统一格式组合文本型pdf的解析结果
|
1 anno fa |
|
remove_rotate_bbox.py
|
d5ea44f944
按照统一格式组合文本型pdf的解析结果
|
1 anno fa |
|
resolve_bbox_conflict.py
|
deb98fd0b1
fix footnote overlap error
|
1 anno fa |
|
solve_line_alien.py
|
bc339320ab
增加了一个solve_line_alien.py,在detect_equation.py中修改了__solve_contain_bboxs函数,并在pdf_parse_by_model.py里增加了函数solve_line_alien.py的调用
|
1 anno fa |
|
statistics.py
|
d5dbed7325
目录重构
|
1 anno fa |