| .. |
|
config
|
20438bd2b7
feat(language-detection): add YOLOv11 language detection model
|
hai 11 meses |
|
data
|
d2fc9dabf4
refactor(model): optimize batch processing and inference
|
hai 7 meses |
|
dict2md
|
c46d3373de
refactor(ocr_mkcontent): improve title level handling and formatting
|
hai 8 meses |
|
filter
|
a5342950f6
fix(filter): toggle invalid character detection method
|
hai 9 meses |
|
integrations
|
b492c19c4c
refactor: move some constants or enums defs to config folder
|
hai 1 ano |
|
libs
|
0c9572c871
Update version.py with new version
|
hai 7 meses |
|
model
|
07edefaa7d
feat(model): add text region handling and improve overlap resolution
|
hai 7 meses |
|
operators
|
52efe94da8
feat(api): simplify markdown and content list generation
|
hai 10 meses |
|
post_proc
|
842483ccb3
refactor(magic_pdf): improve paragraph splitting logic and update dependencies
|
hai 8 meses |
|
pre_proc
|
058d318491
feat(pdf_parse): add footnote block handling in layout split
|
hai 7 meses |
|
resources
|
e327e9bad5
fix(table): add model path for slanet-plus to resolve RapidTableError
|
hai 7 meses |
|
spark
|
b492c19c4c
refactor: move some constants or enums defs to config folder
|
hai 1 ano |
|
tools
|
54ce594bf6
refactor(tools): improve code readability and maintainability
|
hai 7 meses |
|
utils
|
2e5e55cfe2
refactor(office_to_pdf): simplify font checking and add logging
|
hai 7 meses |
|
__init__.py
|
d5dbed7325
目录重构
|
hai 1 ano |
|
pdf_parse_union_core_v2.py
|
058d318491
feat(pdf_parse): add footnote block handling in layout split
|
hai 7 meses |