myhloli
|
eeda90af31
fix(pdf_parse): improve span removal logic for all content types
|
1 tahun lalu |
myhloli
|
6b9f816f9e
fix(pdf_parse): optimize span processing by removing outside spans
|
1 tahun lalu |
myhloli
|
4cf7e9a224
refactor(pdf_parse): adjust block splitting logic for wide blocks
|
1 tahun lalu |
myhloli
|
c34c9d21ef
refactor(ocr): improve image and table block handling
|
1 tahun lalu |
icecraft
|
283b597a6e
feat: add [figure | table] match [caption | footnote] match algorithm v2
|
1 tahun lalu |
myhloli
|
7e301b849b
refactor(pdf): adjust span filling threshold in block construction
|
1 tahun lalu |
myhloli
|
6f63e70e94
feat(pdf_parse_union_core_v2): reintegrate para_split_v3 and add page range support
|
1 tahun lalu |
myhloli
|
ded2818ac2
feat(layoutreader): support local model directory and improve model loading
|
1 tahun lalu |
myhloli
|
a71db70314
feat: add arXiv paper link to header and adjust PDF parsing logic- Add arXiv paper link to the header template for easy access to the latest research paper.
|
1 tahun lalu |
myhloli
|
564c4ce1e3
refactor(magic_pdf): improve line sorting and block indexing
|
1 tahun lalu |
myhloli
|
4c9bf8abd5
refactor(memory management): remove unused clean_memory function
|
1 tahun lalu |
myhloli
|
42a7d792c3
refactor(magic_pdf): import model helpers directly for clarity
|
1 tahun lalu |
myhloli
|
5522d0a36c
refactor(pdf_parse_union_core_v2): update import paths to use new package structure
|
1 tahun lalu |
myhloli
|
2145a8b6d2
fix(pdf_parse): handle blocks without lines and enable bf16 on compatible devices
|
1 tahun lalu |
myhloli
|
177ab08e9f
refactor(pdf_parse): remove redundant sorting and optimize block indexing
|
1 tahun lalu |
myhloli
|
b9dfdea3cb
refactor(pdf_parse_union_core_v2): implement model initialization within classRefactored model initialization to be handled by a singleton class to ensure that model
|
1 tahun lalu |
myhloli
|
b2790f6f45
refactor(drawing): simplify draw bbox functions and adjust debug config
|
1 tahun lalu |
myhloli
|
34f8965007
refactor(draw_bbox): add line sorting visualization
|
1 tahun lalu |
myhloli
|
1efebe421c
refactor(pdf_parse_union): integrate LayoutLMv3 for block orderingReplace the heuristic-based block ordering algorithm with LayoutLMv3 model predictions toimprove the accuracy of block ordering on PDF pages. Additionally, refactor the span
|
1 tahun lalu |