Историја ревизија

Аутор SHA1 Порука Датум
  myhloli 058d318491 feat(pdf_parse): add footnote block handling in layout split пре 7 месеци
  myhloli ea730ae2e9 refactor(ocr): improve OCR score precision to three decimal places пре 7 месеци
  myhloli 795233d1bb refactor(magic_pdf): remove OCR timing measurement code пре 8 месеци
  myhloli 553f250fc7 refactor(magic_pdf): optimize code and improve logging пре 8 месеци
  myhloli a024c30fc4 feat(ocr): implement dynamic OCR processing for text spans with low contrast пре 8 месеци
  myhloli 3cb156f549 fix(pdf_parse_union_core_v2): suppress FutureWarning from transformers пре 8 месеци
  myhloli 59d6b195b0 refactor(model): integrate AtomModelSingleton for OCR and improve OCR result handling пре 8 месеци
  myhloli a330651d64 feat(ocr): implement separate detection and recognition processes пре 8 месеци
  myhloli 72e66c2d1e refactor(pdf_parse): adjust line calculation for block height пре 8 месеци
  myhloli 71efb101dc refactor(pdf_parse): adjust line calculation for block height пре 8 месеци
  myhloli 3f2bafa88f feat(pre_proc): add function to remove x-overlapping characters in spans пре 8 месеци
  myhloli 7210f7a65a perf(model): enable bfloat16 for layoutreader on supported devices пре 8 месеци
  myhloli cf4ea78dac refactor: remove torchtext deprecation warning handling пре 8 месеци
  myhloli af27c0cc81 refactor(magic_pdf): support mps device and optimize image processing пре 8 месеци
  myhloli 6bfc17119d refactor(pdf_parse): comment out performance measurement and logging пре 9 месеци
  myhloli e516cf535c feat(performance): add performance monitoring and optimization пре 9 месеци
  myhloli 6ec440d6f1 feat(pdf_parse): implement multi-threaded page processing пре 9 месеци
  myhloli 0a246f0f40 refactor(magic_pdf): simplify device selection in model initialization пре 9 месеци
  myhloli 9b00f988ac refactor(magic_pdf): remove bfloat16 support checks and usage пре 9 месеци
  myhloli 30bd3a83c7 fix(pdf_parse): Fixed the issue where some headings were missing in certain complex layouts. пре 9 месеци
  myhloli 5561ac9555 fix(pdf_parse): improve image processing and OCR accuracy пре 9 месеци
  myhloli 9f18ca2019 feat(pdf_parse): improve OCR processing and contrast filtering пре 9 месеци
  myhloli 10e848b39d feat(pdf_parse_union_core_v2): add timing log for LLM aided processes пре 10 месеци
  myhloli 1d08865f4a refactor(pdf_parse): uncomment char bbox validation logic пре 10 месеци
  myhloli ba6c17a9d9 feat(pdf_parse): remove tilted lines for better text extraction пре 10 месеци
  myhloli 8570e006f8 refactor(magic_pdf): improve title block merging logic пре 10 месеци
  Xiaomeng Zhao 206fcb3900 Merge pull request #1537 from myhloli/doclayoutyolo-fix пре 10 месеци
  myhloli c20e9a1e84 feat(layout): improve title block handling and layout detection пре 10 месеци
  Xiaomeng Zhao 9f12c39817 Update pdf_parse_union_core_v2.py пре 10 месеци
  myhloli aaff1a2616 fix(llm_aided): add enable flag check for LLM aided optimizations пре 10 месеци