Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| myhloli | b0fd756625 | refactor: update OCR handling and improve function parameters for clarity | 5 months ago |
| myhloli | 9bb257769c | refactor: reorganize imports to align with backend structure and improve clarity | 5 months ago |
| myhloli | a3ae57bf20 | refactor: streamline text span extraction and remove unused functions | 5 months ago |
| myhloli | 546be00aac | refactor: update OCR score handling to filter low-confidence results | 5 months ago |
| myhloli | 3334157f15 | refactor: clean up unused OCR area calculation and update demo PDF path | 5 months ago |
| myhloli | 236a6033f1 | refactor: improve block processing logic and enhance span handling | 5 months ago |
| myhloli | 7d4ce0c380 | refactor: add LLM-aided title optimization and improve config handling | 5 months ago |
| myhloli | d2de6d801a | refactor: update text span extraction to use new version and improve character handling | 5 months ago |
| myhloli | 1ed61cb5d6 | refactor: update OCR span extraction logic and improve PDF page processing | 5 months ago |
| myhloli | 51393aa814 | refactor: update union_make import and adjust middle JSON structure for consistency | 5 months ago |
| myhloli | a9abb4e201 | refactor: enhance OCR processing and paragraph splitting in document analysis pipeline | 5 months ago |
| myhloli | 0f21495a06 | refactor: enhance block processing and sorting utilities for improved span management | 5 months ago |
| myhloli | ae7b0a6eba | refactor: implement block preprocessing utilities for improved bounding box management | 5 months ago |
| myhloli | 8f1f9abec5 | refactor: enhance bounding box utilities and add configuration reader for S3 integration | 5 months ago |
| myhloli | ea5cb65a1f | refactor: enhance document parsing by supporting multiple PDF files and improving method organization | 5 months ago |