Commit History

Author SHA1 Message Date
  ciaran fb6cb8b048 Update pdf_extract_kit.py 1 year ago
  myhloli 377b09cf8c refactor(table): disable StructEqTable support and add TableMaster support 1 year ago
  liukaiwen 4949408c9d perf: table model update with PP OCRv4 1 year ago
  liukaiwen 7d2dfc8091 Merge branch 'dev' into dev-table-model-update 1 year ago
  liukaiwen a0eff3be5c feat: table model update with paddle recognition v4 1 year ago
  Kaiwen Liu 6d571e2e2c Merge pull request #7 from opendatalab/dev 1 year ago
  myhloli 1807126e7f refactor(ocr): adjust OCR processing parameters 1 year ago
  myhloli ce72cf05cb refactor(magic_pdf): adjust confidence threshold for DocLayout_YOLO model 1 year ago
  myhloli 1279f2cd0f feat(model): add support for DocLayout-YOLO model 1 year ago
  liukaiwen 51f56aa32f feat: merge formula update 1 year ago
  myhloli 011a1b973b refactor(ocr): increase the dilation factor in OCR to address the issue of word concatenation 1 year ago
  myhloli 1f1dd3538d feat(list&index block): detect and merge list and index blocks 1 year ago
  liukaiwen a3358878b3 feat: merge formula update 1 year ago
  myhloli fb9949c44f perf(pdf_extract_kit): conditional memory cleanup based on GPU capacity 1 year ago
  myhloli be1b1ae7fc refactor(model): improve timing information and performance 1 year ago
  myhloli 4c9bf8abd5 refactor(memory management): remove unused clean_memory function 1 year ago
  myhloli f2a3a49541 fix(pdf_extract_kit): change unimernet base -> small 1 year ago
  myhloli 4811a3d1df fix(pdf-extract): ensure model is set to evaluation mode before processing 1 year ago
  myhloli c36fa049b0 refactor(pdf_extract): use Image.crop directly with layout detection 1 year ago
  myhloli a4c72e2e33 fix: solve conflicts 1 year ago
  quyuan f9df92aa34 fix: add magic-pdf-dev case 1 year ago
  drunkpig 554048086e Release 0.8.0 (#587) 1 year ago
  drunkpig 9f352df0eb Release 0.8.0 (#586) 1 year ago
  Xiaomeng Zhao 73f66af9bd Merge pull request #584 from myhloli/update-unimernet-to-0.2.0 1 year ago
  myhloli 4f340c4429 refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 1 year ago
  myhloli 4b372f3f7e feat(ocr): pass language parameter for custom model init 1 year ago
  Xiaomeng Zhao aac9109414 refactor(pdf_extract_kit): implement singleton pattern for atomic models (#533) 1 year ago
  yyy d714ac8b76 Release: Release 0.7.1 version, update dev (#527) 1 year ago
  yyy 1dc915a4a9 release: release 0.7.1 version (#526) 1 year ago
  Xiaomeng Zhao 041b9465b9 fix(pdf-extract): adjust box threshold for OCR detection (#447) 1 year ago