Historique des commits

Auteur SHA1 Message Date
  myhloli 99cf160d1c fix(dict2md): improve text concatenation logic il y a 1 an
  myhloli 87b9eeee59 fix(ocr): handle inline equations consistently with text content il y a 1 an
  myhloli 7c03014c2a fix(ocr_mkcontent): improve content handling for different languages and equation types- Adjust content formatting for Chinese, Japanese, Korean, and Western languages il y a 1 an
  myhloli faf8c286fb fix(magic_pdf): handle missing image_path in spans il y a 1 an
  myhloli 0e8d5893eb feat(draw_bbox): update bounding box drawing for tables and images il y a 1 an
  myhloli c34c9d21ef refactor(ocr): improve image and table block handling il y a 1 an
  myhloli 644085760b fix(ocr_mkcontent): expand para_to_standard_format_v2 to handle list and index blocks il y a 1 an
  myhloli fc49f5c446 refactor(magic_pdf): remove unused parameters and simplify functions il y a 1 an
  myhloli 011a1b973b refactor(ocr):Increase the dilation factor in OCR to address the issue of word concatenation. il y a 1 an
  myhloli 1f1dd3538d feat(list&index block): detect and merge list and index blocks il y a 1 an
  Xiaomeng Zhao 98313d4a25 Merge branch 'dev' into content-list-not-drop il y a 1 an
  myhloli 16699a9a70 fix(ocr_mkcontent): streamline drop reason handling il y a 1 an
  myhloli 196de029a3 fix(ocr_mkcontent): correct drop mode handling for pages with drop reasons il y a 1 an
  myhloli 37fbe998ac feat(ocr_mkcontent): support drop reason in none_with_reason modeEnable the `NONE_WITH_REASON` drop mode in `para_to_standard_format_v2` by updating the il y a 1 an
  myhloli 6062862c96 feat(pipeline): pass language parameter for parsing and markdown conversion il y a 1 an
  icecraft 03469909bb Feat/support footnote in figure (#532) il y a 1 an
  yyy d714ac8b76 Release: Release 0.7.1 verison, update dev (#527) il y a 1 an
  drunkpig 18e65be489 fix: delete hyphen at end of line il y a 1 an
  drunkpig 83e0d55a34 fix: replace \u0002, \u0003 in common text (#521) il y a 1 an
  Xiaomeng Zhao dd19f59eb6 fix(ocr_mkcontent): revise table caption output (#397) il y a 1 an
  Xiaomeng Zhao 66e3ce9c4a fix(ocr_mkcontent): improve language detection and content formatting (#458) il y a 1 an
  liukaiwen ec7271faee fix table recognition bug#321 il y a 1 an
  myhloli 0998d22a32 fix(ocr_mkcontent): add spaces around inline equation in content il y a 1 an
  Kaiwen Liu 37925f36d9 feat(model inference): add table recognition and conversion to LaTeX (#284) il y a 1 an
  myhloli a5c35165ee feat(dict2md): add page index to para content for standard format v2 il y a 1 an
  myhloli ff13c8e115 fix(mkmarkdown): add 2 space after image and table URLs il y a 1 an
  赵小蒙 5de013e6d5 fix:use line_lang instead of content_lang to concatenate para il y a 1 an
  赵小蒙 6199e608d4 add union_make logic il y a 1 an
  liukaiwen 503b9fad3e 解决标题后空格丢失 il y a 1 an
  赵小蒙 f01cb89f01 fix lost image or table bug il y a 1 an