Commit history

Author SHA1 Message Date
  myhloli f674b8d413 refactor(para): improve language detection and block splitting 1 year ago
  myhloli 160624bd36 refactor(para): improve block merging logic in para_split_v3.py 1 year ago
  myhloli 5d6cbcb123 refactor(para): improve line stop flag and remove unused debug mode 1 year ago
  icecraft b492c19c4c refactor: move some constants or enums defs to config folder 1 year ago
  myhloli 69805f4ba9 refactor(para): adjust right margin threshold based on block width 1 year ago
  myhloli 517fbe5bf4 refactor(para): improve paragraph splitting logic 1 year ago
  hyastar 220a24cd4c Update para_split_v3.py 1 year ago
  myhloli cf0d76c094 feat(para_split_v3): improve list identification with block aspect ratio 1 year ago
  myhloli 2bf6c26871 feat(list): improve list detection algorithm - Add center_close_num and external_sides_not_close_num variables to analyze line positioning 1 year ago
  myhloli a8f2e7d6c4 fix(list): improve list identification accuracy - Adjust the threshold for determining right-side spacing to 0.26 * block_weight 1 year ago
  myhloli 8cc76c4921 refactor(para): improve paragraph splitting algorithm 1 year ago
  myhloli 81b9fd7bdb refactor(para_split_v3): refine list block detection in paragraph splitting 1 year ago
  myhloli 244b868443 fix(split_v3): Fix the rule adaptation for some special list samples. 1 year ago
  myhloli fdcb49d327 refactor(para_split_v3): merge list and index block detection 1 year ago
  myhloli 1f1dd3538d feat(list&index block): detect and merge list and index blocks 1 year ago
  myhloli 7b42d5a0c4 fix: Solving the Grouping Anomaly Issue with Multiple Consecutive Non-Text Blocks 1 year ago
  myhloli 6f63e70e94 feat(pdf_parse_union_core_v2): reintegrate para_split_v3 and add page range support 1 year ago