refactor(magic_pdf): update invalid character detection logic

- Uncomment detect_invalid_chars_by_pymupdf function call
- Comment out detect_invalid_chars function call
myhloli, 9 months ago
Commit: 5aa809ff14
1 file changed, 2 insertions and 2 deletions

+ 2 - 2
magic_pdf/filter/pdf_meta_scan.py

@@ -323,8 +323,8 @@ def get_language(doc: fitz.Document):
 
 def check_invalid_chars(pdf_bytes):
     """Garbled-text detection."""
-    # return detect_invalid_chars_by_pymupdf(pdf_bytes)
-    return detect_invalid_chars(pdf_bytes)
+    return detect_invalid_chars_by_pymupdf(pdf_bytes)
+    # return detect_invalid_chars(pdf_bytes)
 
 
 def pdf_meta_scan(pdf_bytes: bytes):
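
For readers unfamiliar with what a garbled-text check of this kind might look at, the following is a minimal sketch of one common heuristic: measuring the share of Unicode replacement characters (U+FFFD) in text that has already been extracted from a PDF. It is an illustration under stated assumptions, not the code behind `detect_invalid_chars_by_pymupdf` or `detect_invalid_chars`; the names `invalid_char_ratio` and `text_looks_valid` and the 5% threshold are hypothetical.

```python
# Hypothetical illustration only: not the implementation in pdf_meta_scan.py.
REPLACEMENT_CHAR = "\ufffd"  # Unicode replacement character, used for unmappable glyphs


def invalid_char_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that are U+FFFD."""
    visible = [ch for ch in text if not ch.isspace()]
    if not visible:
        return 0.0
    return sum(ch == REPLACEMENT_CHAR for ch in visible) / len(visible)


def text_looks_valid(text: str, threshold: float = 0.05) -> bool:
    """Return True when the replacement-character share is at or below threshold.

    The 5% default is an assumed value chosen for this sketch.
    """
    return invalid_char_ratio(text) <= threshold
```

A caller would pass in text sampled from a few pages and treat a `False` result as a sign that the PDF's text layer is unreliable and may need OCR instead.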