소스 검색

update docs

myhloli 2 달 전
부모
커밋
5425da9571
2개의 변경된 파일34개의 추가작업 그리고 31개의 파일을 삭제
  1. 12 13
      docs/en/reference/output_files.md
  2. 22 18
      docs/zh/reference/output_files.md

+ 12 - 13
docs/en/reference/output_files.md

@@ -520,18 +520,17 @@ Text levels are distinguished through the `text_level` field:
 Structure is broadly similar to the pipeline backend, but with these differences:
 
 - `list` becomes a second‑level block, a new field `sub_type` distinguishes list categories:
-  * `text`: ordinary list
-  * `ref_text`: reference / bibliography style list
-- New `code` block type with `sub_type`:
-  * `code`
-  * `algorithm`
-- A code block always has at least a `code_body`, it may optionally have a `code_caption`.
+    * `text`: ordinary list
+    * `ref_text`: reference / bibliography style list
+- New `code` block type with `sub_type`(a code block always has at least a `code_body`, it may optionally have a `code_caption`):
+    * `code`
+    * `algorithm`
 - `discarded_blocks` may contain additional types: 
-  * `header`
-  * `footer`
-  * `page_number`
-  * `aside_text`
-  * `page_footnote`
+    * `header`
+    * `footer`
+    * `page_number`
+    * `aside_text`
+    * `page_footnote`
 - All blocks include an `angle` field indicating rotation (one of `0, 90, 180, 270`).
 
 ##### Examples
@@ -639,9 +638,9 @@ Structure is broadly similar to the pipeline backend, but with these differences
 Based on the pipeline format, with these VLM-specific extensions:
 
 - New `code` type with `sub_type` (`code` | `algorithm`):
-  * Fields: `code_body` (string), optional `code_caption` (list of strings)
+    * Fields: `code_body` (string), optional `code_caption` (list of strings)
 - New `list` type with `sub_type` (`text` | `ref_text`):
-  * Field: `list_items` (array of strings)
+    * Field: `list_items` (array of strings)
 - All `discarded_blocks` entries are also output (e.g., headers, footers, page numbers, margin notes, page footnotes).
 - Existing types (`image`, `table`, `text`, `equation`) remain unchanged.
 - `bbox` still uses the 0–1000 normalized coordinate mapping.

+ 22 - 18
docs/zh/reference/output_files.md

@@ -537,17 +537,19 @@ inference_result: list[PageInferenceResults] = []
 vlm 后端的 middle.json 文件结构与 pipeline 后端类似,但存在以下差异: 
 
 - list变成二级block,增加`sub_type`字段区分list类型:
-  * `text`(文本类型)
-  * `ref_text`(引用类型)
+    * `text`(文本类型)
+    * `ref_text`(引用类型)
+
 - 增加code类型block,code类型包含两种"sub_type":
-  * 分别是"code"和"algorithm"
-  * 至少有code_body,可选code_caption
+    * 分别是`code`和`algorithm`
+    * 至少有`code_body`, 可选`code_caption`
+
 - `discarded_blocks`内元素type增加以下类型:
-  * `header`(页眉)
-  * `footer`(页脚)
-  * `page_number`(页码)
-  * `aside_text`(装订线文本)
-  * `page_footnote`(脚注)
+    * `header`(页眉)
+    * `footer`(页脚)
+    * `page_number`(页码)
+    * `aside_text`(装订线文本)
+    * `page_footnote`(脚注)
 - 所有block增加`angle`字段,用来表示旋转角度,0,90,180,270
 
 
@@ -726,17 +728,19 @@ vlm 后端的 middle.json 文件结构与 pipeline 后端类似,但存在以
 vlm 后端的 content_list.json 文件结构与 pipeline 后端类似,伴随本次middle.json的变化,做了以下调整: 
 
 - 新增`code`类型,code类型包含两种"sub_type":
-  * 分别是"code"和"algorithm"
-  * 至少有code_body, 可选code_caption
+    * 分别是`code`和`algorithm`
+    * 至少有`code_body`, 可选`code_caption`
+  
 - 新增`list`类型,list类型包含两种"sub_type":
-  * `text`
-  * `ref_text` 
+    * `text`
+    * `ref_text` 
+
 - 增加所有所有`discarded_blocks`的输出内容
-  * `header`
-  * `footer`
-  * `page_number`
-  * `aside_text`
-  * `page_footnote`
+    * `header`
+    * `footer`
+    * `page_number`
+    * `aside_text`
+    * `page_footnote`
 
 ##### 示例数据
 - code 类型 content