|
|
@@ -42,6 +42,13 @@
|
|
|
</div>
|
|
|
|
|
|
# Changelog
|
|
|
+- 2025/01/06 1.0.0 released. This is our first official release, where we have introduced a completely new API interface and enhanced compatibility through extensive refactoring:
|
|
|
+ - New API Interface
|
|
|
+ - For the data-side API, we have introduced the Dataset class, designed to provide a robust and flexible data processing framework. This framework currently supports a variety of document formats, including images (.jpg and .png), PDFs, Word documents (.doc and .docx), and PowerPoint presentations (.ppt and .pptx). It ensures effective support for data processing tasks ranging from simple to complex.
|
|
|
+ - For the user-side API, we have meticulously designed the MinerU processing workflow as a series of composable Stages. Each Stage represents a specific processing step, allowing users to define new Stages according to their needs and creatively combine these stages to customize their data processing workflows.
|
|
|
+ - Enhanced Compatibility
|
|
|
+ - By optimized the dependency environment, we ensure stable and efficient operation on ARM architecture Linux systems.
|
|
|
+ - We have deeply integrated with Huawei Ascend NPU acceleration, providing autonomous and controllable high-performance computing capabilities. This supports the localization and development of AI application platforms in China.
|
|
|
- 2024/11/22 0.10.0 released. Introducing hybrid OCR text extraction capabilities,
|
|
|
- Significantly improved parsing performance in complex text distribution scenarios such as dense formulas, irregular span regions, and text represented by images.
|
|
|
- Combines the dual advantages of accurate content extraction and faster speed in text mode, and more precise span/line region recognition in OCR mode.
|
|
|
@@ -248,7 +255,7 @@ You can modify certain configurations in this file to enable or disable features
|
|
|
{
|
|
|
// other config
|
|
|
"layout-config": {
|
|
|
- "model": "layoutlmv3" // Please change to "doclayout_yolo" when using doclayout_yolo.
|
|
|
+ "model": "doclayout_yolo" // Please change to "layoutlmv3" when using layoutlmv3.
|
|
|
},
|
|
|
"formula-config": {
|
|
|
"mfd_model": "yolo_v8_mfd",
|
|
|
@@ -288,20 +295,21 @@ If your device supports CUDA and meets the GPU requirements of the mainline envi
|
|
|
### Using NPU
|
|
|
|
|
|
If your device has NPU acceleration hardware, you can follow the tutorial below to use NPU acceleration:
|
|
|
+
|
|
|
[Ascend NPU Acceleration](docs/README_Ascend_NPU_Acceleration_zh_CN.md)
|
|
|
|
|
|
## Usage
|
|
|
|
|
|
### Command Line
|
|
|
|
|
|
-[Using MinerU via Command Line](https://mineru.readthedocs.io/en/latest/user_guide/quick_start/command_line.html)
|
|
|
+[Using MinerU via Command Line](https://mineru.readthedocs.io/en/latest/user_guide/usage/command_line.html)
|
|
|
|
|
|
> [!TIP]
|
|
|
> For more information about the output files, please refer to the [Output File Description](docs/output_file_en_us.md).
|
|
|
|
|
|
### API
|
|
|
|
|
|
-[Using MinerU via Python API](https://mineru.readthedocs.io/en/latest/user_guide/quick_start/to_markdown.html)
|
|
|
+[Using MinerU via Python API](https://mineru.readthedocs.io/en/latest/user_guide/usage/api.html)
|
|
|
|
|
|
|
|
|
### Deploy Derived Projects
|