|
|
+<td>PP-DocBee2 is a multimodal large model developed by the PaddlePaddle team and designed specifically for document understanding. Building on PP-DocBee, the team further optimized the base model and introduced a new data optimization scheme to improve data quality. Using only a relatively small dataset of 470,000 samples generated with the team's proprietary data synthesis strategy, PP-DocBee2 demonstrates superior performance on Chinese document understanding tasks. On internal business metrics for Chinese-language scenarios, PP-DocBee2 improves on PP-DocBee by approximately 11.4%, and it also outperforms currently popular open-source and closed-source models of a similar scale.</td>
|