https://github.com/opendatalab/MinerU.git

myhloli c96aa88d13 Merge pull request #119 from icecraft/feat/parallel_paddle 1 year ago
.github c69f414b20 update pypi upload logic 1 year ago
demo 4adc761b2e remove old demo 1 year ago
magic_pdf 738f9274a9 feat: parallelize paddle 1 year ago
others c9c14beab3 update readme 1 year ago
tests b7a2f547bb skip case 1 year ago
tools 484b33044f add case 1 year ago
.gitignore 016cde3ece fix init error 1 year ago
LICENSE.md 9fe81795bc Create LICENSE.md 1 year ago
README.md a0e46724f0 Update README.md 1 year ago
magic-pdf.template.json 02d805ea9b add location of refactored functions 1 year ago
requirements.txt ce0d99057a use fast_langdetect replace cld2 1 year ago
setup.py 9b5b116369 fix: change garbled_rate 0.1 -> 0.02 1 year ago
update_version.py 7fd8d97edb fix error: version is 0.0.0 1 year ago

README.md

Magic-PDF

Convert PDF files to Markdown documents conveniently and accurately.

Getting Started

Prerequisites

Python 3.9+
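
The version requirement above can be checked before installing anything. The following is a minimal sketch, not part of the project; the helper name `python_is_supported` is hypothetical, and the only fact taken from this README is the 3.9 minimum.

```python
import sys

# Minimum version taken from the stated requirement "Python 3.9+".
MIN_PYTHON = (3, 9)


def python_is_supported(version_info=None):
    """Return True if the given (or running) interpreter is at least MIN_PYTHON.

    Hypothetical helper for illustration; it is not provided by magic_pdf.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); patch level does not matter here.
    return tuple(version_info[:2]) >= MIN_PYTHON


if __name__ == "__main__":
    found = sys.version.split()[0]
    if python_is_supported():
        print(f"Python {found} meets the 3.9+ requirement")
    else:
        print(f"Python {found} is too old; 3.9+ is required")
```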

Installation

1. Clone the repo

git clone https://github.com/magicpdf/Magic-PDF.git

2. Install the requirements

cd Magic-PDF
pip install -r requirements.txt

3. Run the command line

Linux/macOS:
export PYTHONPATH=.
Windows (PowerShell):
$env:PYTHONPATH += ";.\Magic-PDF\magic_pdf"
Then, on either platform:
python magic_pdf/cli/magicpdf.py --help
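
For a platform-independent way to do the same thing, the environment variable and the command can be assembled from Python. This is only a sketch under stated assumptions: `build_cli_invocation` is a hypothetical helper, not part of magic_pdf, and it assumes the repository root should be on `PYTHONPATH`, mirroring the Linux/macOS line above. The script path and the `--help` flag are the only details taken from this README.

```python
import os
import subprocess
import sys
from pathlib import Path


def build_cli_invocation(repo_root, extra_args=("--help",)):
    """Return (command, env) for running the CLI from a checkout at repo_root.

    Hypothetical helper: it prepends the repository root to PYTHONPATH
    (an assumption based on `export PYTHONPATH=.`) and targets the script
    path given in the README.
    """
    repo_root = Path(repo_root).resolve()
    script = repo_root / "magic_pdf" / "cli" / "magicpdf.py"

    env = dict(os.environ)
    existing = env.get("PYTHONPATH")
    # os.pathsep is ":" on Linux/macOS and ";" on Windows.
    env["PYTHONPATH"] = (
        str(repo_root) if not existing else str(repo_root) + os.pathsep + existing
    )

    command = [sys.executable, str(script), *extra_args]
    return command, env


if __name__ == "__main__":
    cmd, env = build_cli_invocation(".")
    if Path(cmd[1]).exists():
        # Run from the current directory, assumed to be the repository root.
        subprocess.run(cmd, env=env, check=False)
    else:
        print(f"CLI script not found at {cmd[1]}; run this from the repository root")
```

Using `sys.executable` keeps the CLI on the same interpreter (and virtual environment) that has the requirements installed.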

License

LICENSE.md

Acknowledgements