Join our Discord
简体中文 | English
RapidOCR is a completely open-source, free OCR tool that supports multi-platform, multi-language operation and rapid offline deployment. Its core advantages lie in extreme speed and extensive compatibility.
Supported Languages: Default support for Chinese and English recognition. For other supported languages, please refer to the documentation: Model List.
Project Origin: Considering that PaddleOCR still has room for optimization in engineering aspects, we innovatively converted the models in PaddleOCR into the highly compatible ONNX format to simplify and accelerate the inference deployment of OCR models on various terminal devices. Furthermore, we achieved seamless cross-platform porting based on multiple programming languages such as Python, C++, Java, and C#, enabling developers to get started easily and integrate efficiently.
Name Implication: The name "RapidOCR" embodies our core expectations for the product: Rapid (simple operation, fast response), Good & Economical (low resource consumption, high cost-effectiveness), and Intelligent (achieving accurate and efficient recognition relying on deep learning technology). We focus on leveraging the advantages of artificial intelligence to create compact yet powerful models, relentlessly pursuing speed while ensuring excellent recognition results.
User Guide:
If you find this project helpful for your work or study, we kindly ask you to give us a ⭐ Star to provide valuable support and encouragement!
pip install rapidocr onnxruntime
from rapidocr import RapidOCR
engine = RapidOCR()
img_url = "https://www.modelscope.cn/models/RapidAI/RapidOCR/resolve/master/resources/test_files/ch_en_num.jpg"
result = engine(img_url)
print(result)
result.vis("vis_result.jpg")
Full documentation can be found on docs, in Chinese.
For more projects that use RapidOCR, you are welcome to register at the registration address. Registration is solely for product promotion.
This repository contains the Python component of RapidOCR. Components for other languages have been migrated to separate repositories.
For the complete workflow on contributing to Python development, please refer to: Python CONTRIBUTING.
RapidOCR is an Apache2.0-licensed open source project with its ongoing development made possible entirely by the support of these awesome backers. If you'd like to join them, please consider sponsoring RapidOCR's development.
| Sponsors | Application | Introduction |
|---|---|---|
![]() |
Quicker | Your fingertip toolbox |
|
@zg-young |
@Eunsolfs |
@youzzhang |
If you find this project useful in your research, please consider cite:
@misc{RapidOCR 2021,
title={{Rapid OCR}: OCR Toolbox},
author={RapidAI Team},
howpublished = {\url{https://github.com/RapidAI/RapidOCR}},
year={2021}
}
The copyright of the OCR model is held by Baidu, while the copyrights of all other engineering scripts are retained by the repository's owner.
This project is released under the Apache 2.0 license.