Skip to content

๐Ÿ“„ Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

License

Notifications You must be signed in to change notification settings

RapidAI/RapidOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Shows an illustrated sun in light mode and a moon with stars in dark mode.
ย 
Open source OCR for the security of the digital world
ย 

Open in Colab PyPI SemVer2.0

็ฎ€ไฝ“ไธญๆ–‡ | English

๐Ÿ“ Introduction

Introducing the foremost multi-platform, multi-lingual OCR tool that boasts unparalleled speed, expansive support, and complete openness. This exceptional software is entirely free and renowned for facilitating swift offline deployments.

Supported Languages: It inherently supports Chinese and English, with self-service conversion required for additional languages. Please refer here for specific language support details.

Rationale: Acknowledging the limitations in PaddleOCR's architecture, we embarked on a mission to simplify OCR inference across diverse platforms. This endeavor culminated in converting PaddleOCR's model to the versatile ONNX format and seamlessly integrating it into Python, C++, Java, and C# environments.

Etymology: Derived from its essence, RapidOCR embodies lightness, velocity, affordability, and intelligence. Rooted in deep learning, this OCR technology underscores AI's prowess and emphasizes compact models, prioritizing swiftness without compromising efficacy.

Usage Scenarios:

  • Instant Deployment: If the pre-existing models within our repository suffice, simply leverage RapidOCR for swift deployment.
  • Customization: In case of specific requirements, refine PaddleOCR with your data and proceed with RapidOCR deployment, ensuring tailored results.

If our repository proves beneficial to your endeavors, kindly consider leaving a star โญ on GitHub to show your appreciation. It means the world to us!

๐ŸŽฅ Visualization

Demo

๐Ÿ› ๏ธ Installation

pip install rapidocr onnxruntime

๐Ÿ“‹ Usage

from rapidocr import RapidOCR

engine = RapidOCR()

img_url = "https://github.com/RapidAI/RapidOCR/blob/main/python/tests/test_files/ch_en_num.jpg?raw=true"
result = engine(img_url)
print(result)

result.vis("vis_result.jpg")

๐Ÿ“š Documentation

Full documentation can be found on docs, in Chinese.

๐Ÿ‘ฅ Who use? (more)

For more projects that use RapidOCR, you are welcome to register at the registration address. Registration is solely for product promotion.

๐Ÿ™ Acknowledgements

  • Many thanks to PaddleOCR for everything.
  • Many thanks to PaddleOCR2Pytorch for providing the converted PyTorch format models.
  • Many thanks to PaddleX for providing the document models.
  • Many thanks to DeliciaLaniD for fixing the misplaced start position of scan animation in ocrweb.
  • Many thanks to zhsunlight for the suggestion about parameterized call GPU reasoning and the careful and thoughtful testing.
  • Many thanks to lzh111222334 for fixing some bugs of rec preprocessing under python version.
  • Many thanks to AutumnSun1996 for the suggestion in the #42.
  • Many thanks to DeadWood8 for providing the document which packages rapidocr_web to exe by Nuitka.
  • Many thanks to Loovelj for fixing the bug of sorting the text boxes. For details see issue 75.

๐ŸŽ– Code Contributors

๐ŸŒŸ Sponsors & Backers

RapidOCR is an Apache2.0-licensed open source project with its ongoing development made possible entirely by the support of these awesome backers. If you'd like to join them, please consider sponsoring RapidOCR's development.

Sponsors

Sponsors Application Introduction
Quicker Your fingertip toolbox
Compshare 10,000+ RTX 40 Series GPUs, Global AI Model APIs. Instant response. Pay-as-you-go. Free trial for new users.

Backers

๐Ÿ“œ Citation

If you find this project useful in your research, please consider cite:

@misc{RapidOCR 2021,
    title={{Rapid OCR}: OCR Toolbox},
    author={RapidAI Team},
    howpublished = {\url{https://github.com/RapidAI/RapidOCR}},
    year={2021}
}

โญ๏ธ Stargazers over time

Stargazers over time

โš–๏ธ License

The copyright of the OCR model is held by Baidu, while the copyrights of all other engineering scripts are retained by the repository's owner.

This project is released under the Apache 2.0 license.