GitHub

PaddlePaddle/PaddleOCR is a GitHub trending repository ranked #10, with 80,529 stars and 10,632 forks. Data from Daily Trends.

PaddlePaddle/PaddleOCR

Rank: 10
Language: Python
Stars: 80,529
Forks: 10,632
Stars today: 747
Growth rate: 0.93%
Date range: today
Repo link: https://github.com/PaddlePaddle/PaddleOCR
Snapshot date: 6/6/2026

Description

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.