PaddlePaddle/PaddleOCR is a trending GitHub repository, ranked #17 with 80,954 stars and 10,657 forks. Data from Daily Trends.

PaddlePaddle/PaddleOCR

Rank: 17
Language: Python
Stars: 80,954
Forks: 10,657
Stars today: 433
Growth rate: 0.53%
Date range: today
Repo link: https://github.com/PaddlePaddle/PaddleOCR
Snapshot date: 6/7/2026
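
The growth rate listed above is consistent with dividing the stars gained today by the total star count. The page does not state its formula, so the short sketch below is an assumption, and the function name `growth_rate_percent` is hypothetical.

```python
def growth_rate_percent(stars_today: int, total_stars: int) -> float:
    """Return today's new stars as a percentage of the total star count.

    Assumption: the listing computes growth as stars_today / total_stars.
    The source page does not document its formula.
    """
    if total_stars <= 0:
        raise ValueError("total_stars must be positive")
    return stars_today / total_stars * 100


# Figures taken from the listing: 433 stars today, 80,954 stars in total.
rate = growth_rate_percent(433, 80_954)
print(f"{rate:.2f}%")  # prints "0.53%"
```

Dividing by the previous day's total (80,954 minus 433) instead would round to 0.54%, which is why the sketch uses the current total as the denominator.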

Description

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.