GitHub

PaddlePaddle /PaddleOCR is a GitHub trending repository ranked #4 with 79,840 stars and 10,598 forks. Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the ga… Data from Daily Trends.

PaddlePaddle /PaddleOCR

Rank
4
Language
Python
Stars
79,840
Fork
10,598
Stars today
141
Growth rate
0.18%
Date range
today
Repo link
https://github.com/PaddlePaddle/PaddleOCR
Snapshot date
6/5/2026

Description

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.