Paddleocr

Our logo includes team logo and official logo for PaddleOCR

PaddleOCR is an open-source OCR tool based on PaddlePaddle algorithm developed by company Baidu from China. It aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and apply them into practice. Compared with other OCR tools, PaddleOCR has the feature of ultra-lightweight and pre-trained models, which is suitable for local offline deployment and being embedded in IoT devices. It is also the reason that triggers we four Embedded Master students to study about it.

Project Site | Project Source

Authors

Yuhang Tian

Embedded Systems student, like anime.

Shixun Wu

Embedded Systems student, like Dota

Jinchen Zeng

Embedded Systems student, film lover

Mingyu Gao

Embedded Systems student, like cooking and music

PaddleOCR - Variability Makes It Splendid

This is the fourth article of the series relevant to PaddleCOR. This essay will mainly analyze the variability of PaddleOCR. If you are interested in this content, you can also visit our former ones. Variability Modeling Variability Management Variability Implementation Mechanism Reference Variability Modeling In this section, we will identify the variabilities/functionalities offered by PaddleOCR and introduce the benefits for stakeholders as well as the incompatibilities between these variabilities.

Paddleocr

March 28, 2021

Quality Process - To PaddleOCR and Beyond!

PaddleOCR is an ultra-lightweight OCR tool for extracting texts from pictures, which can be further integrated with other software systems or embedded devices. This week’s blog will explore the software quality and integration process of this effective OCR tool mainly based on its Github and the paper1 published by PaddleOCR developer team. Overall Software Testing Processes Key Elements in Continuous Integration Hotspot!Which Part is Mostly Updated？ Pull Request Analysis Technical Debt: Code Shows the Fact Reference Overall Software Testing Processes Software testing has a prescribed order in which things should be done to guarantee the user experience of software, serving as the ‘guardian’ of software quality.

Paddleocr

March 20, 2021

PaddleOCR - An Elegant And Modular Architecture

This essay is going to research PaddleOCR from the perspective of its architecture. We first derive its architecture style - pipe-and-filter and blackboard patterns from its working mechanism. Then we illustrate the containers and components to describe their structure. Furthermore, we analyze PaddleOCR from development and run-time views. At last, we simply show some API design principles that PaddleOCR applies. PaddleOCR’s Architectural Style Trade-off! The Tricky Architecture Architecture Structure Design Development View Runtime.

Paddleocr

March 15, 2021

PaddleOCR - An Ultra Lightweight OCR System

This article has been divided into 8 sections, where we, four Embedded System master students from Delft University of Technology, will analyze this open-source project named PaddleOCR developed by Baidu Company. The first two sections will give a general view of it. Following that, it will analyze the main capabilities and usage of the product. Then, the stakeholders and the key attributes will be presented. Besides, it will explore the roadmap of this application.

Paddleocr

March 6, 2021

fix the incompatibility of gif saving #2351

PaddlePaddle/PaddleOCR

This contribution fixes the incompatibility of git saving. The original file saves the output result with cv2.imwrite() while this API cannot correct saves ‘.gif’ file. So we add a judgement to check the type of image file. If the original image file is ‘.gif’ type, it will be saved as ‘.png’ file.

merged

Open PR