|
6 | 6 | ---
|
7 | 7 |
|
8 | 8 | ### Recently Update
|
| 9 | +#### **🔥🔥 2025.06.19: Release of PaddleOCR v3.0.2, which includes:** |
| 10 | + |
| 11 | +- **New Features:** |
| 12 | + |
| 13 | + - The default download source has been changed from `BOS` to `HuggingFace`. Users can also change the environment variable `PADDLE_PDX_MODEL_SOURCE` to `BOS` to set the model download source back to Baidu Object Storage (BOS). |
| 14 | + - Added service invocation examples for six languages—C++, Java, Go, C#, Node.js, and PHP—for pipelines like PP-OCRv5, PP-StructureV3, and PP-ChatOCRv4. |
| 15 | + - Improved the layout partition sorting algorithm in the PP-StructureV3 pipeline, enhancing the sorting logic for complex vertical layouts to deliver better results. |
| 16 | + - Enhanced model selection logic: when a language is specified but a model version is not, the system will automatically select the latest model version supporting that language. |
| 17 | + - Set a default upper limit for MKL-DNN cache size to prevent unlimited growth, while also allowing users to configure cache capacity. |
| 18 | + - Updated default configurations for high-performance inference to support Paddle MKL-DNN acceleration and optimized the logic for automatic configuration selection for smarter choices. |
| 19 | + - Adjusted the logic for obtaining the default device to consider the actual support for computing devices by the installed Paddle framework, making program behavior more intuitive. |
| 20 | + - Added Android example for PP-OCRv5. [Details](https://paddlepaddle.github.io/PaddleOCR/latest/en/version3.x/deployment/on_device_deployment.html). |
| 21 | + |
| 22 | +- **Bug Fixes:** |
| 23 | + |
| 24 | + - Fixed an issue with some CLI parameters in PP-StructureV3 not taking effect. |
| 25 | + - Resolved an issue where `export_paddlex_config_to_yaml` would not function correctly in certain cases. |
| 26 | + - Corrected the discrepancy between the actual behavior of `save_path` and its documentation description. |
| 27 | + - Fixed potential multithreading errors when using MKL-DNN in basic service deployment. |
| 28 | + - Corrected channel order errors in image preprocessing for the Latex-OCR model. |
| 29 | + - Fixed channel order errors in saving visualized images within the text recognition module. |
| 30 | + - Resolved channel order errors in visualized table results within PP-StructureV3 pipeline. |
| 31 | + - Fixed an overflow issue in the calculation of `overlap_ratio` under extremely special circumstances in the PP-StructureV3 pipeline. |
| 32 | + |
| 33 | +- **Documentation Improvements:** |
| 34 | + |
| 35 | + - Updated the description of the `enable_mkldnn` parameter in the documentation to accurately reflect the program's actual behavior. |
| 36 | + - Fixed errors in the documentation regarding the `lang` and `ocr_version` parameters. |
| 37 | + - Added instructions for exporting production line configuration files via CLI. |
| 38 | + - Fixed missing columns in the performance data table for PP-OCRv5. |
| 39 | + - Refined benchmark metrics for PP-StructureV3 pipeline across different configurations. |
| 40 | + |
| 41 | +- **Others:** |
| 42 | + |
| 43 | + - Relaxed version restrictions on dependencies like numpy and pandas, restoring support for Python 3.12. |
9 | 44 |
|
10 | 45 | #### **🔥🔥 2025.06.05: Release of PaddleOCR v3.0.1, which includes:**
|
11 | 46 |
|
12 | 47 | - **Optimisation of certain models and model configurations:**
|
13 |
| - - Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64. |
14 |
| - - Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model. |
15 |
| - - Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%. |
| 48 | + - Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64. |
| 49 | + - Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model. |
| 50 | + - Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%. |
16 | 51 |
|
17 | 52 | - **Optimisation of issues present in version 3.0.0:**
|
18 |
| - - **Improved CLI usage experience:** When using the PaddleOCR CLI without passing any parameters, a usage prompt is now provided. |
19 |
| - - **New parameters added:** PP-ChatOCRv3 and PP-StructureV3 now support the `use_textline_orientation` parameter. |
20 |
| - - **CPU inference speed optimisation:** All pipeline CPU inferences now enable MKL-DNN by default. |
21 |
| - - **Support for C++ inference:** The detection and recognition concatenation part of PP-OCRv5 now supports C++ inference. |
| 53 | + - **Improved CLI usage experience:** When using the PaddleOCR CLI without passing any parameters, a usage prompt is now provided. |
| 54 | + - **New parameters added:** PP-ChatOCRv3 and PP-StructureV3 now support the `use_textline_orientation` parameter. |
| 55 | + - **CPU inference speed optimisation:** All pipeline CPU inferences now enable MKL-DNN by default. |
| 56 | + - **Support for C++ inference:** The detection and recognition concatenation part of PP-OCRv5 now supports C++ inference. |
22 | 57 |
|
23 | 58 | - **Fixes for issues present in version 3.0.0:**
|
24 |
| - - Fixed an issue where PP-StructureV3 encountered CPU inference errors due to the inability to use MKL-DNN with formula and table recognition models. |
25 |
| - - Fixed an issue where GPU environments encountered the error `FatalError: Process abort signal is detected by the operating system` during inference. |
26 |
| - - Fixed type hint issues in some Python 3.8 environments. |
27 |
| - - Fixed the issue where the method `PPStructureV3.concatenate_markdown_pages` was missing. |
28 |
| - - Fixed an issue where specifying both `lang` and `model_name` when instantiating `paddleocr.PaddleOCR` resulted in `model_name` being ineffective. |
| 59 | + - Fixed an issue where PP-StructureV3 encountered CPU inference errors due to the inability to use MKL-DNN with formula and table recognition models. |
| 60 | + - Fixed an issue where GPU environments encountered the error `FatalError: Process abort signal is detected by the operating system` during inference. |
| 61 | + - Fixed type hint issues in some Python 3.8 environments. |
| 62 | + - Fixed the issue where the method `PPStructureV3.concatenate_markdown_pages` was missing. |
| 63 | + - Fixed an issue where specifying both `lang` and `model_name` when instantiating `paddleocr.PaddleOCR` resulted in `model_name` being ineffective. |
29 | 64 |
|
30 | 65 | #### **🔥🔥 2025.05.20: PaddleOCR 3.0 Official Release Highlights**
|
31 | 66 |
|
|
0 commit comments