Skip to the content.

HUST-ASTD

In this work, we presented four datasets named HUST-ART, HUST-AST, ABE, and Tana for Amharic script detection and recognition in the nature scene. The proposed datasets are the first comprehensive and public Amharic script datasets to the best of our knowledge. These datasets will promote the development of robust Amharic script detection and recognition algorithms. Consequently, the outcome will benefit people in East Africa, including diplomats from several countries and international communities. For detailed analysis and information, please refer to the supplementary document.

Download datasets:

Detection part:
HUST-ART[Baidu], passwd: 8wns or [OneDrive]
HUST-AST[Baidu], passwd: 8wns or [OneDrive]

Recognition part:
HUST-ART[Baidu], passwd:k6wj or [OneDrive]
ABE[Baidu], passwd:k6wj or [OneDrive]
TANA[Baidu], passwd: 8wns or [OneDrive]

Dataset Highlights

Octocat

HUST-ART

The comparisons of HUST-ART and other datasets:

Dataset Language Text shape Total images Cropped words Total instances Av. Instances
ICDAR-2013 ENG Horizontal 462 1,503 1,943 4.2
ICDAR-2015 ENG Oriented 1500 6,545 1,1886 7.9
MLT17 ENG/CHN Oriented 12,514 84,868 107,537 8.6
Total-Text ENG Cruved 1,555 9,330 11,459 6.0
Addis et al. Amharic - - 2,500 - -
HUST-ART Amharic Oriented 2,200 11,254 14,069 6.4

HUST-AST

Visualizations of Detection

Octocat

ABE

Tana

Visualizations of Recognition

Octocat

Visualizations of Recognition

Octocat

Leaderboard of detection

Results on HUST-ART dataset:

Method Backbone Precision Recall F1-measure FPS
EAST Res50 79.67 79.10 79.38 -
PSENET Res50 94.79 72.21 81.97 -
PAN Res18 95.21 73.52 82.97 28
DB Res18 96.61 73.67 83.60 50
DB Res50 95.31 74.62 83.71 22

Leaderboard of recognition

Results on ABE dataset:

Method Accuracy (%)
CRNN 75.91
RARE 78.13
ASTER 81.4
SATRN 85.66
MASTER 86.5

Results on HUST-ART dataset:

Method Accuracy (%)
CRNN 80.26
RARE 82.05
ASTER 85.3
SATRN 87.54
MASTER 87.7

Leaderboard of End-to-end detection and recognition

Method E2E Precision Recall F1-measure
PAN++ 30.31 93.38 30.06 45.48
Mask TextSpotter v3 81.71 88.31 80.82 84.40

E2E refer to the End-to-end recognition accuracy rate.

Contact

Send an email to myethiopia2025@gmail.com for any queries.