HUST-ASTD

In this work, we presented four datasets named HUST-ART, HUST-AST, ABE, and Tana for Amharic script detection and recognition in the nature scene. The proposed datasets are the first comprehensive and public Amharic script datasets to the best of our knowledge. These datasets will promote the development of robust Amharic script detection and recognition algorithms. Consequently, the outcome will benefit people in East Africa, including diplomats from several countries and international communities. For detailed analysis and information, please refer to the supplementary document.

Download datasets:

Detection part:
HUST-ART[Baidu], passwd: 8wns or [OneDrive]
HUST-AST[Baidu], passwd: 8wns or [OneDrive]

Recognition part:
HUST-ART[Baidu], passwd:k6wj or [OneDrive]
ABE[Baidu], passwd:k6wj or [OneDrive]
TANA[Baidu], passwd: 8wns or [OneDrive]

Dataset Highlights

Octocat

HUST-ART

The comparisons of HUST-ART and other datasets:

Dataset	Language	Text shape	Total images	Cropped words	Total instances	Av. Instances
ICDAR-2013	ENG	Horizontal	462	1,503	1,943	4.2
ICDAR-2015	ENG	Oriented	1500	6,545	1,1886	7.9
MLT17	ENG/CHN	Oriented	12,514	84,868	107,537	8.6
Total-Text	ENG	Cruved	1,555	9,330	11,459	6.0
Addis et al.	Amharic	-	-	2,500	-	-
HUST-ART	Amharic	Oriented	2,200	11,254	14,069	6.4

The first real world Amharic detection dataset.
Contain 2,200 natural scenes images.
A total of 11,254 cropped text instances.
Comprises diversified scenes, including signboards, posters, indoors, streets, etc.

HUST-AST

The first synthetic Amharic detection dataset.
Contain 75,904 images
A total of 829394 cropped synthetic text instances.
Various scenes and typeface.

Visualizations of Detection

Octocat

ABE

The first real world Amharic recognition dataset.
Contain 12,839 real-word text images: 7,621 for training and 5,218 for testing.
Various scenes

Tana

The first diverse and large synthetic Amharic recognition dataset.
Consists of 2851778 synthetic word images, including the 829394 HUST-AST cropped text images.
Various scenes.

Visualizations of Recognition

Octocat

Visualizations of Recognition

Octocat

Leaderboard of detection

Results on HUST-ART dataset:

Method	Backbone	Precision	Recall	F1-measure	FPS
EAST	Res50	79.67	79.10	79.38	-
PSENET	Res50	94.79	72.21	81.97	-
PAN	Res18	95.21	73.52	82.97	28
DB	Res18	96.61	73.67	83.60	50
DB	Res50	95.31	74.62	83.71	22

Leaderboard of recognition

Results on ABE dataset:

Method	Accuracy (%)
CRNN	75.91
RARE	78.13
ASTER	81.4
SATRN	85.66
MASTER	86.5

Results on HUST-ART dataset:

Method	Accuracy (%)
CRNN	80.26
RARE	82.05
ASTER	85.3
SATRN	87.54
MASTER	87.7

Leaderboard of End-to-end detection and recognition

Method	E2E	Precision	Recall	F1-measure
PAN++	30.31	93.38	30.06	45.48
Mask TextSpotter v3	81.71	88.31	80.82	84.40

E2E refer to the End-to-end recognition accuracy rate.

Contact

Send an email to myethiopia2025@gmail.com for any queries.