Outline

Ingegneria Sismica

Ingegneria Sismica

End-to-End Music Braille Transcription from Sheet Music Images

Author(s): Lihua Chai1, Yue Fu1, Zhi Yu2, Tianyuan Huang3, Zepeng Zhu2, Jiaxian He2
1Department of Special Education, Zhejiang College of Special Education, Hangzhou, China
2School of Software Technology, Zhejiang University, Hangzhou, China
3College of Computer Science and Technology, Zhejiang University, Hangzhou, China
Chai, Lihua . et al “End-to-End Music Braille Transcription from Sheet Music Images.” Ingegneria Sismica Volume 43 Issue 3: 1-21, doi:10.65102/is20261078.

Abstract

An end-to-end translation paradigm was proposed to address the problems of error accumulation and low robustness in cascaded music score-to-braille translation methods. For this purpose, a large-scale staff music-braille parallel corpus was constructed, which contains 300,000 sample pairs with music-element-level alignment. An encoder-decoder model was then designed to achieve direct conversion from score images to braille symbol sequences. This was accomplished by jointly optimizing visual feature extraction, musical semantic understanding, and sequence generation. A data augmentation strategy specific to music score characteristics was also introduced to enhance the model’s generalization ability towards diverse layouts and image noise. Experimental results show the model’s performance is significantly superior to that of cascaded baseline methods. The generated braille exhibits high accuracy in key musical semantics, such as pitch and duration. Furthermore, a strong robustness is demonstrated against different layouts and noisy environments. A new technical solution for efficient and accurate automated braille music production is thus provided.

Keywords
End-to-End Music Braille Transcription; Optical Music Recognition (OMR); Staff-to-Braille Parallel Corpus; Encoder-Decoder Model; Hybrid Vision Transformer (Hybrid ViT); Data Augmentation; Error Propagation; Music Accessibility

Related Articles

Liqin Zheng1, Dongrui Qing2, Yan Zhang1
1School of Mathematics and Statistics, Shaan Xi Xue Qian Normal University Xi’an 710100, P.R.China
2School of Marxism, Xi’an University of Finance and Economics Xi’an 710100, P.R.China
Yanan Gao1, Aiqun Peng2, Nina Ma2
1Management School of Anhui Business and Technology College Hefei 230000, Anhui, China
2Economics and Trade School of Anhui Business and Technology College Hefei 230000, Anhui, China
Ya’ning Liu1, Ping Ma1
1School of Teacher Education, Shihezi University, Shihezi, Xinjiang, 832000, China
Yuhui Li1, Zhongliang Gong1
1College of Mechanical and Intelligent Manufacturing, Central South University of Forestry and Technology, Changsha, Hunan, 410004, China
Hanqing Hu1, Chengjin Liu1, Tianmu Tian1
1School of Management Science and Engineering, Beijing Information Science & Technology University, Beijing 100192