Design and implementation of Intelligent Composition generation Model driven by Multi-modal music features

Li, Lianghua

doi:10.65102/is2026215

Research article

Ingegneria Sismica

Volume 43 Issue 1
Pages: 1-21

Author(s): Li, Lianghua¹

¹College of Music and Dance, Hunan First Normal University, Changsha 410205, P. R. China

Published: 30/04/2026

Cite

Li, Lianghua. “Design and implementation of Intelligent Composition generation Model driven by Multi-modal music features.” Ingegneria Sismica Volume 43 Issue 1: 1-21, doi:10.65102/is2026215.

https://doi.org/10.65102/is2026215

Abstract

Generative artificial intelligence (AI) is rapidly entering digital content production and is pushing intelligent composition from single-sequence prediction toward multi-modal collaborative modeling. For composition driven by multi-modal music features, this paper constructs a generative model that integrates audio, MIDI, lyric text, style labels and emotion labels. The model combines unified feature representation, cross-modal attention fusion, hierarchical sequence generation, synergy constraints across melody, rhythm and harmony, bi-conditional modulation by style and emotion, and structural verification of the score, forming a closed loop from feature input to symbolic output. Experimental results show that melody coherence, style matching and emotional accuracy reach 91, 90 and 87 points respectively, with a comprehensive quality score of 88.4 that remains at 84.7 under 20% disturbance. In the lyric-assisted composition scenario, the comprehensive score is 89.1 and the style preservation rate is 91.4%. The results indicate that the method improves the structural stability and expressive consistency of the generated music and offers a reference for the engineering implementation of intelligent composition systems.
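
The abstract names cross-modal attention fusion but does not give its form. As a rough illustration only, the sketch below shows generic scaled dot-product cross-attention in plain Python, where features from one modality query features from another. The pairing of MIDI steps as queries with lyric tokens as keys and values, the toy 2-dimensional vectors, and the names `cross_attention`, `midi_feats` and `lyric_feats` are all assumptions made for this example; learned projection matrices are omitted, and this is not claimed to be the paper's architecture.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: every query vector attends over all keys.

    Returns one fused vector per query, each a weighted average of `values`.
    """
    dim = len(keys[0])
    width = len(values[0])
    fused = []
    for q in queries:
        # Similarity of this query to each key, scaled by sqrt(dim).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim)
                  for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        fused.append([sum(w * v[j] for w, v in zip(weights, values))
                      for j in range(width)])
    return fused


# Hypothetical toy inputs: 2 MIDI steps and 3 lyric tokens in a shared 2-d space.
midi_feats = [[1.0, 0.0], [0.0, 1.0]]
lyric_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# MIDI steps act as queries; lyric tokens supply keys and values.
fused = cross_attention(midi_feats, lyric_feats, lyric_feats)
```

Each row of `fused` is a lyric-informed representation of one MIDI step. A full model would typically apply learned query, key and value projections and several attention heads before passing the result to the sequence generator.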

Keywords
Multi-modal music feature; Intelligent composition; Music generation model; Cross-modal fusion

Open Access