Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/18140
Full metadata record
DC Field                    Value                                         Language
dc.contributor.advisor      Meng, H                                       -
dc.contributor.advisor      Boulgouris, N                                 -
dc.contributor.author       Zhang, Fan                                    -
dc.date.accessioned         2019-05-20T10:20:46Z                          -
dc.date.available           2019-05-20T10:20:46Z                          -
dc.date.issued              2019                                          -
dc.identifier.uri           http://bura.brunel.ac.uk/handle/2438/18140    -
dc.description              This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University London.  en_US
dc.description.abstract     As one of the oldest human inventions, music appears in many art forms, such as songs, films and theatre. It can be seen as another language, used to express the author's thoughts and emotions: in many cases, music conveys both the meaning the author intends and the feelings the audience experiences. However, the emotions that arise while people enjoy music are complex and difficult to explain precisely. Music Emotion Recognition (MER), the task of recognising emotions from music, is therefore an interesting research topic in the field of artificial intelligence. Recognition methods and tools for music signals have developed rapidly in recent years, and with advances in signal processing, machine learning and algorithm optimisation, recognition accuracy continues to improve. This thesis focuses on three significant parts of MER, namely features, learning methods and music emotion theory, to explain and illustrate how to build effective MER systems. Firstly, an automatic MER system for classifying 4 emotions was proposed, in which OpenSMILE was used for feature extraction and the IS09 feature set was selected. Combined with STAT statistical features, a Random Forest classifier produced better performance than previous systems: under suitable parameter settings, this combination of feature selection and machine learning improved MER accuracy by at least 3.5% over other feature combinations, and the new IS09 + STAT feature combination reached 83.8% accuracy. Secondly, another MER system for 4 emotions was proposed based on the dynamic properties of music signals, with features extracted from segments of each recording in the APM database rather than from the whole recording. A Long Short-Term Memory (LSTM) deep learning model was then used for classification, since it can exploit the dynamic, continuous information between time-frame segments for more effective emotion recognition. However, the final performance reached only 65.7%, which was worse than expected. The likely reason is that the database does not suit the LSTM as well as initially thought: the information between segments may not be sufficient to improve recognition compared with traditional methods. This result shows that a complex deep learning method is not suitable for every database, as the LSTM did not work well on this continuous database. Finally, the aim was to recognise emotion by identifying the chords within the music, since previous theoretical work states that particular chords carry particular emotional information. The research began by building a new chord database, using Adobe Audition to extract chord clips from piano chord teaching audio. FFT features based on 1000-point sampled pre-processed data, together with STAT features, were then extracted for the selected samples from the database. After comparison using Euclidean distance and correlation, the results showed that the STAT features work well for most chords except the augmented chord. This approach of recognising 6 emotions from music was used for the first time in this research and achieved 75% accuracy in chord identification. In summary, the research proposed new MER methods through three different approaches; some achieved good recognition performance and some have broader application prospects.  en_US
dc.language.iso             en                                            en_US
dc.publisher                Brunel University London                      en_US
dc.relation.uri             https://bura.brunel.ac.uk/bitstream/2438/18140/1/FulltextThesis.pdf  -
dc.subject                  MER                                           en_US
dc.subject                  Long short-term memory                        en_US
dc.subject                  Random Forest                                 en_US
dc.title                    Music Emotion Recognition based on Feature Combination, Deep Learning and Chord Detection  en_US
dc.type                     Thesis                                        en_US
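
To make the first approach in the abstract concrete, the sketch below shows one plausible shape of the IS09 + STAT feature combination with a Random Forest classifier, written in Python with scikit-learn. The file names, train/test split and classifier parameters are illustrative assumptions, not values from the thesis; only the overall pipeline (OpenSMILE IS09 vectors concatenated with STAT statistics, then Random Forest) follows the abstract.

    # Assumed pipeline sketch: combine pre-extracted IS09 and STAT features,
    # then classify 4 emotions with a Random Forest.
    import numpy as np
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import train_test_split
    from sklearn.metrics import accuracy_score

    # Hypothetical pre-extracted features: OpenSMILE IS09 vectors (384 per clip)
    # and hand-crafted STAT statistics, with one of 4 emotion labels per clip.
    is09 = np.load("is09_features.npy")
    stat = np.load("stat_features.npy")
    labels = np.load("emotion_labels.npy")

    X = np.hstack([is09, stat])            # IS09 + STAT feature combination
    X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.2, random_state=0)

    clf = RandomForestClassifier(n_estimators=100, random_state=0)
    clf.fit(X_tr, y_tr)
    print("accuracy:", accuracy_score(y_te, clf.predict(X_te)))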
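For the second approach, a minimal Keras sketch of a segment-based LSTM classifier is given below. The segment count, feature dimensionality and hyperparameters are placeholders, and random arrays stand in for the per-segment APM features; only the idea of feeding one feature vector per time-frame segment into an LSTM follows the abstract.

    # Assumed sketch: LSTM over per-segment feature sequences for 4 emotions.
    import numpy as np
    from tensorflow.keras.models import Sequential
    from tensorflow.keras.layers import LSTM, Dense

    n_clips, n_segments, n_features, n_emotions = 200, 30, 384, 4
    X = np.random.randn(n_clips, n_segments, n_features)  # one feature vector per segment
    y = np.random.randint(0, n_emotions, size=n_clips)    # placeholder labels

    model = Sequential([
        LSTM(64, input_shape=(n_segments, n_features)),   # captures segment-to-segment dynamics
        Dense(n_emotions, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    model.fit(X, y, epochs=10, batch_size=16, validation_split=0.2)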
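The third approach compares FFT-based and STAT features of chord clips using Euclidean distance and correlation. The sketch below shows one way such template matching could look; the feature definitions, the 1000-point framing and the template dictionary format are assumptions for illustration, not the thesis's exact procedure.

    # Assumed sketch: chord-template matching by Euclidean distance and correlation.
    import numpy as np

    def fft_features(signal, n_points=1000):
        """Magnitude spectrum of the first n_points samples (assumed framing)."""
        return np.abs(np.fft.rfft(signal[:n_points]))

    def stat_features(vec):
        """Simple STAT-style statistics over a feature vector (illustrative)."""
        return np.array([vec.mean(), vec.std(), vec.min(), vec.max()])

    def match_chord(query, templates):
        """Nearest chord template by Euclidean distance and by correlation."""
        by_dist = min(templates, key=lambda n: np.linalg.norm(query - templates[n]))
        by_corr = max(templates, key=lambda n: np.corrcoef(query, templates[n])[0, 1])
        return by_dist, by_corr

    # Usage: templates maps chord names (e.g. "C major", "Augmented") to stored
    # feature vectors of the same length as the query.
    # query = fft_features(clip_samples)
    # print(match_chord(query, templates))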
Appears in Collections:Electronic and Computer Engineering
Dept of Electronic and Electrical Engineering Theses

Files in This Item:
File                  Description    Size      Format
FulltextThesis.pdf                   4.9 MB    Adobe PDF


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.