
postgraduate thesis: Large vocabulary automatic chord estimation from audio using deep learning approaches

Title: Large vocabulary automatic chord estimation from audio using deep learning approaches
Authors: Deng, Junqi (邓俊祺)
Advisor(s): Kwok, YK
Issue Date: 2016
Publisher: The University of Hong Kong (Pokfulam, Hong Kong)
Citation: Deng, J. [邓俊祺]. (2016). Large vocabulary automatic chord estimation from audio using deep learning approaches. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract: Being well aware of the chord annotation subjectivity issue, this thesis argues for the necessity of a large vocabulary with a joint argument of machine musicianship and the Turing test. Built upon this premise, it proposes two deep-learning-based system frameworks that lead to potential practical solutions to large vocabulary automatic chord estimation. The first framework separates chord segmentation and classification into two tasks, unlike all previous approaches, which combine them in a single pass. Several deep learning models are implemented and tested. Under the large vocabulary evaluation, the recurrent neural network model shows great potential for balanced performance across different chords. This framework showed its advantages under large vocabulary evaluation in the automatic chord estimation task of the Music Information Retrieval Evaluation eXchange (MIREX) 2016. The second framework incorporates an approach that is sensitive to skewed class distributions. It employs an "even chance" scheme to boost the uncommon chords' exposure when training a recurrent neural network sequence decoder. The main drawback of this approach is its low segmentation quality. Nevertheless, it demonstrates that the even chance training scheme is effective for large vocabulary automatic chord estimation. Finally, a preliminary study has been conducted on automatic jazz chord estimation. Building on this study, a chord-scale estimation system is built and some semi-automatic or fully automatic jazz improvisation demos are created.
Degree: Doctor of Philosophy
Subject: Music - Data processing; Machine learning
Dept/Program: Electrical and Electronic Engineering
Persistent Identifier: http://hdl.handle.net/10722/249913

 

DC Field | Value | Language
dc.contributor.advisor | Kwok, YK | -
dc.contributor.author | Deng, Junqi | -
dc.contributor.author | 邓俊祺 | -
dc.date.accessioned | 2017-12-19T09:27:44Z | -
dc.date.available | 2017-12-19T09:27:44Z | -
dc.date.issued | 2016 | -
dc.identifier.citation | Deng, J. [邓俊祺]. (2016). Large vocabulary automatic chord estimation from audio using deep learning approaches. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | -
dc.identifier.uri | http://hdl.handle.net/10722/249913 | -
dc.description.abstract | Being well aware of the chord annotation subjectivity issue, this thesis argues for the necessity of a large vocabulary with a joint argument of machine musicianship and the Turing test. Built upon this premise, it proposes two deep-learning-based system frameworks that lead to potential practical solutions to large vocabulary automatic chord estimation. The first framework separates chord segmentation and classification into two tasks, unlike all previous approaches, which combine them in a single pass. Several deep learning models are implemented and tested. Under the large vocabulary evaluation, the recurrent neural network model shows great potential for balanced performance across different chords. This framework showed its advantages under large vocabulary evaluation in the automatic chord estimation task of the Music Information Retrieval Evaluation eXchange (MIREX) 2016. The second framework incorporates an approach that is sensitive to skewed class distributions. It employs an "even chance" scheme to boost the uncommon chords' exposure when training a recurrent neural network sequence decoder. The main drawback of this approach is its low segmentation quality. Nevertheless, it demonstrates that the even chance training scheme is effective for large vocabulary automatic chord estimation. Finally, a preliminary study has been conducted on automatic jazz chord estimation. Building on this study, a chord-scale estimation system is built and some semi-automatic or fully automatic jazz improvisation demos are created. | -
dc.language | eng | -
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | -
dc.relation.ispartof | HKU Theses Online (HKUTO) | -
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | -
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | -
dc.subject.lcsh | Music - Data processing | -
dc.subject.lcsh | Machine learning | -
dc.title | Large vocabulary automatic chord estimation from audio using deep learning approaches | -
dc.type | PG_Thesis | -
dc.description.thesisname | Doctor of Philosophy | -
dc.description.thesislevel | Doctoral | -
dc.description.thesisdiscipline | Electrical and Electronic Engineering | -
dc.description.nature | published_or_final_version | -
dc.identifier.doi | 10.5353/th_991043976387303414 | -
dc.date.hkucongregation | 2017 | -
dc.identifier.mmsid | 991043976387303414 | -
