Data-adaptive graph-regularized matrix factorizations

Chen, Yangge; 陳阳戈

Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Data-adaptive graph-regularized matrix factorizations

Title	Data-adaptive graph-regularized matrix factorizations
Authors	Chen, Yangge 陳阳戈
Advisors	Wu, YC
Issue Date	2023
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Chen, Y. [陳阳戈]. (2023). Data-adaptive graph-regularized matrix factorizations. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Matrix factorizations have been used in many fields, such as recommender systems, genetic research, and image processing. They are used to explore hidden patterns in observed data, recover missing data, and predict unknown scenarios. With the rapidly growing amount of data and increasing demands on data analysis capability, side information is widely employed on top of basic matrix factorization models. In particular, graph information, which provides data correlations or pairwise similarities, is a commonly used form of side information. Owing to the additional perspective that graph information provides, graph-regularized matrix factorization models have been shown to outperform vanilla matrix factorization models in various applications, including matrix completion and multi-label learning. Incorporating graph information into matrix factorizations requires the model to balance information from data observations, graph regularizations, and model assumptions. However, the data adaptability of the regularization is often overlooked in existing work on graph-regularized matrix factorization. More specifically, matrix factorization gives rise to the concept of a theme, as it groups data into multiple components in an unsupervised manner. Existing graph-regularized methods ignore this theme-wise data partition and apply the same regularization parameters to all themes. While generalizing to theme-wise graph regularization may seem straightforward at first sight, it dramatically increases the number of parameters to be tuned in traditional optimization-based methods. Besides theme-wise adaptability, data-adaptive regularization becomes necessary if the graph information varies across different parts of the data. In this case, the graph regularizations should be embedded locally, and the mechanism for doing so also needs further investigation.
To allow theme-wise adaptive regularization while avoiding the computationally costly tuning of regularization parameters, this thesis investigates the graph-regularized matrix factorization problem from a probabilistic modeling perspective. Thanks to a newly designed prior distribution and Bayesian inference techniques, the graph regularization can be embedded adaptively for each theme while the regularization parameters are learned automatically. In addition, model assumptions such as low-rankness can be incorporated into the proposed model at the same time to tackle the missing data challenge. A nontrivial conditional conjugacy is exploited so that an efficient probabilistic graph-regularized matrix factorization algorithm can be derived under the variational inference framework. Extensive numerical results on various matrix completion and multi-label learning applications show the superior performance of the proposed tuning-free method compared with existing state-of-the-art models.
In addition to graph-regularized factorization of a single matrix, this thesis also investigates the factorization of multiple coupled matrices in the context of functional magnetic resonance imaging. In this case, graph information represents the structural brain connections, which can be leveraged to learn the functional interactions between different regions of the brain. This structurally informed factorization has long been a challenge because the structural brain connections are inherently sparse and vary from subject to subject. To overcome this challenge, a novel sparse non-negative matrix factorization model with local graph regularizations is proposed. Results show that the estimated latent dynamic functional brain networks are closer to a priori knowledge of brain organization and are more interpretable owing to the elimination of redundant bases for the latent functional brain networks.
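The abstract does not state a concrete model, so the following is only an illustrative sketch of what "theme-wise" graph regularization could look like in a conventional optimization-based setting. It assumes a masked squared-error fit and a Laplacian smoothness penalty applied to each column (theme) of one factor, with one weight per theme. The names `graph_laplacian`, `objective`, `mask`, and `lam` are hypothetical, and the sketch does not implement the probabilistic model or the variational inference algorithm proposed in the thesis, which learns such weights automatically.

```python
import numpy as np

def graph_laplacian(W):
    """Unnormalized Laplacian L = D - W of a symmetric adjacency matrix W."""
    return np.diag(W.sum(axis=1)) - W

def objective(X, mask, U, V, L, lam):
    """Masked squared error plus a theme-wise graph penalty (assumed form).

    X    : (n, m) data matrix; entries where mask == 0 are treated as missing
    U, V : (n, r) and (m, r) factors, so X is approximated by U @ V.T
    L    : (n, n) graph Laplacian over the rows of X
    lam  : (r,) one regularization weight per theme (column of U);
           a constant vector recovers the usual single-parameter penalty
    """
    fit = np.sum(mask * (X - U @ V.T) ** 2)
    # u_k^T L u_k for every theme k, computed in one contraction
    smooth = np.einsum("ik,ij,jk->k", U, L, U)
    return float(fit + np.dot(lam, smooth))

# Tiny example: two nodes joined by one edge, two themes.
W = np.array([[0.0, 1.0], [1.0, 0.0]])
L = graph_laplacian(W)
U = np.array([[1.0, 0.0], [0.0, 2.0]])
V = np.eye(2)
X = U @ V.T                      # perfect fit, so only the penalty remains
mask = np.ones_like(X)

shared = objective(X, mask, U, V, L, np.array([1.0, 1.0]))
themewise = objective(X, mask, U, V, L, np.array([2.0, 0.5]))
```

With a rank of r, the vector `lam` has r entries to choose, which is the growth in tuning effort that the abstract attributes to optimization-based methods.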
Degree	Doctor of Philosophy
Subject	Matrices
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/335927

DC Field	Value	Language
dc.contributor.advisor	Wu, YC	-
dc.contributor.author	Chen, Yangge	-
dc.contributor.author	陳阳戈	-
dc.date.accessioned	2023-12-29T04:04:54Z	-
dc.date.available	2023-12-29T04:04:54Z	-
dc.date.issued	2023	-
dc.identifier.citation	Chen, Y. [陳阳戈]. (2023). Data-adaptive graph-regularized matrix factorizations. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/335927	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Matrices	-
dc.title	Data-adaptive graph-regularized matrix factorizations	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2023	-
dc.identifier.mmsid	991044634606903414	-
