Generalization Analysis for Contrastive Representation Learning

Lei, Yunwen; Yang, Tianbao; Ying, Yiming; Zhou, Ding-Xuan

File Download

content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.48550/arXiv.2302.12383

Supplementary

Citations:
Appears in Collections:
- Mathematics: Conference papers

Conference Paper: Generalization Analysis for Contrastive Representation Learning

Title	Generalization Analysis for Contrastive Representation Learning
Authors	Lei, Yunwen Yang, Tianbao Ying, Yiming Zhou, Ding-Xuan
Issue Date	23-Jul-2023
Abstract	Recently, contrastive learning has found impressive success in advancing the state of the art in solving various machine learning tasks. However, the existing generalization analysis is very limited or even not meaningful. In particular, the existing generalization error bounds depend linearly on the number $k$ of negative examples while it was widely shown in practice that choosing a large $k$ is necessary to guarantee good generalization of contrastive learning in downstream tasks. In this paper, we establish novel generalization bounds for contrastive learning which do not depend on $k$, up to logarithmic terms. Our analysis uses structural results on empirical covering numbers and Rademacher complexities to exploit the Lipschitz continuity of loss functions. For self-bounding Lipschitz loss functions, we further improve our results by developing optimistic bounds which imply fast rates in a low noise condition. We apply our results to learning with both linear representation and nonlinear representation by deep neural networks, for both of which we derive Rademacher complexity bounds to get improved generalization bounds.
Persistent Identifier	http://hdl.handle.net/10722/333738

DC Field	Value	Language
dc.contributor.author	Lei, Yunwen	-
dc.contributor.author	Yang, Tianbao	-
dc.contributor.author	Ying, Yiming	-
dc.contributor.author	Zhou, Ding-Xuan	-
dc.date.accessioned	2023-10-06T08:38:41Z	-
dc.date.available	2023-10-06T08:38:41Z	-
dc.date.issued	2023-07-23	-
dc.identifier.uri	http://hdl.handle.net/10722/333738	-
dc.description.abstract	<p>Recently, contrastive learning has found impressive success in advancing the state of the art in solving various machine learning tasks. However, the existing generalization analysis is very limited or even not meaningful. In particular, the existing generalization error bounds depend linearly on the number $k$ of negative examples while it was widely shown in practice that choosing a large $k$ is necessary to guarantee good generalization of contrastive learning in downstream tasks. In this paper, we establish novel generalization bounds for contrastive learning which do not depend on $k$, up to logarithmic terms. Our analysis uses structural results on empirical covering numbers and Rademacher complexities to exploit the Lipschitz continuity of loss functions. For self-bounding Lipschitz loss functions, we further improve our results by developing optimistic bounds which imply fast rates in a low noise condition. We apply our results to learning with both linear representation and nonlinear representation by deep neural networks, for both of which we derive Rademacher complexity bounds to get improved generalization bounds.</p>	-
dc.language	eng	-
dc.relation.ispartof	International Conference on Machine Learning (23/07/2023-29/07/2023, Honolulu, Hawaii)	-
dc.title	Generalization Analysis for Contrastive Representation Learning	-
dc.type	Conference_Paper	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.48550/arXiv.2302.12383	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Generalization Analysis for Contrastive Representation Learning

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats