Article: What Is It for a Machine Learning Model to Have a Capability?

Title: What Is It for a Machine Learning Model to Have a Capability?
Authors: Harding, Jacqueline; Sharadin, Nathaniel
Issue Date: 9-Jul-2024
Publisher: The University of Chicago Press
Citation: The British Journal for the Philosophy of Science, 2024
Abstract

What can contemporary machine learning (ML) models do? Given the proliferation of ML models in society, answering this question matters to a variety of stakeholders, both public and private. The evaluation of models' capabilities is rapidly emerging as a key subfield of modern ML, buoyed by regulatory attention and government grants. Despite this, the notion of an ML model possessing a capability has not been interrogated: what are we saying when we say that a model is able to do something? And what sorts of evidence bear upon this question? In this paper, we aim to answer these questions, using the capabilities of large language models (LLMs) as a running example. Drawing on the large philosophical literature on abilities, we develop an account of ML models' capabilities which can be usefully applied to the nascent science of model evaluation. Our core proposal is a conditional analysis of model abilities (CAMA): crudely, a machine learning model has a capability to X just when it would reliably succeed at doing X if it 'tried'. The main contribution of the paper is making this proposal precise in the context of ML, resulting in an operationalisation of CAMA applicable to LLMs. We then put CAMA to work, showing that it can help make sense of various features of ML model evaluation practice, as well as suggest procedures for performing fair inter-model comparisons.
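
As a rough formal gloss of CAMA (the predicate Cap, the reliability threshold \theta, and the elicitation reading of 'tries' below are illustrative glosses, not notation taken from the paper):

\[
\mathrm{Cap}(M, X) \;\iff\; \Pr\big(M \text{ succeeds at } X \,\big|\, M \text{ tries to do } X\big) \;\ge\; \theta
\]

For an LLM, 'M tries to do X' would be operationalised in terms of suitable elicitation (for example, prompting), and \theta fixes what counts as succeeding 'reliably'.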


Persistent Identifier: http://hdl.handle.net/10722/356818
ISSN: 0007-0882
2023 Impact Factor: 3.2
2023 SCImago Journal Rankings: 1.446

DC Field                   Value
dc.contributor.author      Harding, Jacqueline
dc.contributor.author      Sharadin, Nathaniel
dc.date.accessioned        2025-06-19T00:35:14Z
dc.date.available          2025-06-19T00:35:14Z
dc.date.issued             2024-07-09
dc.identifier.citation     The British Journal for the Philosophy of Science, 2024
dc.identifier.issn         0007-0882
dc.identifier.uri          http://hdl.handle.net/10722/356818
dc.language                eng
dc.publisher               The University of Chicago Press
dc.relation.ispartof       The British Journal for the Philosophy of Science
dc.title                   What Is It for a Machine Learning Model to Have a Capability?
dc.type                    Article
dc.identifier.doi          10.1086/732153
dc.identifier.eissn        1464-3537
dc.identifier.issnl        0007-0882
