Article: Mutual information and the encoding of contingency tables

Title: Mutual information and the encoding of contingency tables
Authors: Jerdee, Maximilian; Kirkley, Alec; Newman, M. E. J.
Issue Date: 5-Dec-2024
Publisher: American Physical Society
Citation: Physical Review E, 2024, v. 110, n. 6
Abstract: Mutual information is commonly used as a measure of similarity between competing labelings of a given set of objects, for example to quantify performance in classification and community detection tasks. As argued recently, however, the mutual information as conventionally defined can return biased results because it neglects the information cost of the so-called contingency table, a crucial component of the similarity calculation. In principle the bias can be rectified by subtracting the appropriate information cost, leading to the modified measure known as the reduced mutual information, but in practice one can only ever compute an upper bound on this information cost, and the value of the reduced mutual information depends crucially on how good a bound is established. In this paper we describe an improved method for encoding contingency tables that gives a substantially better bound in typical use cases and approaches the ideal value in the common case where the labelings are closely similar, as we demonstrate with extensive numerical results.
Persistent Identifier: http://hdl.handle.net/10722/366329
ISSN: 2470-0045
2023 Impact Factor: 2.2
2023 SCImago Journal Rankings: 0.805
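To make the quantities in the abstract concrete, the following is a minimal Python sketch of the conventional mutual information between two labelings, computed from their contingency table, together with a reduced mutual information formed by subtracting an upper bound on the table's information cost. The function names and the flat row-composition bound used here are ours, chosen for illustration; the sketch does not reproduce the improved encoding that the paper itself develops, which gives a substantially tighter bound.

from collections import Counter
from math import lgamma, log

def log_binom(n, k):
    # Natural log of the binomial coefficient C(n, k).
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)

def mutual_information(labels1, labels2):
    # Conventional mutual information between two labelings of the same
    # n objects, in nats, summed over all objects (n times the
    # per-object value).  n_rs is the contingency table counting objects
    # with label r in the first labeling and s in the second; a_r and
    # b_s are its row and column margins.
    assert len(labels1) == len(labels2)
    n = len(labels1)
    table = Counter(zip(labels1, labels2))
    a = Counter(labels1)
    b = Counter(labels2)
    return sum(n_rs * log(n * n_rs / (a[r] * b[s]))
               for (r, s), n_rs in table.items())

def reduced_mutual_information(labels1, labels2):
    # Mutual information minus a crude upper bound on the information
    # cost of transmitting the contingency table: row r is encoded as
    # one of C(a_r + S - 1, S - 1) compositions of a_r into S
    # non-negative parts, where S is the number of groups in the second
    # labeling.  This flat bound is illustrative only; the paper's
    # improved encoding is much tighter, especially when the two
    # labelings are closely similar.
    a = Counter(labels1)
    S = len(set(labels2))
    table_cost = sum(log_binom(a_r + S - 1, S - 1) for a_r in a.values())
    return mutual_information(labels1, labels2) - table_cost

# Example: two labelings of ten objects that agree on most of them.
g1 = [0, 0, 0, 0, 1, 1, 1, 2, 2, 2]
g2 = [0, 0, 0, 1, 1, 1, 1, 2, 2, 2]
print(mutual_information(g1, g2))          # total MI in nats
print(reduced_mutual_information(g1, g2))  # MI minus the table-cost bound

Note that both functions return totals over the n objects rather than per-object values; dividing both by n recovers the usual per-object normalization, and the comparison between the two measures is unaffected by this choice.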

 

DC Field / Value
dc.contributor.author: Jerdee, Maximilian
dc.contributor.author: Kirkley, Alec
dc.contributor.author: Newman, M. E. J.
dc.date.accessioned: 2025-11-25T04:18:47Z
dc.date.available: 2025-11-25T04:18:47Z
dc.date.issued: 2024-12-05
dc.identifier.citation: Physical Review E, 2024, v. 110, n. 6
dc.identifier.issn: 2470-0045
dc.identifier.uri: http://hdl.handle.net/10722/366329
dc.description.abstract: Mutual information is commonly used as a measure of similarity between competing labelings of a given set of objects, for example to quantify performance in classification and community detection tasks. As argued recently, however, the mutual information as conventionally defined can return biased results because it neglects the information cost of the so-called contingency table, a crucial component of the similarity calculation. In principle the bias can be rectified by subtracting the appropriate information cost, leading to the modified measure known as the reduced mutual information, but in practice one can only ever compute an upper bound on this information cost, and the value of the reduced mutual information depends crucially on how good a bound is established. In this paper we describe an improved method for encoding contingency tables that gives a substantially better bound in typical use cases and approaches the ideal value in the common case where the labelings are closely similar, as we demonstrate with extensive numerical results.
dc.language: eng
dc.publisher: American Physical Society
dc.relation.ispartof: Physical Review E
dc.rights: This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
dc.title: Mutual information and the encoding of contingency tables
dc.type: Article
dc.identifier.doi: 10.1103/PhysRevE.110.064306
dc.identifier.scopus: eid_2-s2.0-85211069808
dc.identifier.volume: 110
dc.identifier.issue: 6
dc.identifier.eissn: 2470-0053
dc.identifier.issnl: 2470-0045
