A Primer on the Use of Equivalence Testing for Evaluating Measurement Agreement

Dixon, Philip M.; Saint-Maurice, Pedro F.; Kim, Youngwon; Hibbing, Paul; Bai, Yang; Welk, Gregory J.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1249/MSS.0000000000001481
Scopus: eid_2-s2.0-85044027103
PMID: 29135817
WOS: WOS:000427796400025
Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Public Health: Journal/Magazine Articles

Title	A Primer on the Use of Equivalence Testing for Evaluating Measurement Agreement
Authors	Dixon, Philip M.; Saint-Maurice, Pedro F.; Kim, Youngwon; Hibbing, Paul; Bai, Yang; Welk, Gregory J.
Keywords	VALIDATION; CALIBRATION; CONVERGENT VALIDITY; CRITERION VALIDITY
Issue Date	2018
Citation	Medicine and Science in Sports and Exercise, 2018, v. 50, n. 4, p. 837-845. DOI: http://dx.doi.org/10.1249/MSS.0000000000001481
Abstract	© 2018 Lippincott Williams and Wilkins. All rights reserved. Purpose: Statistical equivalence testing is more appropriate than conventional tests of difference to assess the validity of physical activity (PA) measures. This article presents the underlying principles of equivalence testing and gives three examples from PA and fitness assessment research. Methods: The three examples illustrate different uses of equivalence tests. Example 1 uses PA data to evaluate an activity monitor's equivalence to a known criterion. Example 2 illustrates the equivalence of two field-based measures of physical fitness with no known reference method. Example 3 uses regression to evaluate an activity monitor's equivalence across a suite of 23 activities. Results: The examples illustrate the appropriate reporting and interpretation of results from equivalence tests. In the first example, the mean criterion measure is significantly within ±15% of the mean PA monitor. The mean difference is 0.18 METs and the 90% confidence interval of -0.15 to 0.52 is inside the equivalence region of -0.65 to 0.65. In the second example, we chose to define equivalence for these two measures as a ratio of mean values between 0.98 and 1.02. The estimated ratio of mean VO2 values is 0.99, which is significantly (P = 0.007) inside the equivalence region. In the third example, the PA monitor is not equivalent to the criterion across the suite of activities. The estimated regression intercept and slope are -1.23 and 1.06. Neither confidence interval is within the suggested regression equivalence regions. Conclusions: When the study goal is to show similarity between methods, equivalence testing is more appropriate than traditional statistical tests of differences (e.g., ANOVA and t-tests).
Persistent Identifier	http://hdl.handle.net/10722/266826
ISSN	0195-9131; 2023 Impact Factor: 4.1; 2023 SCImago Journal Rankings: 1.470
ISI Accession Number ID	WOS:000427796400025
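
The decision rule reported for the first example in the abstract (a 90% confidence interval lying wholly inside the equivalence region) can be illustrated with a minimal Python sketch. This is not the authors' code. The function names `ci_within_region` and `paired_equivalence` are hypothetical, and the second function substitutes a normal quantile for the Student t quantile so that it runs on the standard library alone, which is only reasonable for large samples.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def ci_within_region(ci_low, ci_high, region_low, region_high):
    """Return True when the whole confidence interval lies inside the region."""
    return region_low < ci_low and ci_high < region_high


def paired_equivalence(diffs, region_low, region_high, alpha=0.05):
    """Large-sample two one-sided tests (TOST) on paired differences.

    Assumption: a normal quantile stands in for the Student t quantile.
    With small samples, use the t quantile with n - 1 degrees of freedom.
    """
    n = len(diffs)
    d_bar = mean(diffs)
    se = stdev(diffs) / sqrt(n)
    # One-sided alpha on each side gives a 100 * (1 - 2 * alpha)% interval,
    # which is why alpha = 0.05 pairs with a 90% confidence interval.
    z = NormalDist().inv_cdf(1 - alpha)
    ci = (d_bar - z * se, d_bar + z * se)
    return d_bar, ci, ci_within_region(ci[0], ci[1], region_low, region_high)


# Values reported in the abstract for Example 1 (METs).
example_1 = ci_within_region(-0.15, 0.52, -0.65, 0.65)  # True

# Hypothetical paired differences, invented purely for illustration.
hypothetical_diffs = [0.1, -0.2, 0.3, 0.0, 0.2, -0.1, 0.4, 0.1]
d_bar, ci, equivalent = paired_equivalence(hypothetical_diffs, -0.65, 0.65)
```

Checking the interval against the region is equivalent to running both one-sided tests at level alpha, so no separate P-value is needed to reach the equivalence decision.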

DC Field	Value	Language
dc.contributor.author	Dixon, Philip M.	-
dc.contributor.author	Saint-Maurice, Pedro F.	-
dc.contributor.author	Kim, Youngwon	-
dc.contributor.author	Hibbing, Paul	-
dc.contributor.author	Bai, Yang	-
dc.contributor.author	Welk, Gregory J.	-
dc.date.accessioned	2019-01-31T07:19:43Z	-
dc.date.available	2019-01-31T07:19:43Z	-
dc.date.issued	2018	-
dc.identifier.citation	Medicine and Science in Sports and Exercise, 2018, v. 50, n. 4, p. 837-845	-
dc.identifier.issn	0195-9131	-
dc.identifier.uri	http://hdl.handle.net/10722/266826	-
dc.description.abstract	© 2018 Lippincott Williams and Wilkins. All rights reserved. Purpose: Statistical equivalence testing is more appropriate than conventional tests of difference to assess the validity of physical activity (PA) measures. This article presents the underlying principles of equivalence testing and gives three examples from PA and fitness assessment research. Methods: The three examples illustrate different uses of equivalence tests. Example 1 uses PA data to evaluate an activity monitor's equivalence to a known criterion. Example 2 illustrates the equivalence of two field-based measures of physical fitness with no known reference method. Example 3 uses regression to evaluate an activity monitor's equivalence across a suite of 23 activities. Results: The examples illustrate the appropriate reporting and interpretation of results from equivalence tests. In the first example, the mean criterion measure is significantly within ±15% of the mean PA monitor. The mean difference is 0.18 METs and the 90% confidence interval of -0.15 to 0.52 is inside the equivalence region of -0.65 to 0.65. In the second example, we chose to define equivalence for these two measures as a ratio of mean values between 0.98 and 1.02. The estimated ratio of mean VO2 values is 0.99, which is significantly (P = 0.007) inside the equivalence region. In the third example, the PA monitor is not equivalent to the criterion across the suite of activities. The estimated regression intercept and slope are -1.23 and 1.06. Neither confidence interval is within the suggested regression equivalence regions. Conclusions: When the study goal is to show similarity between methods, equivalence testing is more appropriate than traditional statistical tests of differences (e.g., ANOVA and t-tests).	-
dc.language	eng	-
dc.relation.ispartof	Medicine and Science in Sports and Exercise	-
dc.subject	VALIDATION	-
dc.subject	CALIBRATION	-
dc.subject	CONVERGENT VALIDITY	-
dc.subject	CRITERION VALIDITY	-
dc.title	A Primer on the Use of Equivalence Testing for Evaluating Measurement Agreement	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1249/MSS.0000000000001481	-
dc.identifier.pmid	29135817	-
dc.identifier.scopus	eid_2-s2.0-85044027103	-
dc.identifier.volume	50	-
dc.identifier.issue	4	-
dc.identifier.spage	837	-
dc.identifier.epage	845	-
dc.identifier.eissn	1530-0315	-
dc.identifier.isi	WOS:000427796400025	-
dc.identifier.issnl	0195-9131	-
