On the Relationship between the Reliability and Accuracy of Bio-Behavioral Diagnoses: Simple Math to the Rescue

Authors

  • Dom Cicchetti Department of Biometry, Yale University School of Medicine, New Haven, CT 06520, USA

DOI:

https://doi.org/10.6000/1929-6029.2015.04.02.2

Keywords:

Binary Diagnoses, Diagnostic Reliability, Diagnostic Accuracy.

Abstract

An equivalence between the J statistic (Jack Youden, 1950) and the Kappa statistic (K), Cohen (1960), was discovered by Helena Kraemer (1982). J is defined as: [Sensitivity (Se) + Specificity (Sp)] - 1. The author (2011) added the remaining two validity components to the J Index, namely, Predicted Positive Accuracy (PPA) and Predicted Negative Accuracy (PNA). The resulting D Index or D = [(Se + Sp) + (PPA + PNA) - 1] / 2. The purpose of this research is to compare J and D as estimates of K, using both actual and simulated data sets. The actual data consisted of ratings of clinical depression and self-reports of gonorrhea. The simulated data sets represented binary diagnoses when the percentages of Negative and Positive cases were: (Identical; Slightly varying; Mildly varying; Moderately varying; or Markedly varying diagnostic patterns, For both the diagnosis of clinical depression, and the self-reports of gonorrhea, D produced closer approximations to Kappa. For the simulated data, under both identical and slightly different patterns of assigning Negative and Positive binary diagnoses, K, D and J produced identical results. While J produced acceptably close values to K under the condition of Mild discrepancies in the proportions of Negative and Positive cases, D continued to more closely approximate K. While D more closely estimated K under Markedly varying diagnostic patterns, D produced values under this extreme condition that were closer than would have been predicted. The significance of these findings for future research is discussed.

Author Biography

  • Dom Cicchetti, Department of Biometry, Yale University School of Medicine, New Haven, CT 06520, USA

    Biometry

References

Kraemer HC. Estimating false alarms and missed events from interobserver agreement: Comment on Kaye. Psychol Bull 1982; 92: 749-754. http://dx.doi.org/10.1037/0033-2909.92.3.749

Youden WJ. J Index for rating diagnostic tests. Cancer 1950; 3: 32-35. http://dx.doi.org/10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3

Cicchetti DV. On the reliability and accuracy of the Evaluative Method for identifying evidence-based practices in Autism. In: Reichow B, Doehring P, Cicchetti DV, Volkmar F, Eds. Evidence-based practices and treatments for children with Autism. New York, NY: Springer, 2011; pp. 41-51. http://dx.doi.org/10.1007/978-1-4419-6975-0_3

Schimmel MS, Kaplan M, Soll Rf. Blood transfusion in the neonate- Where are we today? In: Peterson BR, Ed. New developments in blood transfusion research. New York, NY: Nova Science 2006; pp. 1-15.

Feinstein AR. Clinimetrics. New Haven CT: Yale University Press, 1987.

Fleiss JL, Levin B, Cho Paik M. Statistical methods for rates and proportions. New York, NY: Wiley, 2003. http://dx.doi.org/10.1002/0471445428

Kraemer HC, Kazdin AE, Offord DR, Kessler RC, Jensen PS, Kupfer DJ. Coming to terms with the terms of risk. Arch Gen Psychiat 1982; 54: 337-343. http://dx.doi.org/10.1001/archpsyc.1997.01830160065009

Nelson L, Cicchetti DV. Validity of the MMPI Depression Scale for outpatients. Psychol Assess 1991; 3: 55-59. http://dx.doi.org/10.1037/1040-3590.3.1.55

Niccolai LM, Kershaw TS, Lewis JB, Cicchetti DV, Ethier KA, Ickovics J. Data collection for sexually transmitted disease diagnoses: A comparison of self-reports, medical record reviews, and state health department reports. Annals Epidemiol 2005; 15: 236-242. http://dx.doi.org/10.1016/j.annepidem.2004.07.093

Cohen J. A coefficient of agreementfor nominal scales. Educ Psychol Meas 1960; 23: 37-46. http://dx.doi.org/10.1177/001316446002000104

Fleiss JL, Cohen J, Everitt BS. Large sample standard errors of kappa and weighted kappa. Psychol Bull 1969; 72: 323-327. http://dx.doi.org/10.1037/h0028106

Cicchetti DV, Fleiss JL. Comparison of the null distributions of kappa and the C ordinal statistic. Applied Psychol Meas 1977; 1: 195-201. http://dx.doi.org/10.1177/014662167700100206

Cicchetti DV. Testing the normal approximation and minimal sample size requirements of weighted kappa when the number of categories is large. Applied Psychol Meas 1981; 5: 101-104. http://dx.doi.org/10.1177/014662168100500114

Cicchetti DV, Volkmar F, Klin A, Showalter D. Diagnosing Autism using ICD-10 criteria: A comparison of neural networks and standard multivariate procedures. Child Neuropsychol 1995; 1: 26-37. http://dx.doi.org/10.1080/09297049508401340

Cicchetti DV, Sparrow SS. Developing criteria for establishing interrater reliability of specific items: Applications to assessments of adaptive behavior. Amer J Mental Deficiency 1981; 86: 127-137.

Landis JR, Koch GG. The measure of observer agreement for categorical data. Biometrics 1977; 33: 159-174. http://dx.doi.org/10.2307/2529310

Cicchetti DV, Fontana A, Showalter D. Establishing reliability when multiple examiners evaluate a single case- Part II: Applications to symptoms of Post-Traumatic Stress Disorder (PTSD). Internat J Stat Med Research 2014; 3.

Downloads

Published

2015-05-21

Issue

Section

General Articles

How to Cite

On the Relationship between the Reliability and Accuracy of Bio-Behavioral Diagnoses: Simple Math to the Rescue. (2015). International Journal of Statistics in Medical Research, 4(2), 172-179. https://doi.org/10.6000/1929-6029.2015.04.02.2

Similar Articles

11-20 of 103

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)