Sensitivity and specificity

Calculations

Two-by-two table for a diagnostic test
		Disease
		Present	Absent
Test result	Positive	Cell A	Cell B	Total with a positive test
	Negative	Cell C	Cell D	Total with a negative test
		Total with disease	Total without disease

Sensitivity and specificity

{\mbox{Sensitivity of a test}}=\left({\frac {\mbox{Total with a positive test}}{{\mbox{Total }}without{\mbox{ disease}}}}\right)=\left({\frac {\mbox{Cell A}}{{\mbox{Cell A}}+{\mbox{Cell C}}}}\right)

{\mbox{Specificity of a test}}=\left({\frac {\mbox{Total with a negative test}}{{\mbox{Total }}without{\mbox{ disease}}}}\right)=\left({\frac {\mbox{Cell D}}{{\mbox{Cell B}}+{\mbox{Cell D}}}}\right)

Predictive value of tests

The predictive values of diagnostic tests are defined as "in screening and diagnostic tests, the probability that a person with a positive test is a true positive (i.e., has the disease), is referred to as the predictive value of a positive test; whereas, the predictive value of a negative test is the probability that the person with a negative test does not have the disease. Predictive value is related to the sensitivity and specificity of the test."^[2]

{\mbox{Positive predictive value}}=\left({\frac {{\mbox{Total }}with{\mbox{ disease and a positive test}}}{\mbox{Total with a positive test}}}\right)=\left({\frac {\mbox{Cell A}}{{\mbox{Cell A}}+{\mbox{Cell B}}}}\right)

{\mbox{Negative predictive value}}=\left({\frac {{\mbox{Total }}without{\mbox{ disease and a negative test}}}{\mbox{Total with a negative test}}}\right)=\left({\frac {\mbox{Cell D}}{{\mbox{Cell C}}+{\mbox{Cell D}}}}\right)

Summary statistics for diagnostic ability

While simply reporting the accuracy of a test seems intuitive, the accuracy is heavily influenced by the prevalence of disease.^[3] For example, if the disease occurred with a a frequency of one in one thousand, then simply guessing that all patients do not have disease will yield an accuracy of over 99%.

With the arrival of many biomarkers that may be expensive diagnostic tests, much research has addressed how to summarize the incremental value of a new expensive test to existing diagnostic methods.^[4]

Area under the ROC curve

For more information, see: Receiver operating characteristic curve.

The area under the receiver operating characteristic curve (ROC curve), or c-index has been proposed. The c-index varies from 0 to 1 and a result of 0.5 indicates that the diagnostic test does not add to guessing.^[5] Variations have been proposed.^[6]^[7]

Bayes Information Criterion

The Bayes Information Criterion has been proposed by Schwarz in 1978.^[8]

Sum of sensitivity and specificity

This easy metric is called S+T.^[9] It varies from 0 to 2 and a result of 1 indicates that the diagnostic test does not add to guessing.

Predictiveness curve

A graph of the predictiveness curve has been proposed.^[10]

Proportionate reduction in uncertainty score

The proportionate reduction in uncertainty score (PRU) has been proposed.^[11]

Integrated sensitivity and specificity

This measure has been proposed as an alternative to the area of the the receiver operating characteristic curve.^[12]

Reclassification tables

This measure has been proposed as an alternative to the area of the the receiver operating characteristic curve.^[12] This method allows calculating a 'reclassification index' or 'reclassification rate', or 'net reclassification improvement' (NRI)^[12]

The clinical net reclassification improvement (CNRI) is a variation that is the NRI only for the subjects at intermediate risk of disease.^[13]

Sequential scoring

Sequential scoring has been proposed in order to isolate the effect of a new, expensive diagnostic test.^[14]

Threats to validity of calculations

Various biases incurred during the study and analysis of a diagnostic tests can affect the validity of the calculations. An example is spectrum bias.

Poorly designed studies may overestimate the accuracy of a diagnostic test.^[15]

References

↑ National Library of Mediicne. Sensitivity and specificity. Retrieved on 2007-12-09.
↑ National Library of Mediicne. Predictive value of tests. Retrieved on 2007-12-09.
↑ Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA (May 1982). "Evaluating the yield of medical tests". JAMA 247 (18): 2543–6. PMID 7069920. ^[e]
↑ Cornell J, Mulrow CD, Localio AR (December 2008). "Diagnostic test accuracy and clinical decision making". Ann. Intern. Med. 149 (12): 904–6. PMID 19075211. ^[e]
↑ Hanley JA, McNeil BJ (April 1982). "The meaning and use of the area under a receiver operating characteristic (ROC) curve". Radiology 143 (1): 29–36. PMID 7063747. ^[e]
↑ Walter SD (July 2005). "The partial area under the summary ROC curve". Stat Med 24 (13): 2025–40. DOI:10.1002/sim.2103. PMID 15900606. Research Blogging.
↑ Bangdiwala SI, Haedo AS, Natal ML, Villaveces A (September 2008). "The agreement chart as an alternative to the receiver-operating characteristic curve for diagnostic tests". J Clin Epidemiol 61 (9): 866–74. DOI:10.1016/j.jclinepi.2008.04.002. PMID 18687288. Research Blogging.
↑ Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6, 461–464. DOI:10.1214/aos/1176344136 Google Scholar
↑ Connell FA, Koepsell TD (May 1985). "Measures of gain in certainty from a diagnostic test". Am. J. Epidemiol. 121 (5): 744–53. PMID 4014166. ^[e]
↑ Pepe, Margaret S.; Ziding Feng, Ying Huang, Gary Longton, Ross Prentice, Ian M. Thompson, Yingye Zheng (2008-02-01). "Integrating the Predictiveness of a Marker with Its Performance as a Classifier". Am. J. Epidemiol. 167 (3): 362-368. DOI:10.1093/aje/kwm305. PMID 17982157. Retrieved on 2008-12-17. Research Blogging.
↑ Coulthard MG (May 2007). "Quantifying how tests reduce diagnostic uncertainty". Arch. Dis. Child. 92 (5): 404–8. DOI:10.1136/adc.2006.111633. PMID 17158858. Research Blogging.
↑ ^12.0 ^12.1 ^12.2 Pencina MJ, D'Agostino RB, D'Agostino RB, Vasan RS (January 2008). "Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond". Stat Med 27 (2): 157–72; discussion 207–12. DOI:10.1002/sim.2929. PMID 17569110. Research Blogging.
↑ Cook NR (January 2008). "Comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 191–5. DOI:10.1002/sim.2987. PMID 17671959. Research Blogging.
↑ Greenland S (January 2008). "The need for reorientation toward cost-effective prediction: comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 199–206. DOI:10.1002/sim.2995. PMID 17729377. Research Blogging.
↑ Lijmer JG, Mol BW, Heisterkamp S, et al (September 1999). "Empirical evidence of design-related bias in studies of diagnostic tests". JAMA 282 (11): 1061–6. PMID 10493205. ^[e]

[MeSH_SnSp-1] National Library of Mediicne. Sensitivity and specificity. Retrieved on 2007-12-09.

[MeSH_PV-2] National Library of Mediicne. Predictive value of tests. Retrieved on 2007-12-09.

[pmid7069920-3] Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA (May 1982). "Evaluating the yield of medical tests". JAMA 247 (18): 2543–6. PMID 7069920. ^[e]

[pmid19075211-4] Cornell J, Mulrow CD, Localio AR (December 2008). "Diagnostic test accuracy and clinical decision making". Ann. Intern. Med. 149 (12): 904–6. PMID 19075211. ^[e]

[pmid7063747-5] Hanley JA, McNeil BJ (April 1982). "The meaning and use of the area under a receiver operating characteristic (ROC) curve". Radiology 143 (1): 29–36. PMID 7063747. ^[e]

[pmid15900606-6] Walter SD (July 2005). "The partial area under the summary ROC curve". Stat Med 24 (13): 2025–40. DOI:10.1002/sim.2103. PMID 15900606. Research Blogging.

[pmid18687288-7] Bangdiwala SI, Haedo AS, Natal ML, Villaveces A (September 2008). "The agreement chart as an alternative to the receiver-operating characteristic curve for diagnostic tests". J Clin Epidemiol 61 (9): 866–74. DOI:10.1016/j.jclinepi.2008.04.002. PMID 18687288. Research Blogging.

[8] Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6, 461–464. DOI:10.1214/aos/1176344136 Google Scholar

[pmid4014166-9] Connell FA, Koepsell TD (May 1985). "Measures of gain in certainty from a diagnostic test". Am. J. Epidemiol. 121 (5): 744–53. PMID 4014166. ^[e]

[10] Pepe, Margaret S.; Ziding Feng, Ying Huang, Gary Longton, Ross Prentice, Ian M. Thompson, Yingye Zheng (2008-02-01). "Integrating the Predictiveness of a Marker with Its Performance as a Classifier". Am. J. Epidemiol. 167 (3): 362-368. DOI:10.1093/aje/kwm305. PMID 17982157. Retrieved on 2008-12-17. Research Blogging.

[pmid17158858-11] Coulthard MG (May 2007). "Quantifying how tests reduce diagnostic uncertainty". Arch. Dis. Child. 92 (5): 404–8. DOI:10.1136/adc.2006.111633. PMID 17158858. Research Blogging.

[pmid17569110-12] 12.0 ^12.1 ^12.2 Pencina MJ, D'Agostino RB, D'Agostino RB, Vasan RS (January 2008). "Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond". Stat Med 27 (2): 157–72; discussion 207–12. DOI:10.1002/sim.2929. PMID 17569110. Research Blogging.

[pmid17671959-13] Cook NR (January 2008). "Comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 191–5. DOI:10.1002/sim.2987. PMID 17671959. Research Blogging.

[pmid17729377-14] Greenland S (January 2008). "The need for reorientation toward cost-effective prediction: comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 199–206. DOI:10.1002/sim.2995. PMID 17729377. Research Blogging.

[pmid10493205-15] Lijmer JG, Mol BW, Heisterkamp S, et al (September 1999). "Empirical evidence of design-related bias in studies of diagnostic tests". JAMA 282 (11): 1061–6. PMID 10493205. ^[e]

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

Sensitivity and specificity

Contents

Calculations