The reliability of clinical measurements is critical to medical research and clinical practice. Newly proposed methods are assessed in terms of their reliability, which includes their repeatability and their intra- and interobserver reproducibility. In general, new methods that provide repeatable and reproducible results are compared with established methods used clinically. This paper describes common statistical methods for assessing reliability and agreement between methods, including the intraclass correlation coefficient, coefficient of variation, Bland-Altman plot, limits of agreement, percent agreement, and the kappa statistic. These methods are more appropriate for estimating reliability than hypothesis testing or simple correlation methods. However, some reliability indices, especially unscaled ones, do not define the acceptable level of error in the actual scale and units of measurement. The Bland-Altman plot is more useful for method comparison studies because it assesses the relationship between the differences and the magnitude of paired measurements, the bias (as the mean difference), and the degree of agreement (as the limits of agreement) between two methods or conditions (e.g., observers). Caution is needed when handling heteroscedasticity of the differences between two measurements, when using the means of repeated measurements per method in method comparison studies, and when comparing reliability across studies. Additionally, independence of the measuring processes, the combined use of different estimation approaches, clear descriptions of the calculations used to produce indices, and clinical acceptability should be emphasized in reliability and method comparison studies.
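As a minimal sketch of the bias and limits-of-agreement calculations summarized above, the following Python snippet computes the mean difference and 95% limits of agreement for paired measurements from two methods; the arrays `method_a` and `method_b` are hypothetical illustrative data, not values from the cited study.

```python
import numpy as np

def bland_altman(a, b):
    """Bias (mean difference) and 95% limits of agreement for paired measurements."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    diff = a - b                      # differences between the two methods
    mean = (a + b) / 2.0              # pairwise means (x-axis of the Bland-Altman plot)
    bias = diff.mean()                # systematic bias between methods
    sd = diff.std(ddof=1)             # standard deviation of the differences
    loa = (bias - 1.96 * sd, bias + 1.96 * sd)  # 95% limits of agreement
    return mean, diff, bias, loa

# Hypothetical paired readings from two methods measuring the same quantity
method_a = [10.2, 11.5, 9.8, 12.1, 10.9, 11.3]
method_b = [10.0, 11.9, 9.5, 12.4, 10.6, 11.1]
_, _, bias, (lower, upper) = bland_altman(method_a, method_b)
print(f"bias = {bias:.2f}, 95% LoA = [{lower:.2f}, {upper:.2f}]")
```

In practice, the differences are plotted against the pairwise means to check whether their spread depends on the magnitude of measurement (heteroscedasticity) before the limits of agreement are interpreted.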
Undergraduate medical students should learn oral presentation skills, which are central to physician-physician communication. The purpose of this study was to compare checklist scores with global ratings for the evaluation of oral case presentations and to investigate inter-rater agreement in the scoring of checklists.
The study group included twenty-one teams of undergraduate medical students who completed a 2-week clerkship in the Department of Laboratory Medicine of Mokdong Hospital, School of Medicine, Ewha Womans University, from January 2005 to October 2006. Three faculty raters independently evaluated oral case presentations using checklists composed of 5 items. Consensus global ratings were determined after discussion. Inter-rater agreement was measured using the intraclass correlation coefficient (ICC); ICC values closer to 1.0 indicate higher inter-rater agreement.
The mean of the consensus global ratings was significantly higher than the mean of the checklist scores by the three faculty raters (12.6 ± 1.7 vs. 11.1 ± 2.0).
These results suggest that checklist scoring by faculty raters could be a useful tool for evaluating oral case presentations, provided that the checklist is revised to be less ambiguous and more objective and that the faculty raters are trained in evaluating oral case presentations.
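The abstract does not specify which ICC form was used; a minimal sketch of one common choice for this design, ICC(2,1) (two-way random effects, absolute agreement, single rater), is shown below. The `scores` matrix is hypothetical checklist data (student teams in rows, raters in columns), not the study's data.

```python
import numpy as np

def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is an (n_subjects x n_raters) array with no missing values.
    """
    x = np.asarray(ratings, dtype=float)
    n, k = x.shape
    grand = x.mean()
    row_means = x.mean(axis=1)        # per-subject means
    col_means = x.mean(axis=0)        # per-rater means

    msr = k * np.sum((row_means - grand) ** 2) / (n - 1)   # between-subjects mean square
    msc = n * np.sum((col_means - grand) ** 2) / (k - 1)   # between-raters mean square
    sse = np.sum((x - row_means[:, None] - col_means[None, :] + grand) ** 2)
    mse = sse / ((n - 1) * (k - 1))                         # residual mean square

    # Shrout & Fleiss formula for ICC(2,1)
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Hypothetical checklist totals: 6 student teams scored by 3 raters
scores = [[11, 12, 10],
          [13, 13, 12],
          [ 9, 10,  9],
          [12, 11, 12],
          [10, 10, 11],
          [14, 13, 13]]
print(f"ICC(2,1) = {icc_2_1(scores):.3f}")
```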