The analysis of marking reliability through the approach of gauge repeatability and reproducibility (GR&R) study: a case of English-speaking test

dc.contributor.author: Sureeyatanapas P.
dc.contributor.author: Panitanarak U.
dc.contributor.author: Kraisriwattana J.
dc.contributor.author: Sarootyanapat P.
dc.contributor.author: O’Connell D.
dc.contributor.correspondence: Sureeyatanapas P.
dc.contributor.other: Mahidol University
dc.date.accessioned: 2024-02-08T18:14:15Z
dc.date.available: 2024-02-08T18:14:15Z
dc.date.issued: 2024-12-01
dc.description.abstract: Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various methods for assessing marking reliability, this study is the first of its kind to introduce an alternative statistical tool, namely the gauge repeatability and reproducibility (GR&R) approach, to the educational context. The study encompasses both intra- and inter-rater reliability, with additional validation using the intraclass correlation coefficient (ICC). Using a case study approach involving three examiners evaluating 30 recordings of a speaking proficiency test, the GR&R method proves more effective than the ICC approach at detecting reliability issues. Furthermore, this research identifies key factors influencing scoring inconsistencies, including group performance estimation, work presentation order, rubric complexity and clarity, the student’s chosen topic, accent familiarity, and recording quality. Importantly, it not only pinpoints these root causes but also suggests practical solutions, thereby enhancing the precision of the measurement system. The GR&R method can offer significant contributions to stakeholders in language proficiency assessment, including educational institutions, test developers, and policymakers. It is also applicable to other performance-based assessments. By addressing reliability issues, this study provides insights to enhance the fairness and accuracy of subjective judgements, ultimately benefiting overall performance comparisons and decision making.
dc.identifier.citation: Language Testing in Asia Vol.14 No.1 (2024)
dc.identifier.doi: 10.1186/s40468-023-00271-z
dc.identifier.eissn: 2229-0443
dc.identifier.scopus: 2-s2.0-85182491063
dc.identifier.uri: https://repository.li.mahidol.ac.th/handle/123456789/95798
dc.rights.holder: SCOPUS
dc.subject: Social Sciences
dc.subject: Arts and Humanities
dc.title: The analysis of marking reliability through the approach of gauge repeatability and reproducibility (GR&R) study: a case of English-speaking test
dc.type: Article
mu.datasource.scopus: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85182491063&origin=inward
oaire.citation.issue: 1
oaire.citation.title: Language Testing in Asia
oaire.citation.volume: 14
oairecerif.author.affiliation: Khon Kaen University International College
oairecerif.author.affiliation: Khon Kaen University
oairecerif.author.affiliation: Mahidol University
