Performance of an AI-Based Tool for Scoring the TTCT-Verbal in the Assessment of Creative Thinking

Authors

DOI: https://doi.org/10.6018/rie.710621
Keywords: creativity, Artificial Intelligence, TTCT-Verbal, prompt engineering

Supporting Agencies

  • Ministerio de Ciencia, Innovación y Universidades del Gobierno de España

Abstract

Considering the specific challenges related to the assessment of the Torrance Tests of Creative Thinking – Verbal Form (TTCT-Verbal) concerning time demands, inter-rater variability, and subjectivity in the assessment of originality, this study preliminarily examines the performance of an artificial intelligence tool based on a large language model to support the scoring of fluency, flexibility, and originality, contrasting results with those of a panel of six human evaluators. A total of 47 protocols from postgraduate students at the Universitat de València were used, of which 30 were selected for the comparative analysis between the AI system and the human average. Similarly, the Intraclass Correlation Coefficient ICC (3,1), mean absolute error, root mean square error, Pearson correlations, and Bland-Altman plots were calculated. Results indicate close mean scores between both systems and low absolute error, observing the strongest association in originality, while fluency and flexibility showed lower relative consistency. Graphical analyses allowed the distribution of differences to be explored without identifying evident systematic bias, although these results should be interpreted descriptively. As such, it is concluded that the tool shows promising, but dimension-dependent performance, especially with regards to originality. Its use appears more appropriate as a complementary support within hybrid systems for assessing creative thinking than as a replacement for expert human judgment.

Downloads

Download data is not yet available.
Metrics
Views/Downloads
  • Abstract
    0
  • PDF (Español (España))
    0
  • PDF
    0

References

Acar, S., Berthiaume, K., Grajzel, K., Dumas, D., Flemister, Ch., y Organisciak, P. (2023). Applying automated originality scoring to the verbal form of Torrance Tests of Creative Thinking. Gifted Child Quarterly, 67(1), 3–17. https://doi.org/10.1177/00169862211061874

Alcántara Santuario, A. (2023). La inteligencia artificial y sus implicaciones en educación. Perfiles Educativos, 45(Especial), 5–8. https://doi.org/10.22201/iisue.24486167e.2023.Especial.61687

Alfonso-Benlliure, V., Checa, I. y Meléndez, J. C. (2025). Long in the tooth for creativity? Differences in divergent thinking between young and older adults. Thinking Skills and Creativity, 57, 101847 https://doi.org/10.1016/j.tsc.2025.101847

Barragán-Giraldo, D. F., Pirela-Morillo, J. E., y Riaño Diaz, J. A. (2024). Datificación en contextos educativos. Entre subjetivación y ética. REICE. Revista Iberoamericana Sobre Calidad, Eficacia y Cambio en Educación, 22(2), 119-132. https://doi.org/10.15366/reice2024.22.2.007

Beaty, R. E., Johnson, D. R., Zeitlen, D. C., y Forthmann, B. (2022). Semantic distance and the alternate uses task: Recommendations for reliable automated assessment of originality. Creativity Research Journal, 34(3), 245–260. https://doi.org/10.1080/10400419.2022.2025720

Bland, J. M., y Altman, D. G. (1986). Statistical methods for assessing agreement between two methods of clinical measurement. The Lancet, 327(8476), 307–310. https://doi.org/10.1016/S0140-6736(86)90837-8

Bland, J. M., y Altman, D. G. (1999). Measuring agreement in method comparison studies. Statistical Methods in Medical Research, 8(2), 135–160. https://doi.org/10.1177/096228029900800204

Cesário, V., y Nisi, V. (2023). Designing mobile museum experiences for teenagers. Museum Management and Curatorship, 38(3), 272-292. https://doi.org/10.1080/09647775.2022.2111329

Cisternas San Martín, N., Guzman Muñoz, E., y Rivas Poblete, A. (2025). Inteligencia Artificial como asistente para desarrollar un currículum de educación contínua basado en competencias. Revista de Investigación Educativa, (43). https://doi.org/10.6018/rie.619441

Dumas, D., y Runco, M. (2018). Objectively Scoring Divergent Thinking Tests for Originality: A Re-Analysis and Extension. Creativity Research Journal, 30(4), 466–468. https://doi.org/10.1080/10400419.2018.1544601

Giavarina, D. (2015). Understanding Bland Altman analysis. Biochemia Medica, 25(2), 141–151. https://doi.org/10.11613/BM.2015.015

Goenechea, C., y Valero-Franco, C. (2024). Educación e Inteligencia Artificial: Un Análisis desde la Perspectiva de los Docentes en Formación. REICE. Revista Iberoamericana Sobre Calidad, Eficacia y Cambio en Educación, 22(2), 33-50. https://doi.org/10.15366/reice2024.22.2.002

Guilford, J. P. (1950). Creativity. American Psychologist, 5(9), 444–454. https://doi.org/10.1037/h0063487

Guilford, J. P. (1967). The nature of human intelligence. McGraw-Hill.

Guzik, E. E., Byrge, C., y Gilde, C. (2023). The originality of machines: AI takes the Torrance Test. Journal of Creative Behavior, 57(1), 45–58. https://doi.org/10.1002/jocb.546

Huerta, R. (2023). Artivismo y creatividad en la formación docente: cementerios, arte y literatura. Arte y Políticas de Identidad, 29, 65-86. https://doi.org/10.6018/reapi.598721

Huerta, R. (2025). Aprendizaje basado en proyectos (ABP). Visitas a museos y fotografía creativa en la formación del profesorado. Observar. Revista Electrònica De Didàctica De Les Arts, (19), 48–81. https://doi.org/10.1344/observar.2025.19.3

Huerta, R., y Alfonso-Benlliure, V. (2025). Museums as Catalysts for Creativity in Adolescence: A Review. Heritage, 8(8), 327. https://doi.org/10.3390/heritage8080327

Jarquín-Ramírez, M.-R., Alonso-Martínez, H., y Díez-Gutiérrez, E.-J. (2025). Chat GPT para un Sistema Educativo Justo, Democrático y Transformador. REICE. Revista Iberoamericana Sobre Calidad, Eficacia y Cambio en Educación, 23(3), 1-14. https://doi.org/10.15366/reice2025.23.3.001

Kim, K. H. (2006). Can we trust creativity tests? A review of the Torrance Tests of Creative Thinking (TTCT). Creativity Research Journal, 18(1), 3–14. https://doi.org/10.1207/s15326934crj1801_2

Kim, K. H. (2017). The Torrance Tests of Creative Thinking - Figural or Verbal: Which one should we use? Creativity. Theories – Research – Applications, 4(2), 302–321. https://doi.org/10.1515/ctra-2017-0015

Koo, T. K., y Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine, 15(2), 155–163. https://doi.org/10.1016/j.jcm.2016.02.012

Mella-Mella, F. M., Calatayud Salom, M. A., y Lucas Calatayud, Á. J. (2026). Uso de la IA como Estrategia de y para el Aprendizaje en Educación Superior. REICE. Revista Iberoamericana Sobre Calidad, Eficacia y Cambio en Educación, 24(1). https://doi.org/10.15366/reice2026.24.1.007

Organisciak, P., Acar, S., Dumas, D., y Berthiaume, K. (2023). Beyond semantic distance: Automated scoring of divergent thinking greatly improves with large language models. Thinking Skills and Creativity, 49, 101356. https://doi.org/10.1016/j.tsc.2023.101356

Perchtold-Stefan, C. M., Fink, A., Rominger, C., y Papousek, I. (2024). Social exclusion increases antisocial tendencies: Evidence from retaliatory ideation in a malevolent creativity task. Psychology of Aesthetics, Creativity, and the Arts, 18(6), 1014–1025. https://doi.org/10.1037/aca0000500

Reiter-Palmon, R., Forthmann, B., y Barbot, B. (2019). Scoring divergent thinking tests: A review and systematic framework. The Psychology of Aesthetics, Creativity, and the Arts, 13(2), 144-152. https://doi.org/10.1037/aca0000227

Runco, M. A. (2023). Updating the Standard Definition of Creativity to Account for the Artificial Creativity of AI. Creativity Research Journal, 37(1), 1 –5. https://doi.org/10.1080/10400419.2023.2257977

Runco, M. A., Millar, G., Acar, S., y Cramond, B. (2010). Torrance Tests of Creative Thinking as predictors of personal and public achievement: A fifty-year follow-up. Creativity Research Journal, 22(4), 361–368. https://doi.org/10.1080/10400419.2010.523393

Sanjurjo Pérez, P., Solana Domínguez, I., y Arana Cuenca, A. (2026). Un año sin móvil: Estudio de caso sobre políticas de prohibición y su influencia en el rendimiento académico. REICE. Revista Iberoamericana sobre Calidad, Eficacia y Cambio en Educación, 24(1). https://doi.org/10.15366/reice2026.24.1.003

Sanz-Leal, M., y Orozco Gómez, M. L. (2025). Validación en español de una escala de Competencia Global en docentes en formación y en servicio. Revista de Investigación Educativa, (43). https://doi.org/10.6018/rie.564141

Silva Fuentealba, E. (2024). ChatGPT como catalizador del pensamiento creativo. [Chat GPT as a Catalyst for Creative Thinking]. European Public & Social Innovation Review, 9, 1-19. https://doi.org/10.31637/epsir-2024-410

Silva-Fuentealba, E., Valdés-León, G., y Oyarzún Yáñez, R. (2025).

Inteligencia artificial en el aula: potenciando la resolución de problemas a través del pensamiento creativo. Revista de Comunicación de la SEECI, 58, 1–19. https://doi.org/10.15198/seeci.2025.58.e927

Torrance, E. P. (1966). Torrance Tests of Creative Thinking: Norms-technical manual. Personnel Press.

Torrance, E. P. (1974). Torrance Tests of Creative Thinking: Norms-technical manual (Research edition). Personnel Press.

Torrance, E. P. (2008). Torrance Tests of Creative Thinking: Technical manual and scoring guide. Scholastic Testing Service.

Verdú-Pina, M., Serrano, V., Grimalt-Álvaro, C., y Usart, M. (2026). Perfiles docentes según la competencia digital autopercibida y el uso de la tecnología: Un análisis de clústeres. Revista de Investigación Educativa, (44). https://doi.org/10.6018/rie.672661

Welter, M. M., Jaarsveld, S., van Leeuwen, C., y Lachmann, T. (2016). Intelligence and Creativity: Over the Threshold Together? Creativity Research Journal, 28(2), 212–218. https://doi.org/10.1080/10400419.2016.1162564

Published
12-06-2026
How to Cite
Huerta, R., & Silva Fuentealba, E. (2026). Performance of an AI-Based Tool for Scoring the TTCT-Verbal in the Assessment of Creative Thinking. Revista De Investigación Educativa, (44). https://doi.org/10.6018/rie.710621
Issue
Section
Articles