Mostrar el registro sencillo del ítem

Cognitive inference as the main predictor of AI reliability in automated behavioral coding of parent–child interactions

dc.rights.licenseAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.contributor.authorAmorocho, José
dc.contributor.authorBalsa, Ana
dc.contributor.authorGiraldo-Huertas, Juan José
dc.contributor.authorBloomfield, Juanita
dc.contributor.authorPatrone, Paula
dc.contributor.authorCid, Alejandro
dc.date.accessioned2026-02-27T14:00:24Z
dc.date.available2026-02-27T14:00:24Z
dc.date.issued2026es
dc.identifier.urihttps://hdl.handle.net/20.500.12806/2795
dc.format.extent31 p.es
dc.format.mimetypetext/plaines
dc.languageenges
dc.rightsAbiertoes
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.titleCognitive inference as the main predictor of AI reliability in automated behavioral coding of parent–child interactionses
dc.typeArtículoes
dc.type.versionAceptadaes
dc.description.abstractenglishObservational coding of parent–child interactions is a gold standard in developmental science but remains unscalable. Multimodal generative AI could help, yet its reliability and failure modes are not well characterized. We benchmarked a multi-agent pipeline (GABRIEL) against a multi-rater expert consensus when scoring 22 PICCOLO items on 156 ten-minute free-play interactions from Uruguay. Agreement was summarized with Percent Agreement (PA) and Cohen’s κ, and disagreement with a unit-free normalized mean squared error (nMSE = MSE/Var(Yitem)). A priori item classes indexed cognitive inference (Low/Medium/High). Final calibration yielded modest agreement (PA = 50.7%, κ = .216). Disagreement was chiefly structured by inference (Kruskal–Wallis H = 308.70, p<.001), a pattern that persisted in late iterations. The model also overused the middle category (1) and underused “2.” No systematic differences in nMSE emerged by sex, age quartile, or maternal education. We conclude that generative AI is promising for scalable detection of concrete, low-inference behaviors, whereas high-inference judgments still require expert adjudication. A human-in-the-loop, co-intelligence workflow aligns current strengths with ethical oversight and supports equitable deployment at scale.es
dc.subject.keywordGenerative AIes
dc.subject.keywordParent-Child Interactiones
dc.subject.keywordObservational Codinges
dc.subject.keywordInter-Rater Reliabilityes
dc.subject.keywordPICCOLOes
dc.subject.keywordMultimodal AIes

Ver ítem
Nombre:
Amorocho, Balsa, Giraldo-Huertas, ...
Tamaño:
1.217Mb
Formato:
PDF
[File]license_rdf (805bytes )
Nombre:
license_rdf
Tamaño:
805bytes
Formato:
application/rdf+xml

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional