Testing a computational model of causative overgeneralizations : child judgment and production data from English, Hebrew, Hindi, Japanese and K’iche’

Ambridge, B, Doherty, L, Maitreyee, R, Tatsumi, T, Zicherman, S, Mateo Pedro, P, Kawakami, A, Bidgood, A ORCID: https://orcid.org/0000-0002-9719-4256, Pye, C, Narasimhan, B, Arnon, I, Bekman, D, Efrati, A, Can Pixabaj, SF, Pelíz, MM, Mendoza, MJ, Samanta, S, Campbell, S, McCauley, S, Berman, R, Sharma, DM, Nair, RB and Fukumura, K 2021, 'Testing a computational model of causative overgeneralizations : child judgment and production data from English, Hebrew, Hindi, Japanese and K’iche’' , Open Research Europe .

PDF (Version 1 (pre-Open Peer Review)) - Published Version
Available under License Creative Commons Attribution 4.0.

Download (7MB) | Preview
Access Information: This is a research article which is undergoing open peer-review. Updated versions of this article will be accessible via the link above.


How do language learners avoid the production of verb argument structure overgeneralization errors (*The clown laughed the man c.f. The clown made the man laugh), while retaining the ability to apply such generalizations productively when appropriate? This question has long been seen as one that is both particularly central to acquisition research and particularly challenging. Focussing on causative overgeneralization errors of this type, a previous study reported a computational model that learns, on the basis of corpus data and human-derived verb-semantic-feature ratings, to predict adults’ by-verb preferences for less- versus more-transparent causative forms (e.g., *The clown laughed the man vs The clown made the man laugh) across English, Hebrew, Hindi, Japanese and K’iche Mayan. Here, we tested the ability of this model to explain binary grammaticality judgment data from children aged 4;0-5;0, and elicited-production data from children aged 4;0-5;0 and 5;6-6;6 (N=48 per language). In general, the model successfully simulated both children’s judgment and production data, with correlations of r=0.5-0.6 and r=0.75-0.85, respectively, and also generalized to unseen verbs. Importantly, learners of all five languages showed some evidence of making the types of overgeneralization errors – in both judgments and production – previously observed in naturalistic studies of English (e.g., *I’m dancing it). Together with previous findings, the present study demonstrates that a simple discriminative learning model can explain (a) adults’ continuous judgment data, (b) children’s binary judgment data and (c) children’s production data (with no training of these datasets), and therefore constitutes a plausible mechanistic account of the retreat from overgeneralization.

Item Type: Article
Schools: Schools > School of Health and Society
Journal or Publication Title: Open Research Europe
Publisher: Open Research Europe/European Commission
ISSN: 2732-5121
Related URLs:
Funders: European Union Horizon 2020, Economic and Social Research Council (ESRC)
Depositing User: A Bidgood
Date Deposited: 25 Mar 2021 12:06
Last Modified: 28 Aug 2021 10:56
URI: http://usir.salford.ac.uk/id/eprint/59926

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)


Downloads per month over past year