Objective: To study the special characteristics of native and learner language, in various language pairs and using varied methodologies.
Researchers: Liat Nativ, Chen Gafni, Shuly Wintner. In collaboration with Anat Prior (University of Haifa), Anke Lüdeling (Humboldt-Universität zu Berlin), Noam Ordan and Yuval Nov (University of Haifa).
Status: Complete
Funding: DFG grant no. LU856/13-1.
Most people in the world today use more than one language in their daily lives, and it is estimated that most children grow up with exposure to two or more languages. In addition to individual multilingualism, culture, technology and information are increasingly characterized by a globalization of resources, mediated by translation of materials originating in a wide variety of languages. This innovative and interdisciplinary proposal brings together researchers and methods from corpus linguistics, psycholinguistics and computer science to achieve a better understanding of how individuals use the various languages at their disposal. The findings of the proposed research could then be used to inform language education and translation studies. The proposed research program will investigate several varieties of the language produced by bilinguals, applying the same set of multidisciplinary methods to all of them, in order to establish common characteristics as well as meaningful differences among these varieties. We will use qualitative and quantitative computational methods on language corpora and complement our findings with psycholinguistic experiments.
The Hebrew Essay Corpus is available upon request. ILCoWE, the Israeli Learner Corpus of Written English, is available on GitHub.
Publications
Chen Gafni, Livnat Herzig Sheinfux, Hadar Klunover, Anat Bar Siman Tov, Anat Prior and Shuly Wintner. Analyzing learner language: the case of the Hebrew Learner Essay Corpus. Language Resources and Evaluation 59:685–726. June 2025 📖
Omaima Abboud, Batia Laufer, Noam Ordan, Uliana Sentsova and Shuly Wintner. A corpus of English learners with Arabic and Hebrew backgrounds. Language Resources and Evaluation 59:591–599. March 2025 📖
Liat Nativ, Yuval Nov, Noam Ordan, Shuly Wintner and Anat Prior. Do more proficient writers use fewer cognates in L2? A computational approach. Bilingualism: Language and Cognition 27(1):84–94. 2024 📖
Isabelle Nguyen and Shuly Wintner. Predicting the Proficiency Level of Nonnative Hebrew Authors. Proceedings of the Language Resources and Evaluation Conference, pages 5356–5365. Marseille, France. June 2022 📖
Chen Gafni, Anat Prior and Shuly Wintner. The Hebrew Essay Corpus. Proceedings of the Language Resources and Evaluation Conference, pages 5580–5586. Marseille, France. June 2022 📖