WALTER VAN HEUVEN WALTER.VANHEUVEN@NOTTINGHAM.AC.UK
Associate Professor
SUBTLEX-CY: A new word frequency database for Welsh
van Heuven, Walter J. B.; Payne, Joshua S; Jones, Manon W
Authors
Joshua S Payne
Manon W Jones
Abstract
We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. An experiment comprising a lexical decision task examined SUBTLEX-CY frequency estimates against words with inconsistent frequencies in a much smaller Welsh corpus that is often used by researchers, the Cronfa Electroneg o’r Gymraeg (CEG), and three other Welsh word frequency databases. Words were selected that were classified as low frequency (LF) in SUBTLEX-CY and high frequency (HF) in CEG and compared with words that were classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that HF words in CEG were responded to more slowly compared to MF words, suggesting that SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new Welsh word frequency database that also includes part-of-speech, contextual diversity, and other lexical information is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.
Citation
van Heuven, W. J. B., Payne, J. S., & Jones, M. W. (2024). SUBTLEX-CY: A new word frequency database for Welsh. Quarterly Journal of Experimental Psychology, 77(5), 1052-1067. https://doi.org/10.1177/17470218231190315
Journal Article Type | Article |
---|---|
Acceptance Date | Jun 27, 2023 |
Online Publication Date | Aug 30, 2023 |
Publication Date | 2024-05 |
Deposit Date | Aug 25, 2023 |
Publicly Available Date | Aug 25, 2023 |
Journal | Quarterly Journal of Experimental Psychology |
Print ISSN | 1747-0218 |
Electronic ISSN | 1747-0226 |
Publisher | SAGE Publications |
Peer Reviewed | Peer Reviewed |
Volume | 77 |
Issue | 5 |
Pages | 1052-1067 |
DOI | https://doi.org/10.1177/17470218231190315 |
Keywords | Welsh; Word Frequency; Visual word recognition |
Public URL | https://nottingham-repository.worktribe.com/output/23593262 |
Publisher URL | https://journals.sagepub.com/doi/10.1177/17470218231190315 |
Files
QJEP_AAM_vanheuven_et_al2023_SUBTLEX-CY
(2.7 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc/4.0/
You might also like
Electrophysiological measures of conflict detection and resolution in the Stroop task
(2011)
Journal Article
Incidental acquisition of foreign language vocabulary through brief multi-modal exposure
(2013)
Journal Article
Is the masked priming same-different task a pure measure of prelexical processing?
(2013)
Journal Article
SUBTLEX-UK: a new and improved word frequency database for British English
(2014)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search