Ansgar Koene
Ethics considerations for Corpus Linguistic studies using internet resources
Koene, Ansgar; Adolphs, Svenja; Perez, Elvira; Carter, Chris James; Statache, Ramona; O'Malley, Claire; Rodden, Tom; McAuley, Derek
Authors
Professor SVENJA ADOLPHS SVENJA.ADOLPHS@NOTTINGHAM.AC.UK
PROFESSOR OF ENGLISH LANGUAGE AND LINGUISTICS
Professor ELVIRA PEREZ VALLEJOS elvira.perez@nottingham.ac.uk
PROFESSOR OF DIGITAL TECHNOLOGY FOR MENTAL HEALTH
Dr CHRISTOPHER CARTER CHRISTOPHER.CARTER@NOTTINGHAM.AC.UK
ASSISTANT PROFESSOR IN ENTREPRENEURSHIP AND INNOVATION
Ramona Statache
Claire O'Malley
Professor TOM RODDEN TOM.RODDEN@NOTTINGHAM.AC.UK
Pro-Vice-Chancellor of Research & Knowledge Exchange
Derek McAuley
Abstract
With the rising popularity of public and semi-public communication channels such as Blogs (late 1990s), Wikipedia (launched in 2001), Facebook (launched in 2004), Reddit (from 2005) and Twitter (from 2006), the Internet has become an increasingly fertile medium through which to collect substantial data sets of written language. Additional features that make online communication platforms attractive include the comparatively low effort and cost associated with data collection and the unobtrusive nature of the collection process, which can often be performed ‘behind the scenes’ using application programme interfaces (APIs) or web scraping techniques, depending upon the affordances of the specific type of social media studies (e.g. Twitter, Blogs). While the unobtrusive nature of the methods offers the advantage of ensuring that observed conversations are not unduly influenced by the researcher, it raises ethical concerns around issues of privacy violation, informed consent and the right to withdraw.
In this paper we will discuss some of the ethical concerns around the use of online communications data. We will start by looking at the current guidelines by the British Association for Applied Linguistics (BAAL). Next we will discuss some of the core difficulties related to identifying ‘publicness’ of Internet-based information. This will lead to a discussion about ethical responsibilities when dealing with ‘public’ online communications, and how this issue is being addressed in current corpus linguistics research.
Citation
Koene, A., Adolphs, S., Perez, E., Carter, C. J., Statache, R., O'Malley, C., Rodden, T., & McAuley, D. (2015, July). Ethics considerations for Corpus Linguistic studies using internet resources. Presented at Corpus Linguistics 2015, Lancaster, UK
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | Corpus Linguistics 2015 |
Start Date | Jul 21, 2015 |
End Date | Jul 24, 2015 |
Acceptance Date | Feb 28, 2015 |
Online Publication Date | Jul 21, 2015 |
Publication Date | Jul 21, 2015 |
Deposit Date | Mar 29, 2021 |
Publicly Available Date | Apr 9, 2021 |
Pages | 204-206 |
Book Title | Corpus Linguistics 2015: abstract book |
Public URL | https://nottingham-repository.worktribe.com/output/1405774 |
Publisher URL | http://ucrel.lancs.ac.uk/cl2015/doc/CL2015-AbstractBook.pdf |
Related Public URLs | http://ucrel.lancs.ac.uk/cl2015/ |
Files
Koene Et Al 2015 - CL2015 - AbstractBook
(11.6 Mb)
PDF
You might also like
Discomfort—the dark side of fun
(2018)
Book Chapter
Learning from the Veg Box: Designing Unpredictability in Agency Delegation
(2018)
Presentation / Conference Contribution
A Method for Evaluating Options for Motif Detection in Electricity Meter Data
(2018)
Journal Article
Bread stories: understanding the drivers of bread consumption for digital food customisation
(2017)
Presentation / Conference Contribution
Data Work: How Energy Advisors and Clients Make IoT Data Accountable
(2017)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search