Palimpsest: improving assisted curation of loco-specific literature
Alex, Beatrice; Grover, Claire; Oberlander, Jon; Thomson, Tara; Anderson, Miranda; Loxley, James; Hinrichs, Uta; Zhou, Ke
Text mining and information visualization techniques applied to large-scale historical and literary document collections have enabled new types of humanities research. The assumption behind such efforts is often that trends will emerge from the analysis despite errors for individual data points and that noise will be dominated by the signal in the data. However, for some text analysis tasks, the technology is unable to perform as well as domain experts, perhaps because it does not have sufficient world knowledge or metadata available. Yet, the advantage of language processing technology is that it can process at scale, even if not perfectly accurately. Geo-locating literary works is one example where human expert knowledge is invaluable when it comes to distinguishing between candidate works. This was the underlying assumption in Palimpsest, an interdisciplinary digital humanities research project on mining literary Edinburgh. From the outset, the project adopted an assisted curation process whereby the automatic processing of large data collections was combined with manual checking to identify literary works set in Edinburgh. In this article, we introduce the assisted curation process and evaluate how the feedback from literary scholars helped to improve the technology, thereby highlighting the importance of placing humanities research at the core of digital humanities projects.
|Journal Article Type||Article|
|Publication Date||Apr 1, 2017|
|Journal||Digital Scholarship in the Humanities|
|Publisher||Oxford University Press (OUP)|
|Peer Reviewed||Peer Reviewed|
|APA6 Citation||Alex, B., Grover, C., Oberlander, J., Thomson, T., Anderson, M., Loxley, J., … Zhou, K. (2017). Palimpsest: improving assisted curation of loco-specific literature. Digital Scholarship in the Humanities, 32(Supp 1), doi:10.1093/llc/fqw050|
|Copyright Statement||Copyright information regarding this work can be found at the following address: http://eprints.nottingh.../end_user_agreement.pdf|