Haixia Liu
Towards Idea Mining: Problem-Solution Phrase Extraction fromText
Liu, Haixia; Brailsford, Tim; Goulding, James; Maul, Tomas; Tan, Tao; Chaudhuri, Debanjan
Authors
Tim Brailsford
JAMES GOULDING JAMES.GOULDING@NOTTINGHAM.AC.UK
Associate Professor
Tomas Maul
Tao Tan
Debanjan Chaudhuri
Abstract
This paper investigates the feasibility of problem-solution phrases extraction from scientific publications using neural network approaches. Bidirectional Long Short-Term Memory with Conditional Random Fields (Bi-LSTM-CRFs) and Bidirectional Encoder Representations from Transformers (BERT) were evaluated on two datasets, one of which was created by University of Cambridge Computer Laboratory containing 1000 positive examples of problems and solutions (UCCL1000) with the corresponding phrases annotated. The F1-scores computed on the UCCL1000 dataset indicate that BERT is an effective approach to extract solution phrases (with an F1-score of 97%) and problem phrases (with an F1-score of 83%). To test the model’s robustness on a different corpus with a different annotation scheme, a dataset consisting of 488 problem-solution samples from the Conference on Neural Information Processing Systems (NIPS488) was collected and annotated by human readers. Both Bi-LSTM-CRFs and BERT performances were dramatically lower for NIPS488 in comparison with UCCL1000.
Citation
Liu, H., Brailsford, T., Goulding, J., Maul, T., Tan, T., & Chaudhuri, D. (2022). Towards Idea Mining: Problem-Solution Phrase Extraction fromText. In Advanced Data Mining and Applications 18th International Conference, ADMA 2022, Brisbane, QLD, Australia, November 28–30, 2022: Proceedings, Part II (3-14). https://doi.org/10.1007/978-3-031-22137-8_1
Conference Name | 18th International Conference Advanced Data Mining and Applications (ADMA 2022) |
---|---|
Start Date | Nov 28, 2022 |
End Date | Nov 30, 2022 |
Acceptance Date | Nov 10, 2022 |
Online Publication Date | Nov 24, 2022 |
Publication Date | Nov 24, 2022 |
Deposit Date | Jan 16, 2023 |
Publicly Available Date | Nov 25, 2023 |
Publisher | Springer |
Volume | 13726 LNAI |
Pages | 3-14 |
Series Title | Lecture notes in computer science |
Series ISSN | 1611-3349 |
Book Title | Advanced Data Mining and Applications 18th International Conference, ADMA 2022, Brisbane, QLD, Australia, November 28–30, 2022: Proceedings, Part II |
ISBN | 9783031221361 |
DOI | https://doi.org/10.1007/978-3-031-22137-8_1 |
Keywords | Text mining, Problem-solution extraction, NLP |
Public URL | https://nottingham-repository.worktribe.com/output/16217568 |
Publisher URL | https://link.springer.com/chapter/10.1007/978-3-031-22137-8_1 |
Files
18454
(536 Kb)
PDF
You might also like
The Bayesian Spatial Bradley–Terry model: Urban deprivation modelling in Tanzania
(2022)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: digital-library-support@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search