Yuli Liu
Detecting collusive spamming activities in community question answering
Liu, Yuli; Liu, Yiqun; Zhou, Ke; Zhang, Min; Ma, Shaoping
Abstract
Community Question Answering (CQA) portals provide rich sources of information on a variety of topics. However, the authenticity and quality of questions and answers (Q&As) has proven hard to control. In a troubling direction, the widespread growth of crowdsourcing websites has created a large-scale, potentially difficult-to-detect workforce to manipulate malicious contents in CQA. The crowd workers who join the same crowdsourcing task about promotion campaigns in CQA collusively manipulate deceptive Q&As for promoting a target (product or service). The collusive spamming group can fully control the sentiment of the target. How to utilize the structure and the attributes for detecting manipulated Q&As? How to detect the collusive group and leverage the group information for the detection task? To shed light on these research questions, we propose a unified framework to tackle the challenge of detecting collusive spamming activities of CQA. First, we interpret the questions and answers in CQA as two independent networks. Second, we detect collusive question groups and answer groups from these two networks respectively by measuring the similarity of the contents posted within a short duration. Third, using attributes (individual-level and group-level) and correlations (user-based and content-based), we proposed a combined factor graph model to detect deceptive Q&As simultaneously by combining two independent factor graphs. With a large-scale practical data set, we find that the proposed framework can detect deceptive contents at early stage, and outperforms a number of competitive baselines.
Citation
Liu, Y., Liu, Y., Zhou, K., Zhang, M., & Ma, S. (2017, April). Detecting collusive spamming activities in community question answering. Presented at 26th International Conference on World Wide Web, Perth, Australia
Presentation Conference Type | Edited Proceedings |
---|---|
Conference Name | 26th International Conference on World Wide Web |
Start Date | Apr 3, 2017 |
End Date | Apr 7, 2017 |
Acceptance Date | Dec 20, 2016 |
Publication Date | Apr 3, 2017 |
Deposit Date | Aug 22, 2017 |
Publicly Available Date | Aug 22, 2017 |
Peer Reviewed | Peer Reviewed |
Pages | 1073-1082 |
Book Title | Proceedings of the 26th International Conference on World Wide Web - WWW '17 |
ISBN | 9781450349130 |
DOI | https://doi.org/10.1145/3038912.3052594 |
Keywords | Community Question Answering; Crowdsourcing Manipulation; Spam Detection; Factor Graph |
Public URL | https://nottingham-repository.worktribe.com/output/854570 |
Publisher URL | https://doi.org/10.1145/3038912.3052594 |
Related Public URLs | http://www.www2017.com.au/ |
Contract Date | Aug 22, 2017 |
Files
WWW2017 (1).pdf
(1.7 Mb)
PDF
Copyright Statement
Copyright information regarding this work can be found at the following address: http://creativecommons.org/licenses/by/4.0
You might also like
Meta-evaluation of online and offline web search evaluation metrics
(2017)
Presentation / Conference Contribution
Does document relevance affect the searcher's perception of time?
(2017)
Presentation / Conference Contribution
Palimpsest: improving assisted curation of loco-specific literature
(2016)
Journal Article
Predicting pre-click quality for native advertisements
(2016)
Presentation / Conference Contribution
Incorporating non-sequential behavior into click models
(2015)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search