首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > 其他 > wikivandalismdata

wikivandalismdata

  • 资源大小:1.47 kB
  • 上传时间:2021-06-30
  • 下载次数:0次
  • 浏览次数:1次
  • 资源积分:1积分
  • 标      签:

资 源 简 介

Wikipedia is a popular and influential collaborative information system. Vandalism-- malicious editing to deliberately compromise the integrity of the content of articles, has been a major problem for Wikipedia. Extensive manual efforts are being made to combat vandalism but only a few automatic countermeasures are available. Research on Wikipedia vandalism is still in its infancy. The project built statistical language models, constructing distributions of words from the revision history of Wikipedia articles. As vandalism often involves the use of unexpected words to draw attention, the fitness (or lack thereof) of a new edit when compared with language models built from previous versions may well indicate that an edit is a vandalism instance. To minimize the manual effort to identify vandalism instances on Wikipedia, we trained an active learning model, requiring the annotation on only the top 50 results and learned from those. The experimental results demonstrated the ef

相 关 资 源

您 可 能 感 兴 趣 的

同 类 别 推 荐

VIP VIP