首页 > 最新目录 > 正文

21.基于语义分析的微博热点话题发现技术研究*

日期:2013-09-15 11:00:00 点击:

.

 柏建普,田芳

(内蒙古科技大学信息与工程学院,内蒙古包头014010)

关键词:语义分析微博;热点;话题发现

中图分类号:TP391 文献标识码:A

摘要:近年来,微博热点话题发现已经成为当前网络舆情分析研究的热点. 本文针对微博信息的碎片化、口语化等短文本特点,为解决向量空间模型(VSM)文本表示方法存在高维度、稀疏,及同义多义等问题,采用潜在语义分析法对微博信息进行建模,再通过贝叶斯分类算法实现话题发现.并采用J2EE 开发包及Eclipse 集成开发环境,结合HibernateLucene 等技术实现了微博热点话题发现系统,实验表明这种方法是有效的.

esearch of micro-blogs hot topic detection technology based on semantic analysis

BAI Jian-puTIAN Fang

(Information Science and Engineering SchoolInner Mongolia University of Science and TechnologyBaotou 014010china)

Key words:semantic analysis micro blogs;hot topics;topic detection

Abstract:The hot topics of micro-blog detecting has become the current research focuses of Internet public opinion information In order to solve the existing problems of high-dimensionsparsesynonymy and polysemy from the Vector Space Model (VSM) text presentationthe micro-blog information model was developed using LSA for the short texts of the fragmentcolloquial micro blog informationthen the topic detection was achieved through the Bayesian classification algorithm Furthermorethe micro blog topic detecting system was constructed by adopting software developers kit J2EEthe integrated development environment Eclipse and techniques such as Hibernate

and Luceneand the operation of the system was proved to be effective

地址:内蒙古包头市昆都仑区阿尔丁大街7号 邮编:014010 电话:0472-5951610或0472-5953910 Email:cky@imust.edu.cn nkdxb@imust.edu.cn

版权所有:内蒙古科技大学学报编辑部(©2013)