A Semantics-Based Text Feature Weighting Algorithm for Classification
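  • Abstract

    Text classification suffers from the curse of dimensionality, noise in the data set, and the fact that feature words contribute unequally to classification, all of which reduce classification accuracy. To improve accuracy, this paper proposes a new data-processing method. The method first denoises the data set, then reduces dimensionality by combining a feature extraction algorithm with semantic analysis, and finally uses the semantic relatedness between words to assign a different weight to each feature word in the text feature vector. A classifier is then learned from the text data processed in this way. Experimental results show that this text-processing method effectively improves text classification accuracy.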

  • Authors

    ZHANG Guo-dong (张国栋), ZHANG Hua-xiang (张化祥)

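  • Affiliations

    School of Information Science and Engineering, Shandong Normal University, Jinan 250014; Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan 250014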

  • Issue

    2012, Issue 12 (indexed: ISTIC, PKU)

  • Keywords

    semantic analysis; dimensionality reduction; semantic relatedness; classification
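
The abstract describes weighting each feature word by semantic relatedness but gives no formula. The following is a minimal sketch of one way such a step could look, not the authors' method. It assumes that each term has some vector representation (the hypothetical `term_vectors` mapping), that relatedness is measured by cosine similarity, and that a term's weight is its mean relatedness to the other terms in the same document. All function names here are illustrative.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def semantic_weights(terms, term_vectors):
    """Assumed scheme: weight of a term = mean relatedness to the document's other terms."""
    weights = {}
    for term in terms:
        others = [o for o in terms if o != term]
        if not others:
            # A single-term document has nothing to compare against.
            weights[term] = 1.0
            continue
        total = sum(cosine(term_vectors[term], term_vectors[o]) for o in others)
        weights[term] = total / len(others)
    return weights


def weighted_features(term_counts, term_vectors):
    """Scale each raw term count by its semantic weight."""
    weights = semantic_weights(list(term_counts), term_vectors)
    return {t: count * weights[t] for t, count in term_counts.items()}


if __name__ == "__main__":
    # Toy vectors (made up for illustration): "ball" and "goal" are related, "tax" is not.
    vectors = {"ball": [1.0, 0.0], "goal": [1.0, 0.0], "tax": [0.0, 1.0]}
    counts = {"ball": 2, "goal": 1, "tax": 1}
    print(weighted_features(counts, vectors))
```

Under this assumed scheme, a term that is unrelated to the rest of the document is down-weighted, so the resulting vector emphasizes words that agree with the document's dominant topic before it is passed to a classifier.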
