包含词性的朴素Bayes分类在文本分类中的应用
【摘要】 由于国内网络时代的迅速发展,我们可以在网络上获得大量的数据。为了更好的对数据进行处理,我们可以依据词性对文本进行分类处理。本篇论文的研究重点是分析了朴素贝叶斯算法对基于词性的文本分类的应用,然后对使用了朴素贝叶斯的文本分类模型进行了论述,在此基础上通过搭建给予python的文本分类平台,进行了实验准备和数据分类,经过实验数据表明本论文提出朴素Bayes分类算法具有较好的分类效率和性能表现。
【关键词】 朴素贝叶斯算法,文本分类,中文分词,算法设计
――
【Abstract】 With the rapid development of the Internet age, people can get a lot of data through the Internet. In order to improve this situation, text can be classified. This paper studies the application of naive Bayes algorithm in text classification, gives the basic principle of text classification technology and naive Bayes algorithm, and discusses the text classification model of naive Bayes. Based on this, a text classification platform for python was built, and experimental preparation and data classification were carried out. The experimental data showed that the naive Bayes classification algorithm proposed in this paper has good classification efficiency and performance.
【Key Words】 Naive Bayes algorithm, text classification, Chinese word segmentation, algorithm design
目 录













