python documents in chinese_Chinese Literature Clustering Research Based on Python K-means Algorithm|电子爱好者

admin管理员组
文章数量:1630185

Chinese Literature Clustering Research Based on Python K-means Algorithm

ZHAO Qian-yi;Guizhou University of Finance and Economics School of Information;

Clustering is an important means of effective organization, summarization and navigation of text information. The K-means algorithm is a very typical distance-based clustering algorithm. It is used for Chinese document clustering. According to the content similarity, a group of documents is divided into several categories and the invisible knowledge is found. In this paper, the K-means algorithm based on Python language is used to summarize the Chinese literature clustering process. The initial cluster cluster number of K-means algorithm is selected by three evaluation indexes: CH index, contour coefficient index and SSE index. The range of optimal k-values is then clustered according to keywords and based on abstracts, and the clustering results are compared and analyzed, so that the clustering of Chinese documents based on abstracts can get better results. In conclusion, the literature in the same category can be clustered by keywords to further explore the invisible knowledge.

CAJViewer7.0 supports all the CNKI file formats; AdobeReader only supports the PDF format.

本文标签： Literature Clustering chineseChinese Python Documents

版权声明：本文标题：python documents in chinese_Chinese Literature Clustering Research Based on Python K-means Algorithm 内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://m.elefans.com/dongtai/1729056431a1184020.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

电子爱好者 - 最新技术资讯及电子产品介绍！

python documents in chinese_Chinese Literature Clustering Research Based on Python K-means Algorithm

更多相关文章

15th ZJP Acm- Pro L. Doki Doki Literature Club

Literature Books

chapter 4: A literature review(re-read papers to gain fresh understanding)

A Review of the Healthcare-Management (Modeling) Literature

Literature Review文献综述常用句型 - EssayMin

Literature Review高分模板句, 毕业论文Distinction必备!

Literature and Life(文学与人生)

To be a Literature and Art Programmer

Doki Doki Literature Club（sort 函数对结构体函数的排序、结构体字符串之间的比较）

The Prevention of Literature

[Literature]比一千个太阳还亮

每日一词|网络文学 online literature

探索安全图学习的新边界：Graph Adversarial Learning Literature

Literature-humor.txt

python把PDF转换成图片

利用python将PDF转为PPT(课件专用)

python pdf转图片 poppler_Python将PDF转成图片—PyMuPDF和pdf2image

Python PDF转image方法小结

新书上市 | Python办公自动化（好友新书，值得一看，文末有福利）

Ubuntu学习（二）搭建系统 与 python、vscode 相关环境搭建

发表评论

推荐文章

Windows循环检测，直到网络通断后执行指定命令

mariadb-libs 被 mysql-community-libs取代

学术写作中Introduction，Literature Review和Conclusion &amp; Future Research的写法

SQL Server 2022从入门到精通

视频转动漫软件有哪些？小编亲测6款工具，1秒穿越漫画场景

热门文章

2022最全Hbuilder打包成苹果IOS-App的详解

dcloud如何苹果ios系统真机测试-HBuilderX真机运行ios测试

Shutdown complete (mysqld 8.0.22) MySQL Community Server - GPL.

阿里云ESC安装Mysql报错：未找到匹配的参数： mysql-community-server 错误：没有任何匹配: mysql-community-server

浏览器帧率(fps)对比：QQ,Firefox,Chrome,Edge

检查安装包(grid infrastructure和Oracle database所需补丁)

windows系统下安装JDK8

15th ZJP Acm - Pro L. Doki Doki Literature Club

鸿蒙系统比安卓快,华为自研的鸿蒙系统比安卓快60%

实测：华为鸿蒙系统比 Android 系统快 60%！

最新文章

桌面显示电脑配置的PE_你还用软件看电脑配置？分享三种无需软件查看配置的方法...

汉字录入计算机是什么时候,电脑汉字录入快速通

计算机专业能报税务师,税务师机考模式下 你会遇到哪些技术层面的难题

九种常用输入法特殊符号功能大揭密

表形码 输入法!

cmd命令怎么查看电脑配置？

国外BT下载网站

输入法卸载的问题解决

税务系统什么时候使用计算机,2020年税务师考试题量、答题要求及计算器使用规定...

学计算机用什么输入语法最好,怎么才能有效的学好电脑打字

怎么查看电脑配置|win7查看电脑配置教程

职高计算机应用基础试题,中职职高计算机应用基础考试试题doc

台式计算机打字标准手法,电脑打字技巧口诀

学计算机打字重不重要,怎么才能有效的学好电脑打字

cpa用计算机考,cpa是机考还是笔试？考试方式大揭秘！

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

在哪些场景下应优先考虑使用treenode

treenode在树形结构中的角色是什么

如何通过treenode实现二叉树

Ubuntu学习（二）搭建系统与 python、vscode 相关环境搭建

学术写作中Introduction，Literature Review和Conclusion & Future Research的写法

计算机专业能报税务师,税务师机考模式下你会遇到哪些技术层面的难题

表形码输入法!

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载