Genetic Algorithms for Auto-Clustering in KDD

来源 :Journal of Systems Engineering and Electronics | 被引量 : 0次 | 上传用户:sunyulong378
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In solving the clustering problem in the context of knowledge discovery in databases (KDD), the traditional methods, for example, the K-means algorithm and its variants, usually require the users to provide the number of clusters in advance based on the pro-information. Unfortunately, the number of clusters in general is unknown to the users who are usually short of pro-information. Therefore, the clustering calculation becomes a tedious trial-and-error work, and the result is often not global optimal especially when the number of clusters is large. In this paper, a new dynamic clustering method based on genetic algorithms (GA) is proposed and applied for auto-clustering of data entities in large databases. The algorithm can automatically cluster the data according to their similarities and find the exact number of clusters. Experiment results indicate that the method is of global optimization by dynamically clustering logic. In solving the clustering problem in the context of knowledge discovery in databases (KDD), the traditional methods, for example, the K-means algorithm and its variants, usually require the users to provide the number of clusters in advance based on the pro- information, Unfortunately, the number of clusters in general is unknown to the users who are usually short of pro-information. Therefore, the clustering calculation becomes a tedious trial-and-error work, and the result is often not global optimal especially when the number of clusters is large. In this paper, a new dynamic clustering method based on genetic algorithms (GA) is proposed and applied for auto-clustering of data entities in large databases. The algorithm can automatically cluster the data according to their similarities and find the exact number of clusters. Experiment results indicating that the method is of global optimization by dynamically clustering logic.
其他文献
目的:通过分析早期脑梗死组织的SWI表现及测量静脉Phase值差(Δψ)来评估梗死组织的氧代谢水平,并分析静脉Δψ与血流灌注、临床NIHSS评分之间的关系。  材料与方法:收集符
毛主席指出:“自力更生为主,争取外援为辅,破除迷信,独立自主的干工业,干农业,干技术革命和文化革命,打倒奴隶思想,埋葬教条主义,认真学习外国的好经验,也一定研究外国的坏经
建筑电气智能应急照明系统设计工作的有效落实,一方面能够为火场疏散工作提供更好的照明引导,使逃生人群疏散速率得以显著提升,增强消防现场的可控性;另一方面,凭借智能系统
本文介绍了一种由 AT89C2 0 51单片机作主控制器而具有数据录入、显示及通讯等功能的智能手操器。运行证明该手操器具有功能强、小巧、灵活、工作稳定、性价比高的特点 ,有广
现在我国经济发展水平还在不断提高,同时随着社会的进步,我国城市化建设也如火如荼地开展着.在城市建设工作当中,建筑工程的建设可以说是一项非常重要的工作,其也可以给人们
美术作为中国传统文化的重要组成部分,以兼容并蓄的强大包容性,融入了人类的精神和文化特质,彰显出强大的生命力.中国山水画有着独特的艺术语言和审美品格,中国画家用笔墨形
近年来,由于马铃薯环腐病、黑胫病的蔓延和退化问题,严重地影响了产量的提高。为了防止马铃薯病烂问题,我们曾试验过切刀消毒、药剂浸种,在栽培管理上也采取过深耕、选用好
不久前,美国能源部所属的桑迪亚国家实验室研制出微型发动机。该发动机的主要活动部件是一个只有花粉颗粒大小的齿轮,人们只能借助显微镜观看它的旋转,其速度为每分钟35万转。科学
目的:探讨研究下蒂瓣法在巨乳缩小整形术中优缺点。方法:收集自2009年1月年至2018年12月在烧伤整形科住院并采用下蒂瓣法行手术病人的临床资料31例。资料内容包括患者姓名、性别、年龄、体重、体重指数、孕育史、专科查体以及手术记录等内容指标。就于术前症状改善,外形的满意度,瘢痕,以及乳头、乳晕的敏感性及总体满意度等方面进行电话或微信随访。根据外观的满意度、术后遗留的瘢痕大小及是否发生增生、出现瘙痒
学位
随着土地资源紧缺日渐严重,以及人们对住房需求的增加,使得高层式建筑得到了广泛的应用.为保障高层式建筑的质量,应提高对施工环节的重视.其中桩基施工是土木工程中不可或缺