Integrating outlier filtering in large margin training

Source: Journal of Zhejiang University-Science C(Computers & Electro | Citations: 0
Large margin classifiers such as support vector machines (SVM) have been applied successfully in various classification tasks. However, their performance may be significantly degraded in the presence of outliers. In this paper, we propose a robust SVM formulation which is shown to be less sensitive to outliers. The key idea is to employ an adaptively weighted hinge loss that explicitly incorporates outlier filtering in the SVM training, thus performing outlier filtering and classification simultaneously. The resulting robust SVM formulation is non-convex. We first relax it into a semi-definite program, which admits a global solution. To improve efficiency, an iterative approach is developed. We have performed experiments using both synthetic and real-world data. Results show that the performance of the standard SVM degrades rapidly as more outliers are included, while the proposed robust SVM training is more stable in the presence of outliers.
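The abstract does not give the paper's formulation, so the following is only a minimal sketch of how an iterative scheme with a weighted hinge loss might look, not the authors' algorithm. It assumes binary per-sample weights `eta`, a fixed outlier budget `n_outliers`, and plain subgradient descent as the inner solver; every function name and parameter here is a hypothetical illustration.

```python
def hinge(w, b, x, y):
    """Hinge loss max(0, 1 - y * (w.x + b)) for one sample."""
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return max(0.0, 1.0 - y * score)


def fit_weighted_svm(X, y, eta, C=1.0, epochs=200, lr=0.01):
    """Subgradient descent on 0.5*||w||^2 + C * sum_i eta_i * hinge_i."""
    d = len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw = list(w)  # gradient of the regularizer 0.5*||w||^2
        gb = 0.0
        for xi, yi, ei in zip(X, y, eta):
            # Only samples with nonzero weight and a violated margin contribute.
            if ei > 0 and hinge(w, b, xi, yi) > 0:
                for j in range(d):
                    gw[j] -= C * ei * yi * xi[j]
                gb -= C * ei * yi
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b


def robust_svm(X, y, n_outliers, rounds=5):
    """Alternate between fitting and zeroing the weights of the worst-fit samples.

    Assumption: the n_outliers samples with the largest hinge loss are
    treated as outliers in each round (a simple heuristic, not the paper's rule).
    """
    eta = [1.0] * len(X)
    for _ in range(rounds):
        w, b = fit_weighted_svm(X, y, eta)
        losses = [hinge(w, b, xi, yi) for xi, yi in zip(X, y)]
        worst = sorted(range(len(X)), key=lambda i: losses[i], reverse=True)
        eta = [1.0] * len(X)
        for i in worst[:n_outliers]:
            eta[i] = 0.0
    w, b = fit_weighted_svm(X, y, eta)  # final fit on the filtered set
    return w, b, eta


def predict(w, b, x):
    """Return +1 or -1 according to the sign of the decision function."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1
```

Calling `robust_svm(X, y, n_outliers=k)` returns the weight vector, the bias, and the final `eta`, in which a zero marks a sample that was filtered out. The alternating structure mirrors the idea of filtering and classifying in the same training loop, but it carries no global optimality guarantee.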