论文部分内容阅读
随着商品的数量和规模的不断扩大,对数据管理的要求已不再局限于以往简单的管理模式,因此,电子商务商品归一化是商务网站面临的首要而又艰巨的任务。商品归一化即从多源异构、各个独立和复杂多样以及不同的电子商务数据中找出同一商品的实体。商品的归一化一旦实现,便会造福于千万用户。但由于在C2C模式下,商品信息缺乏统一的模式并且数据的质量很低,使得已有的商品归一化方法难以实现,下面介绍一种方法来实现商品的归一化进行初步探索,并通过已有的方法对其进行验证.从而证实该方法的可行性和有效性。
With the continuous expansion of the number and size of products, the requirements for data management are no longer confined to simple management models in the past. Therefore, the normalization of e-commerce products is the most important and arduous task for commercial websites. Product normalization is to find out the same commodity entity from heterogeneous heterogeneous, independent and complex diversified and different e-commerce data. Once the normalization of goods is achieved, it will benefit millions of users. However, due to the lack of a unified mode of product information and the low quality of the data in the C2C mode, the existing product normalization method is difficult to be implemented. The following describes a method to achieve the normalization of the product to be initially explored and adopted The method has been verified to prove the feasibility and effectiveness of the method.