论文部分内容阅读
本文介绍基于《中国档案分类法金融档案分类表》(以下简称《金融档案分类表》)的中文文本自动分类算法。提出了类别词概念,介绍了类别词库和分类规则词库建造法以及自动分类的三维加权算法等内容。经过对真实金融档案文本测试,自动分类正确率可达81%以上。
This article introduces an automatic Chinese text classification algorithm based on the “Chinese File Classification Financial File Classification Table” (hereinafter referred to as the “Financial File Classification Table”). Proposed the concept of category words, introduced thesaurus construction of category thesaurus and classification rules and three-dimensional weighted algorithm of automatic classification. After the real financial file text test, automatic classification accuracy rate of up to 81%.