[ "兰海波(1979- ),男,中国气象局公共气象服务中心高级工程师,主要研究方向为大数据处理技术、自然语言处理技术、数据库技术和气象服务信息系统的关键技术及应用" ]
网络首发:2022-07,
纸质出版:2022-07-15
移动端阅览
兰海波. 混合型数据的邻域条件互信息熵属性约简算法[J]. 大数据, 2022,8(4):133-144.
Haibo LAN. Neighborhood conditional mutual information entropy attribute reduction algorithm for hybrid data[J]. Big data research, 2022, 8(4): 133-144.
兰海波. 混合型数据的邻域条件互信息熵属性约简算法[J]. 大数据, 2022,8(4):133-144. DOI: 10.11959/j.issn.2096-0271.2022066.
Haibo LAN. Neighborhood conditional mutual information entropy attribute reduction algorithm for hybrid data[J]. Big data research, 2022, 8(4): 133-144. DOI: 10.11959/j.issn.2096-0271.2022066.
属性约简是粗糙集理论的重要研究内容之一,其主要目的是消除信息系统中不相关的属性,降低数据维度并提高数据知识发现性能。然而,基于粗糙集的属性约简方法大多没有考虑属性之间的依赖性,使得最终的属性约简结果存在一定的冗余属性。对此,提出一种基于邻域条件互信息熵的属性约简算法。首先,在传统邻域熵的基础上,针对混合型数据,提出混合型邻域互信息熵模型和混合型邻域条件互信息熵模型;然后利用这两种熵模型进行混合型信息系统的属性依赖度评估和属性启发式搜索,并设计出一种属性约简算法;最后通过UCI数据集的实验分析,证明了提出的算法具有较高的属性约简性能。
Attribute reduction is an important research content of the rough set theory.Its main purpose is to eliminate irrelevant attributes in information systems
reduce data dimensions and improve data knowledge discovery performance.However
most of the attribute reduction methods based on a rough set do not consider the dependence between attributes
which makes the final attribute reduction result have some redundant attributes.An attribute reduction algorithm based on neighborhood conditional mutual information entropy was proposed.Firstly
based on the traditional neighborhood entropy
a hybrid neighborhood mutual information entropy model and a hybrid neighborhood conditional mutual information entropy model were proposed for hybrid data.Then
the two entropy models were used to evaluate the attribute dependence and attribute heuristic search of the hybrid information system
and an attribute reduction algorithm was designed.Finally
through the experimental analysis of UCI data sets
it was proved that the algorithm had higher attribute reduction performance.
YANG X , LI M M , FUJITA H , et al . Incremental rough reduction with stable attribute group [J ] . Information Sciences , 2022 , 589 : 283 - 299 .
SUN L , YIN T Y , DING W P , et al . Multilabel feature selection using MLReliefF and neighborhood mutual information for multilabel neighborhood decision systems [J ] . Information Sciences , 2020 , 537 : 401 - 424 .
周涛 , 陆惠玲 , 任海玲 , 等 . 基于粗糙集的属性约简算法综述 [J ] . 电子学报 , 2021 , 49 ( 7 ): 1439 - 1449 .
ZHOU T , LU H L , REN H L , et al . Survey on attribute reduction algorithm of rough set [J ] . Acta Electronica Sinica , 2021 , 49 ( 7 ): 1439 - 1449 .
HU Q H , YU D R , LIU J F , et al . Neighborhood rough set based heterogeneous feature subset selection [J ] . Information Sciences , 2008 , 178 ( 18 ): 3577 - 3594 .
PANG Q Q , ZHANG L . Semi-supervised neighborhood discrimination index for feature selection [J ] . Knowledge-Based Systems , 2020 , 204 : 106224 .
HU M , TSANG E C C , GUO Y T , et al . A novel approach to attribute reduction based on weighted neighborhood rough sets [J ] . Knowledge-Based Systems , 2021 , 220 : 106908 .
SHU W H , QIAN W B , XIE Y H . Incremental feature selection for dynamic hybrid data using neighborhood rough set [J ] . Knowledge-Based Systems , 2020 , 194 : 105516 .
盛魁 , 王伟 , 卞显福 , 等 . 混合数据的邻域区分度增量式属性约简算法 [J ] . 电子学报 , 2020 , 48 ( 4 ): 682 - 696 .
SHENG K , WANG W , BIAN X F , et al . Neighborhood discernibility degree incremental attribute reduction algorithm for mixed data [J ] . Acta Electronica Sinica , 2020 , 48 ( 4 ): 682 - 696 .
姚晟 , 李初宴 , 陈悦 . 基于非平衡数据下不完备混合型信息系统的属性约简 [J ] . 计算机应用研究 , 2021 , 38 ( 5 ): 1331 - 1335 .
YAO S , LI C Y , CHEN Y . Attribute reduction of incomplete hybrid information system based on unbalanced data [J ] . Application Research of Computers , 2021 , 38 ( 5 ): 1331 - 1335 .
WANG C Z , HUANG Y , DING W P , et al . Attribute reduction with fuzzy rough self-information measures [J ] . Information Sciences , 2021 , 549 : 68 - 86 .
YUAN Z , CHEN H M , LI T R , et al . Unsupervised attribute reduction for mixed data based on fuzzy rough sets [J ] . Information Sciences , 2021 , 572 : 67 - 87 .
栾雨雨 , 王锡淮 , 肖健梅 . 基于混沌离散粒子群的粗糙集属性约简算法 [J ] . 计算机仿真 , 2021 , 38 ( 7 ): 271 - 275 .
LUAN Y Y , WANG X H , XIAO J M . Rough set attribute reduction algorithm based on chaotic discrete particle swarm optimization [J ] . Computer Simulation , 2021 , 38 ( 7 ): 271 - 275 .
HU M , TSANG E C C , GUO Y T , et al . Attribute reduction based on overlap degree and k-nearest-neighbor rough sets in decision information systems [J ] . Information Sciences , 2022 , 584 : 301 - 324 .
桑彬彬 , 杨留中 , 陈红梅 , 等 . 优势关系粗糙集增量属性约简算法 [J ] . 计算机科学 , 2020 , 47 ( 8 ): 137 - 143 .
SANG B B , YANG L Z , CHEN H M , et al . Incremental attribute reduction algorithm in dominance-based rough set [J ] . Computer Science , 2020 , 47 ( 8 ): 137 - 143 .
熊菊霞 , 吴尽昭 , 王秋红 . 邻域互信息熵的混合型数据决策代价属性约简 [J ] . 小型微型计算机系统 , 2021 , 42 ( 8 ): 1584 - 1590 .
XIONG J X , WU J Z , WANG Q H . Decision cost attribute reduction of hybrid data based on neighborhood mutual information entropy [J ] . Journal of Chinese Computer Systems , 2021 , 42 ( 8 ): 1584 - 1590 .
陈帅 , 张贤勇 , 唐玲玉 , 等 . 邻域互补信息度量及其启发式属性约简 [J ] . 数据采集与处理 , 2020 , 35 ( 4 ): 630 - 641 .
CHEN S , ZHANG X Y , TANG L Y , et al . Neighborhood complementary information measures and heuristic attribute reduction [J ] . Journal of Data Acquisition and Processing , 2020 , 35 ( 4 ): 630 - 641 .
陈帅 . 基于三层粒结构的邻域互补信息度量及其属性约简 [D ] . 成都:四川师范大学 , 2020 .
CHEN S . Neighborhood complementary information measures and their attribute reductions based on three-layer granular structure [D ] . Chengdu:Sichuan Normal University , 2020 .
姚晟 , 徐风 , 吴照玉 , 等 . 基于邻域粗糙互信息熵的非单调性属性约简 [J ] . 控制与决策 , 2019 , 34 ( 2 ): 353 - 361 .
YAO S , XU F , WU Z Y , et al . Nonmonotonic attribute reduction based on neighborhood rough mutual information entropy [J ] . Control and Decision , 2019 , 34 ( 2 ): 353 - 361 .
SALEM O A M , LIU F , CHEN Y P P , et al . Feature selection and threshold method based on fuzzy joint mutual information [J ] . International Journal of Approximate Reasoning , 2021 , 132 : 107 - 126 .
HU Q H , ZHANG L , ZHANG D , et al . Measuring relevance between discrete and continuous features based on neighborhood mutual information [J ] . Expert Systems With Applications , 2011 , 38 ( 9 ): 10737 - 10750 .
0
浏览量
267
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621