1. 复旦大学计算机科学技术学院,上海 201203
2. 上海市数据科学重点实验室,上海 201203
3. 西北师范大学计算机科学与工程学院,甘肃 兰州 730070
4. 上海市第六人民医院,上海 200233
5. 上海市普陀区疾病预防控制中心,上海 200333
[ "王晓霞(1978-),女,博士,复旦大学计算机科学技术学院博士生,主要研究方向为数据科学、大数据。" ]
[ "蒋伏松(1974-),男,博士,上海市第六人民医院内分泌代谢科副主任医师,主要研究方向为糖尿病并发症的防治。" ]
[ "王宇(1982-),男,上海市普陀区疾病预防控制中心主管医师、慢性病科科长,主要从事慢性病的预防控制工作。" ]
[ "熊贇(1980-),女,博士,复旦大学计算机科学技术学院教授。2004年起从事数据领域方面的研究工作,作为项目负责人主持国家自然科学基金、上海市科学技术委员会发展基金以及企业合作项目。相关研究成果在本领域国际权威期刊或会议发表论文40余篇,出版著作3本。目前主要研究方向为数据科学、大数据。" ]
网络首发:2018-05,
纸质出版:2018-05-15
移动端阅览
王晓霞, 蒋伏松, 王宇, 等. 基于ICD-10诊断编码的慢性病并发症聚类算法[J]. 大数据, 2018,4(3):2018028.
Xiaoxia WANG, Fusong JIANG, Yu WANG, et al. Chronic disease complications clustering based on ICD-10 diagnoses code[J]. Big Data Research, 2018, 4(3): 2018028.
王晓霞, 蒋伏松, 王宇, 等. 基于ICD-10诊断编码的慢性病并发症聚类算法[J]. 大数据, 2018,4(3):2018028. DOI: 10.11959/j.issn.2096-0271.2018028.
Xiaoxia WANG, Fusong JIANG, Yu WANG, et al. Chronic disease complications clustering based on ICD-10 diagnoses code[J]. Big Data Research, 2018, 4(3): 2018028. DOI: 10.11959/j.issn.2096-0271.2018028.
慢性病与其相关并发症关系的研究,对患者以及医学研究都有重要意义。电子病历中记录的患者就诊数据为研究目标慢性病与其并发症的关系提供了数据基础,其中面临的挑战之一在于既需要使用临床医生的领域知识对并发症进行标注,又不希望给医生增加过多负担。设计了一种采用分组策略的基于ICD-10诊断编码的慢性病并发症半监督聚类方法,以实现在较少的医生专家参与下对慢性病并发症归类。真实糖尿病患者电子医疗记录数据集上的实验结果表明提出的算法是实用且有效的。
Study on the relationship between the chronic disease and the corresponding complications has great theoretical significance and applicable value for patients and clinical medicine.In order to utilize healthcare electronic record more reasonably
preprocessing was needed according to prior medical knowledge for chronic disease complication.The challenge of this work is that medical knowledge should be exploited to label the corresponding complications.To meet these challenges and assist physicians in labeling complications of a target chronic disease
a semi-supervised chronic disease complications clustering algorithm based on ICD-10 code for diagnoses was proposed.Experiments on a real dataset of diabetes electronic healthcare record show that the algorithms are practical and effective.
WANG X , SONTAG D , WANG F . Unsupervised learning of disease progression models [C ] // The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,August 24-27,2014,New York,USA . New York:ACM Press , 2014 : 85 - 94 .
JOHNSON L W , WEINSTOCK R S . Medical complications of diabetes mellitus [M ] . Hoboken : John Wiley & Sons,IncPress , 2014 .
CHAPELLE O , SCHÖLKOPF B , ZIEN A . Semi-supervised learning [M ] . Cambridge : MIT PressPress , 2006 .
李昆仑 , 曹铮 , 曹丽苹 , 等 . 半监督聚类的若干新进展 [J ] . 模式识别与人工智能 , 2009 , 22 ( 5 ): 735 - 742 .
LI K L , CAO Z , CAO L P , et al . Some developments on semi-supervised clustering [J ] . Pattern Recognition And Artificial Intelligence , 2009 , 22 ( 5 ): 735 - 742 .
ZHU X J , GOLDBERG A B . Introduction to semi-supervised learning [J ] . Synthesis Lectures on Artificial Intelligence and Machine Learning , 2009 , 3 ( 1 ):130.
HUYEN D T T , LE H S , TUAN T M , et al . Semi-supervised fuzzy co-clustering for hospital-cost analysis from electronic medical records [C ] // The 8th IEEE International Conference on Knowledge and Systems Engineering,October 6-8,2016,Hanoi,Vietnam . Piscataway:IEEE Press , 2016 : 25 - 30 .
PHUONG N D , CHAU V T N , BAO H T . A hybrid semi-supervised learning approach to identifying protected health information in electronic medical records [C ] // The 2016 ACM International Conference on Ubiquitous Information Management and Communication,January 4-6,2016,Danang,Vietnam . New York:ACM Press , 2016 :80.
KHAN F M , KULIKOWSKI C A . Survival analysis via transduction for semi-supervised neural networks in medical prognosis [C ] // The 2015 IEEE International Conference on Bioinformatics and Biomedicine,November 9-12,2015,Washington,DC,USA . Piscataway:IEEE Press , 2015 : 433 - 437 .
BASU S , BANERJEE A , MOONEY R J . Semi-supervised clustering by seeding [C ] // The 19th International Conference on Machine Learning,July 8-12,2002,Sydney,Australia . New York:ACM Press , 2002 : 27 - 34 .
YOUNG B A , LIN E , KORFF M V , et al . Diabetes complications severity index and risk of mortality,hospitalization,and healthcare utilization [J ] . American Journal of Managed Care , 2008 , 14 ( 1 ): 15 - 23 .
GLASHEEN W P , RENDA A , DONG Y T . Diabetes complications severity index (DCSI)-update and ICD-10 translation [J ] . Diabetes Complications , 2017 , 31 ( 6 ): 1007 - 1013 .
International Diabetes Federation [R ] . IDF Diabetes Atlas 8th Edition , 2017 .
0
浏览量
895
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621