1. 国防科技大学计算机学院,湖南 长沙 410073
2. 绿色计算产业联盟,北京 100036
3. 湖南智擎科技有限公司,湖南 长沙 410073
[ "张洋(1991- ),男,博士,国防科技大学计算机学院助理研究员,中国计算机学会会员,主要研究方向为实证软工程、软件版本库挖掘、DevOps等" ]
[ "王涛(1986- ),男,博士,国防科技大学计算机学院副研究员,中国计算机学会会员,主要研究方向为分布式计算、软件工程、数据挖掘等" ]
[ "尹刚(1975- ),男,博士,绿色计算产业联盟实践教学工作委员会副主任,中国计算机学会会员,主要研究方向为在线教育、分布式计算、软件工程、数据挖掘、云计算等" ]
[ "余跃(1988- ),男,博士,国防科技大学计算机学院副研究员,中国计算机学会会员,主要研究方向为数据挖掘、实证软件工程、社交化编码等" ]
[ "黄井泉(1986- ),男,湖南智擎科技有限公司高级工程师,主要研究方向为在线教育、软件工程、数据挖掘等" ]
网络首发:2021-01,
纸质出版:2021-01-15
移动端阅览
张洋, 王涛, 尹刚, 等. 面向智能化软件开发的开源生态大数据[J]. 大数据, 2021,7(1):2021007-1.
Yang ZHANG, Tao WANG, Gang YIN, et al. Big data of open source ecosystem for intelligent software development[J]. Big Data Research, 2021, 7(1): 2021007-1.
张洋, 王涛, 尹刚, 等. 面向智能化软件开发的开源生态大数据[J]. 大数据, 2021,7(1):2021007-1. DOI: 10.11959/j.issn.2096-0271.2021007.
Yang ZHANG, Tao WANG, Gang YIN, et al. Big data of open source ecosystem for intelligent software development[J]. Big Data Research, 2021, 7(1): 2021007-1. DOI: 10.11959/j.issn.2096-0271.2021007.
开源软件开发过程中包含大量有价值的数据,针对其数据规模巨大、碎片分散、快速膨胀的特点,研究了软件工程开源生态大数据体系,提出了一种自生长的采集处理框架与汇聚共享环境,阐述了基于软件工程开源生态大数据的智能化软件开发,以及基于软件工程开源生态大数据分析挖掘的典型应用,为面向智能化软件开发的开源生态大数据研究与应用提供相关指导。
The open source software development process contains a lot of valuable data
which is huge in scale
fragmented
and rapidly expanding. Aimming to the characteristics
the big data structure of open source ecosystem of software engineering was studied
and a self-growing collection and processing framework and a convergence and sharing environment was proposed. The related research on the development of intelligent software based on open source big data of software engineering
and typical applications based on analysis and mining of open source big data of software engineering were expounded
and relevant guidance for the research and application of big data of open source ecosystem for intelligent software development was provided.
Black Duck Software, Inc . 2017 open source 360 degree survey [R ] . 2017 .
GOLDEN B . Succeeding with open source [M ] . New Jersey : Addison-Wesley Professional , 2005 .
WANG L , SUN X B , WANG J W , et al . Construct bug knowledge graph for bug resolution [C ] // The 2017 IEEE/ACM 39th International Conference on Software Engineering Companion . Piscataway:IEEE Press , 2017 : 189 - 191 .
HAN Z B , LI X H , LIU H T , et al . DeepWeak: reasoning common software weaknesses via knowledge graph embedding [C ] // The 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering . Piscataway: IEEE Press , 2018 : 456 - 466 .
ZHAO X J , XING Z C , KABIR M A , et al . HDSKG: harvesting domain specific knowledge graph from content of webpages [C ] // The 2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering . Piscataway: IEEE Press , 2017 : 56 - 67 .
WANG M , ZOU Z C , CAO Y C , et al . Searching software knowledge graph with question [C ] // International Conference on Software and Systems Reuse . Cham:Springer , 2019 : 115 - 131 .
凌春阳 , 邹艳珍 , 林泽琦 , 等 . 基于图嵌入的软件项目源代码检索方法 [J ] . 软件学报 , 2019 , 30 ( 5 ): 1481 - 1497 .
LING C Y , ZOU Y Z , LIN Z Q , et al . Approach to searching software source code with graph embedding [J ] . Journal of Software , 2019 , 30 ( 5 ): 1481 - 1497 .
WANG Y J , YAO Y , TONG H H , et al . Bug localization via supervised topic modeling [C ] // The 18th IEEE International Conference on Data Mining . [S.l.:s.n.] , 2018 : 607 - 616 .
TANG S J , YAO Y , ZHANG S W , et al . An integral tag recommendation model for textual content [C ] // The 33rd AAAI Conference on Artificial Intelligence . [S.l.:s.n.] , 2019 : 5109 - 5116 .
LUKINS S K , KRAFT N A , ETZKORN L H . Bug localization using latent dirichlet allocation [J ] . Information and Software Technology , 2010 , 52 ( 9 ): 972 - 990 .
XUAN J F , MONPERRUS M . Learning to combine multiple ranking metrics for fault localization [C ] // The 2014 IEEE International Conference on Software Maintenance and Evolution . Piscataway:IEEE Press , 2014 : 191 - 200 .
WESTON J , CHOPRA S , ADAMS K . #TagSpace: semantic embeddings from hashtags [C ] // The 2014 Conference on Empirical Methods in Natural Language Processing . [S.l.:s.n.] , 2014 : 1822 - 1827 .
XU M , JIN R , ZHOU Z H . Speedup matrix completion with side information:application to multi-label learning [C ] // The Advances in Neural Information Processing Systems . [S.l.:s.n.] , 2013 : 2301 - 2309 .
SHAO B , YAN J F . Recommending answerers for stack overflow with LDA model [C ] // The 12th Chinese Conference on Computer Supported Cooperative Work and Social Computing . New York: ACM Press , 2017 : 80 - 86 .
YAO Y , TONG H H , XIE T , et al . Detecting high-quality posts in community question answering sites [J ] . Information Sciences , 2015 , 302 ( C ): 70 - 82 .
AHASANUZZAMAN M , ASADUZZAMAN M ROY C K , et al . Mining duplicate questions of stack overflow [C ] // The 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories . Piscataway: IEEE Press , 2016 : 402 - 412 .
ARWAN A , ROCHIMAH S AKBAR R J . Source code retrieval on Stack Overflow using LDA [C ] // The 2015 3rd International Conference on Information and Communication Technology . Piscataway:IEEE Press , 2015 : 295 - 299 .
YIN K , ZHOU J H CHEN W , et al . D-Tagger:a tag recommendation approach for docker repositories [C ] // he 10th Asia-Pacific Symposium on Internetware . New York:ACM Press , 2018 : 1 - 10 .
CHEN W , ZHOU J H , ZHU J X , et al . Semi-supervised learning based tag recommendation for Docker repositories [J ] . Journal of Computer Science and Technology , 2019 , 34 ( 5 ): 957 - 971 .
YIN G , WANG T WANG H M , et al . OSSEAN: mining crowd wisdom in open source communities [C ] // The 2015 IEEE Symposium on Service-Oriented System Engineering . Piscataway: IEEE Press , 2015 : 367 - 371 .
0
浏览量
724
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621