1. 南京大学计算机科学与技术系,江苏 南京 210023
2. 计算机软件新技术国家重点实验室(南京大学),江苏 南京 210023
[ "席圣渠(1992- ),男,南京大学计算机科学与技术系博士生,主要研究方向为基于深度学习的软件智能开发方法" ]
[ "徐锋(1975- ),男,博士,南京大学计算机科学与技术系教授、博士生导师,主要研究方向为智能化软件可信术" ]
[ "陈鑫(1975- ),男,博士,南京大学计算机科学与技术系副教授,主要研究方向为嵌入式系统、软件建模与分、软件测试与验证" ]
[ "李宣东(1963- ),男,博士,南京大学计算机科学与技术系教授、博士生导师,软件学院院长,主要研究方向为软件建模与分析、软件测试与验证" ]
网络首发:2021-01,
纸质出版:2021-01-15
移动端阅览
席圣渠, 徐锋, 陈鑫, 等. 基于大数据的开源项目缺陷报告智能预检技术[J]. 大数据, 2021,7(1):2021004-1.
Shengqu XI, Feng XU, Xin CHEN, et al. Big-data based intelligent bug triage techniques for open-source projects[J]. Big Data Research, 2021, 7(1): 2021004-1.
席圣渠, 徐锋, 陈鑫, 等. 基于大数据的开源项目缺陷报告智能预检技术[J]. 大数据, 2021,7(1):2021004-1. DOI: 10.11959/j.issn.2096-0271.2021004.
Shengqu XI, Feng XU, Xin CHEN, et al. Big-data based intelligent bug triage techniques for open-source projects[J]. Big Data Research, 2021, 7(1): 2021004-1. DOI: 10.11959/j.issn.2096-0271.2021004.
缺陷报告预检目标在于确定优先级和修复措施,是保障软件可信的关键环节。然而,在日益普及的开源项目中,由于缺陷数量众多、缺乏组织管理等特性,人工预检难以及时完成,迫切需要基于大数据的自动化、智能化预检技术。结合工业界、学术界对缺陷报告预检的认知,提出了一种智能化缺陷报告预检技术框架,全面系统地归纳了缺陷报告预检中存在的3个关键任务:缺陷优先级分类、缺陷分派、缺陷再分派,并结合开源项目的特点提出了相关技术。实验结果初步验证了上述技术的合理性和有效性。
Bug triage aims to determine the priority and repair measures and is critical in ensuring software trustability. However
in the increasingly popular open-source projects
due to a large number of defects and lack of organization and management
it is challenging to triage all the bug reports by hand on time
making big-data based
automated and intelligent bug triage urgent. An intelligent bug triage technical framework based on industry and academia’s cognition was proposed
and three key tasks: bug priority classification
bug assignment
and bug reassignment
were identified comprehensively and systematically. Related technologies for the characteristics of open-source projects were proposed. The preliminary experiment results show the reasonableness and effectiveness of the above techniques.
XIE T , ZHANG L , XIAO X , et al . Cooperative software testing and analysis:advances and challenges [J ] . Journal of Computer Science & Technology , 2014 , 29 ( 4 ): 713 - 723 .
GUO P J , ZIMMERMANN T , NAGAPPAN N , et al . Characterizing and predicting which bugs get fixed: an empirical study of Microsoft Windows [C ] // The 32nd ACM/IEEE International Conference on Software Engineering . [S.l.:s.n.] , 2010 : 495 - 504 .
JEONG G , KIM S , ZIMMERMANN T , et al . Improving bug triage with bug tossing graphs [C ] // The 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering . New York: ACM Press , 2009 : 111 - 120 .
ZOU W , LO D , CHEN Z , et al . How practitioners perceive automated bug report management techniques [J ] . IEEE Transactions on Software Engineering , 2020 , 46 : 836 - 862 .
ALENEZI M , BANITAAN S . Bug reports prioritization: which features and classifier to use [C ] // 2013 12th International Conference on Machine Learning and Applications . Piscataway:IEEE Press , 2013 : 112 - 116 .
SAH R K , KHURSHID S , PERRY D E . Understanding the triaging and fixing processes of long lived bugs [J ] . Information and Software Technology , 2015 , 65 ( C ).
PODGURSKI A , LEON D , FRANCIS P , et al . Automated support for classifying software failure reports [C ] // The 25th International Conference on Software Engineering . Piscataway: IEEE Press , 2003 : 465 - 475 .
KANWAL J , MAQBOOL O . Bug prioritization to facilitate bug report triage [J ] . Bug prioritization to facilitate bug report triage , 2012 , 27 ( 2 ): 397 - 412 .
TIAN Y , LO D , SUN C . Drone: predicting priority of reported bugs by multi-factor analysis [C ] // 2013 IEEE International Conference on Software Maintenance . Piscataway: IEEE Press , 2013 : 200 - 209 .
UMER Q , LIU H , SULTAN Y , et al . Emotion based automated priority prediction for bug reports [J ] . IEEE Access , 2018 , 6 : 35743 - 35752 .
UMER Q , LIU H , ILLAHI I . CNN-based automatic prioritization of bug reports [J ] . IEEE Transactions on Reliability , 2019 ( 99 ): 1 - 14 .
VALDIVIA-GARCIA H , SHIHAB E , et al . Characterizing and predicting blocking bugs in open source projects [C ] // The 11th Working Conference on Mining Software Repositories . New York: ACM Press , 2014 : 72 - 81 .
JIANG Y , LU P , SU X , et al . LTRWES: a new framework for security bug report detection [J ] . Information and Software Technology , 2020 , 124 : 106314 .
YANG X L , LO D , XIA X , et al . Highimpact bug report identification with imbalanced learning strategies [J ] . Journal of Computer Science and Technology , 2017 , 32 ( 1 ): 181 - 198 .
REN H , LI Y , CHEN L . An empirical study on critical blocking bugs [C ] // The 28th International Conference on Program Comprehension . New York: ACM Press , 2020 :
MURPHY G , CUBRANIC D , et al . Automatic bug triage using text categorization [C ] // The 16th International Conference on Software Engineering & Knowledge Engineering . [S.l.:s.n.] , 2004 : 1 - 6 .
ANVIK J , HIEW L , MURPHY G C , et al . Who should fix this bug [C ] // The 28th International Conference on Software Engineering . New York: ACM Press , 2006 : 361 - 370 .
TAMRAWI A , NGUYEN T T , ALKOFAHI J M , et al . Fuzzy set and cachebased approach for bug triaging [C ] // The 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering . New York: ACM Press , 2011 : 365 - 375 .
YANG G , ZHANG T , LEE B . Towards semi-automatic bug triage and severity prediction based on topic model and multifeature of bug reports [C ] // IEEE Computer Software & Applications Conference . Piscataway: IEEE Press , 2014 .
XIA X , LO D , DING Y , et al . Improving automated bug triaging with specialized topic model [J ] . IEEE Transactions on Software Engineering , 2016 , 43 ( 3 ): 272 - 297 .
LEE S R , HEO M J , LEE C G , et al . Applying deep learning based automatic bug triager to industrial projects [C ] // The 2017 11th Joint Meeting on Foundations of Software Engineering . New York: ACM Press , 2017 : 926 - 931 .
MANI S , SANKARAN A , ARALIKATTE R . Deeptriage: exploring the effectiveness of deep learning for bug triaging [C ] // The ACM India Joint International Conference on Data Science and Management of Data . [S.l.:s.n.] , 2019 : 171 - 179 .
ALKHAZI B , DISTASI A , ALJEDAANI W , et al . Learning to rank developers for bug report assignment [J ] . Applied Soft Computing , 2020 , 95 : 106667 .
ZHANG T , CHEN J , JIANG H , et al . Bug report enrichment with application of automated fixer recommendation [C ] // IEEE/ACM International Conference on Program Comprehension . Piscataway:IEEE Press , 2017 .
SUN X B , ZHOU C , YANG H , et al . Developer recommendation for software security bugs [J ] . Journal of Software , 2018 , 29 ( 8 ): 2294 - 2305 .
YADAV A , SINGH S K , SURI J S , et al . Ranking of software developers based on expertise score for bug triaging [J ] . Information and Software Technology , 2019 , 112 : 1 - 17 .
GUO S , ZHANG X , YANG X , et al . Developer activity motivated bug triaging:via convolutional neural network [J ] . Neural Processing Letters , 2020 , 51 ( 3 ): 2589 - 2606 .
BHATTACHARYA P , NEAMTIU I , SHELTON C R . Automated, highlyaccurate, bug assignment using machine learning and tossing graphs [J ] . Journal of Systems and Software , 2012 , 85 ( 10 ): 2275 - 2292 .
WU H R , LIU H Y , MA Y T , et al . Empirical study on developer factors affecting tossing path length of bug reports [J ] . IET Software , 2018 , 12 ( 3 ): 258 - 270 .
XIA X , LO D , SHIHAB E , et al . Automatic, high accuracy prediction of reopened bugs [J ] . Automated Software Engineering , 2015 , 22 ( 1 ): 75 - 109 .
MI Q , KEUNG J , HUO Y , et al . Not all bug reopens are negative: a case study on eclipse bug reports [J ] . Information &Software Technology , 2018 , 99 : 93 - 97 .
BLEI D M , NG A Y , JORDAN M I , et al . Latent dirichlet allocation [J ] . Journal of Machine Learning Research , 2012 , 3 ( 4-5 ): 993 - 1022 .
ASUNCION A , WELLING M , SMYTH P , et al . On smoothing and inference for topic models [J ] . arXiv preprint , 2020, arXiv:1205.2662 .
DENTON E , GROSS S , FERGUS R , et al . Semisupervised learning with context-conditional generative adversarial networks [J ] . arXiv preprint , 2016, arXiv:1611.06430 .
SALIMANS T , GOODFELLOW I , ZAREMBA W , et al . Improved techniques for training gans [C ] // Advances in Neural Information Processing Systems . New York: ACM Presss , 2016 : 2234 - 2242 .
SAHA R K , KHURSHID S , PERRY D E , et al . An empirical study of long lived bugs [C ] // 2014 Software Evolution Week-IEEE Conference on Software Maintenance, Reengineering, and Reverse Engineering (CSMR-WCRE) . Piscataway:IEEE Press , 2014 : 144 - 153 .
JOHNSON R , ZHANG T . Supervised and semi-supervised text categorization using LSTM for region embeddings [J ] . arXiv preprint , 2016, arXiv:1602.02373 .
YANG Z , YANG D , DYER C , et al . Hierarchical attention networks for document classification [C ] // The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . [S.l.:s.n.] , 2016 : 1480 - 1489 .
CHANG C C , LIN C J . LIBSVM: a library for support vector machines [J ] . ACM Transactions on Intelligent Systems and Technology , 2011 , 2 ( 3 ).
SUTSKEVER I , VINYALS O , LE Q V . Sequence to sequence learning with neural networks [C ] // Advances in Neural Information Processing Systems . New York: ACM Press , 2014 : 3104 - 3112 .
BAHDANAU D , CHO K , BENGIO Y , et al . Neural machine translation by jointly learning to align and translate [J ] . arXiv preprint , 2014, arXiv:1409.0473 .
0
浏览量
426
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621