1. 中国科学技术大学,安徽 合肥 230026
2. 中国科学院计算技术研究所,北京 100190
3. 中国科学院大学,北京 100049
[ "王秉睿(1994-),男,中国科学技术大学硕士生,主要研究方向为计算机体系结构、机器学习编程方法、人工智能。" ]
[ "兰慧盈(1990-),女,中国科学院计算技术研究所博士生,主要研究方向为计算机体系结构、领域专用编程语言和编译器、人工智能。" ]
[ "陈云霁(1983-),男,博士,中国科学院计算技术研究所研究员(正教授)、博士生导师,主要研究方向为计算机体系结构、人工智能。" ]
网络首发:2018-07,
纸质出版:2018-07-15
移动端阅览
王秉睿, 兰慧盈, 陈云霁. 深度学习编程框架[J]. 大数据, 2018,4(4):2018040.
Bingrui WANG, Huiying LAN, Yunji CHEN. Programming frameworks for deep learning algorithms[J]. Big Data Research, 2018, 4(4): 2018040.
王秉睿, 兰慧盈, 陈云霁. 深度学习编程框架[J]. 大数据, 2018,4(4):2018040. DOI: 10.11959/j.issn.2096-0271.2018040.
Bingrui WANG, Huiying LAN, Yunji CHEN. Programming frameworks for deep learning algorithms[J]. Big Data Research, 2018, 4(4): 2018040. DOI: 10.11959/j.issn.2096-0271.2018040.
近年来,深度学习算法日益流行,在各种领域的应用都取得了出色的效果,受到工业界和学术界的广泛关注。越来越多的研究者开始利用深度学习算法解决实际问题(如图像分类、图像识别、语音识别、自然语言处理等)。人们提出了各种各样的深度学习编程框架,便于研究者们开发新的深度学习算法。这些深度学习库的设计原则、抽象层次各有不同。对常见的深度学习编程框架进行了分类介绍,针对深度学习编程框架设计中的关键问题进行了分析,并且对未来深度学习编程框架的发展方向做了展望,为以后深度学习编程框架的设计提供了设计思路和方向。
In recent years
deep learning algorithms became increasingly pervasive.It has drew extensive attentions from both researchers and industries
as it achieves very promising results on many applications of various fields.More and more researchers began to use deep learning algorithms to solve practical problems (e.g.
image classification
image recognition
speech recognition
and natural language processing).Many deep learning frameworks and libraries were proposed so that researchers can develop new deep learning algorithms in a more convenient fashion.These frameworks and libraries were different in many aspects (e.g.
design principles and abstraction).Firstly
several pervasive deep learning frameworks were introduced
and then the critical issue of designing such frameworks was analyzed.At last
the future challenges of designing deep learning frameworks were discussed.The study provides ideas and directions for future design.
HE K , ZHANG X , REN S , et al . Deep residual learning for image recognition [C ] // 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),June 26-July 1,2016,Las Vegas,USA . Piscataway:IEEE Press , 2016 : 770 - 778 .
ABADI M , BARHAM P , CHEN J , et al . TensorFlow:a system for large-scale machine learning [C ] // The 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16),November 2-4,2016,Savannah,USA . Berkeley:USENIX Association , 2016 : 265 - 284 .
JOUPPI N P , YOUNG C , PATIL N , et al . In-datacenter performance analysis of a tensor processing unit [C ] // The 44th Annual International Symposium on Computer Architecture,June 24-28,2017,Toronto,Canada . New York:ACM Press , 2017 : 1 - 12 .
ZHANG M , ZHOU Z . ML-KNN :a lazy learning approach to multi-label learning [J ] . Pattern Recognition , 2007 , 40 ( 7 ): 2038 - 2048 .
YANG S L , LI Y S , HU X X , et al . Optimization study on k value of kmeans algorithm [J ] . Systems EngineeringTheory & Practice , 2006 , 26 ( 2 ): 97 - 101 .
MADIGAN D , YORK J . Bayesian graphical models for discrete data [J ] . International Statistical Review , 1995 , 63 ( 2 ): 215 - 232 .
OSUNA E , FREUND R , GIROSI F . Training svm:an application to face detection [R ] . 1997 .
LECUN Y , KAVUKCUOGLU K , FARABET C . Convolutional networks and applications in vision [C ] // International Symposium on Circuits and Systems,May 30-June 2,2010,Paris,France . Piscataway:IEEE Press , 2010 : 253 - 256 .
KRIZHEVSKY A , SUTSKEVER I , HINTON G E . ImageNet classification with deep convolutional neural networks [C ] // The 25th International Conference on Neural Information Processing Systems,December 3-6,2012,Lake Tahoe,Nevada.New York:Curran Associates Inc . , 2012 : 1 - 9 .
SIMONYAN K , ZISSERMAN A . Very deep convolutional networks for large-scale image recognition [J ] . Computer Science , 2014 :arXiv:1409.1556.
CHEN T , LI M , LI Y , et al . MXNet :a flexible and efficient machine learning library for heterogeneous distributed systems [J ] . Statistics , 2015 :arXiv:1512.01274.
TEAM T D , ALRFOU R , ALAIN G , et al . Theano:a Python framework for fast computation of mathematical expressions [J ] . Computer Science , 2016 :arXiv:1605.02688.
COLLOBERT R , KAVUKCUOGLU K , FARABET C . Torch7:a matlab-like environment for machine learning [C ] // The 25th Annual Conference on Neural Information Processing Systems,December 12-14,2011,Granada,Spain.[S.l.:s.n] . 2011 : 1 - 6 .
JIA Y , SHELHAMER E , DONAHUE J , et al . Caffe:convolutional architecture for fast feature embedding [J ] . Computer Science , 2014 :arXiv:1408.5093.
CHEN T , DU Z , SUN N , et al . DianNao:a small-footprint high-throughput accelerator for ubiquitous machinelearning [C ] // The 19th International Conference on Architectural Support for Programming Languages and Operating Systems,March 1-5,2014,Salt Lake City,USA . New York:ACM Press , 2014 : 269 - 284 .
CHEN Y , LUO T , LIU S , et al . DaDianNao:a machine-learning supercomputer [C ] // The 47th Annual IEEE/ACM International Symposium on Microarchitecture,December 13-17,2014,Cambridge,UK . Washington DC:IEEE Computer Society , 2014 : 609 - 622 .
DU Z , FASTHUBER R , CHEN T , et al . ShiDianNao [C ] // The 42nd Annual International Symposium on Computer Architecture,June 13-17,2015,Portland,USA.[S.l.:s.n] . 2015 : 92 - 104 .
LIU D , CHEN T , LIU S , et al . PuDianNao:a polyvalent machine learning accelerator [C ] // The 20th International Conference on Architectural Support for Programming Languages and Operating Systems,March 14-18,2015,Istanbul,Turkey . New York:ACM Press , 2015 : 369 - 381 .
ZHANG S , DU Z , ZHANG L , et al . Cambricon-X:an accelerator for sparse neural networks [C ] // The 49th Annual IEEE/ACM International Symposium on Microarchitecture,October 15-19,2016,Taipei,China . Piscataway:IEEE Press , 2016 : 1 - 12 .
LIU S , DU Z , TAO J , et al . Cambricon:an instruction set architecture for neural networks [C ] // The 43rd International Symposium on Computer Architecture,June 18-22,2016,Seoul,Korea . New York:ACM Press , 2016 : 393 - 405 .
WEI R , SCHWARTZ L , ADVE V . DLVM:a modern compiler infrastructure for deep learning systems [J ] . Computer Science , 2017 :arXiv:1711.03016.
CHEN T , MOREAU T , JIANG Z , et al . TVM:end-to-end optimization stack for deep learning [J ] . Computer Science , 2018 :arXiv:1802.04799v1.
0
浏览量
1009
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621