深度学习编程框架

王秉睿; 兰慧盈; 陈云霁

doi:10.11959/j.issn.2096-0271.2018040

您当前的位置：

首页 >

文章列表页 >

深度学习编程框架

专题：大数据驱动的智能计算体系架构 | 更新时间：2024-06-03

- 深度学习编程框架
- Programming frameworks for deep learning algorithms
- 大数据 2018年4卷第4期页码：2018040
- 作者机构：
  
  1. 中国科学技术大学，安徽合肥 230026
  2. 中国科学院计算技术研究所，北京 100190
  3. 中国科学院大学，北京 100049
- 作者简介：
  
  [ "王秉睿（1994-），男，中国科学技术大学硕士生，主要研究方向为计算机体系结构、机器学习编程方法、人工智能。" ]
  [ "兰慧盈（1990-），女，中国科学院计算技术研究所博士生，主要研究方向为计算机体系结构、领域专用编程语言和编译器、人工智能。" ]
  [ "陈云霁（1983-），男，博士，中国科学院计算技术研究所研究员（正教授）、博士生导师，主要研究方向为计算机体系结构、人工智能。" ]
- 基金信息：
  
  国家重点研发计划基金资助项目;National Key Research and Development Program of China(2017YFB1003101);国家自然科学基金资助项目;The National Netural Science Foundation of China(61472396);国家自然科学基金资助项目;The National Netural Science Foundation of China(61432016);国家自然科学基金资助项目;The National Netural Science Foundation of China(61473275);国家自然科学基金资助项目;The National Netural Science Foundation of China(61522211);国家自然科学基金资助项目;The National Netural Science Foundation of China(61532016);国家自然科学基金资助项目;The National Netural Science Foundation of China(61521092);国家自然科学基金资助项目;The National Netural Science Foundation of China(61502446);国家自然科学基金资助项目;The National Netural Science Foundation of China(61672491);国家自然科学基金资助项目;The National Netural Science Foundation of China(61602441);国家自然科学基金资助项目;The National Netural Science Foundation of China(61602446);国家自然科学基金资助项目;The National Netural Science Foundation of China(61732002);国家自然科学基金资助项目;The National Netural Science Foundation of China(61702478);北京市科技计划基金资助项目;Beijing Science and Technology Projects(Z151100000915072);中国科学院STS计划；国家重点基础研究发展计划（“973”计划）基金资助项目;STS Project of Chinese Academy of Sciences,The National Key Basic Research Program of China(973 Program)(2015CB358800)
- DOI：10.11959/j.issn.2096-0271.2018040
  中图分类号： F830
- 网络首发：2018-07，
  
  纸质出版：2018-07-15
- 稿件说明：
移动端阅览
王秉睿, 兰慧盈, 陈云霁. 深度学习编程框架[J]. 大数据, 2018,4(4):2018040.

Bingrui WANG, Huiying LAN, Yunji CHEN. Programming frameworks for deep learning algorithms[J]. Big Data Research, 2018, 4(4): 2018040.
王秉睿, 兰慧盈, 陈云霁. 深度学习编程框架[J]. 大数据, 2018,4(4):2018040. DOI： 10.11959/j.issn.2096-0271.2018040.

Bingrui WANG, Huiying LAN, Yunji CHEN. Programming frameworks for deep learning algorithms[J]. Big Data Research, 2018, 4(4): 2018040. DOI： 10.11959/j.issn.2096-0271.2018040.

摘要

近年来，深度学习算法日益流行，在各种领域的应用都取得了出色的效果，受到工业界和学术界的广泛关注。越来越多的研究者开始利用深度学习算法解决实际问题（如图像分类、图像识别、语音识别、自然语言处理等）。人们提出了各种各样的深度学习编程框架，便于研究者们开发新的深度学习算法。这些深度学习库的设计原则、抽象层次各有不同。对常见的深度学习编程框架进行了分类介绍，针对深度学习编程框架设计中的关键问题进行了分析，并且对未来深度学习编程框架的发展方向做了展望，为以后深度学习编程框架的设计提供了设计思路和方向。

Abstract

In recent years

deep learning algorithms became increasingly pervasive.It has drew extensive attentions from both researchers and industries

as it achieves very promising results on many applications of various fields.More and more researchers began to use deep learning algorithms to solve practical problems (e.g.

image classification

image recognition

speech recognition

and natural language processing).Many deep learning frameworks and libraries were proposed so that researchers can develop new deep learning algorithms in a more convenient fashion.These frameworks and libraries were different in many aspects (e.g.

design principles and abstraction).Firstly

several pervasive deep learning frameworks were introduced

and then the critical issue of designing such frameworks was analyzed.At last

the future challenges of designing deep learning frameworks were discussed.The study provides ideas and directions for future design.

关键词

Keywords

references

HE K , ZHANG X , REN S , et al . Deep residual learning for image recognition [C ] // 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),June 26-July 1,2016,Las Vegas,USA . Piscataway:IEEE Press , 2016 : 770 - 778 .

ABADI M , BARHAM P , CHEN J , et al . TensorFlow:a system for large-scale machine learning [C ] // The 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16),November 2-4,2016,Savannah,USA . Berkeley:USENIX Association , 2016 : 265 - 284 .

JOUPPI N P , YOUNG C , PATIL N , et al . In-datacenter performance analysis of a tensor processing unit [C ] // The 44th Annual International Symposium on Computer Architecture,June 24-28,2017,Toronto,Canada . New York:ACM Press , 2017 : 1 - 12 .

ZHANG M , ZHOU Z . ML-KNN :a lazy learning approach to multi-label learning [J ] . Pattern Recognition , 2007 , 40 ( 7 ): 2038 - 2048 .

YANG S L , LI Y S , HU X X , et al . Optimization study on k value of kmeans algorithm [J ] . Systems EngineeringTheory ＆ Practice , 2006 , 26 ( 2 ): 97 - 101 .

MADIGAN D , YORK J . Bayesian graphical models for discrete data [J ] . International Statistical Review , 1995 , 63 ( 2 ): 215 - 232 .

OSUNA E , FREUND R , GIROSI F . Training svm:an application to face detection [R ] . 1997 .

LECUN Y , KAVUKCUOGLU K , FARABET C . Convolutional networks and applications in vision [C ] // International Symposium on Circuits and Systems,May 30-June 2,2010,Paris,France . Piscataway:IEEE Press , 2010 : 253 - 256 .

KRIZHEVSKY A , SUTSKEVER I , HINTON G E . ImageNet classification with deep convolutional neural networks [C ] // The 25th International Conference on Neural Information Processing Systems,December 3-6,2012,Lake Tahoe,Nevada.New York:Curran Associates Inc . , 2012 : 1 - 9 .

SIMONYAN K , ZISSERMAN A . Very deep convolutional networks for large-scale image recognition [J ] . Computer Science , 2014 :arXiv:1409.1556.

CHEN T , LI M , LI Y , et al . MXNet :a flexible and efficient machine learning library for heterogeneous distributed systems [J ] . Statistics , 2015 :arXiv:1512.01274.

TEAM T D , ALRFOU R , ALAIN G , et al . Theano:a Python framework for fast computation of mathematical expressions [J ] . Computer Science , 2016 :arXiv:1605.02688.

COLLOBERT R , KAVUKCUOGLU K , FARABET C . Torch7:a matlab-like environment for machine learning [C ] // The 25th Annual Conference on Neural Information Processing Systems,December 12-14,2011,Granada,Spain.[S.l.:s.n] . 2011 : 1 - 6 .

JIA Y , SHELHAMER E , DONAHUE J , et al . Caffe:convolutional architecture for fast feature embedding [J ] . Computer Science , 2014 :arXiv:1408.5093.

CHEN T , DU Z , SUN N , et al . DianNao:a small-footprint high-throughput accelerator for ubiquitous machinelearning [C ] // The 19th International Conference on Architectural Support for Programming Languages and Operating Systems,March 1-5,2014,Salt Lake City,USA . New York:ACM Press , 2014 : 269 - 284 .

CHEN Y , LUO T , LIU S , et al . DaDianNao:a machine-learning supercomputer [C ] // The 47th Annual IEEE/ACM International Symposium on Microarchitecture,December 13-17,2014,Cambridge,UK . Washington DC:IEEE Computer Society , 2014 : 609 - 622 .

DU Z , FASTHUBER R , CHEN T , et al . ShiDianNao [C ] // The 42nd Annual International Symposium on Computer Architecture,June 13-17,2015,Portland,USA.[S.l.:s.n] . 2015 : 92 - 104 .

LIU D , CHEN T , LIU S , et al . PuDianNao:a polyvalent machine learning accelerator [C ] // The 20th International Conference on Architectural Support for Programming Languages and Operating Systems,March 14-18,2015,Istanbul,Turkey . New York:ACM Press , 2015 : 369 - 381 .

ZHANG S , DU Z , ZHANG L , et al . Cambricon-X:an accelerator for sparse neural networks [C ] // The 49th Annual IEEE/ACM International Symposium on Microarchitecture,October 15-19,2016,Taipei,China . Piscataway:IEEE Press , 2016 : 1 - 12 .

LIU S , DU Z , TAO J , et al . Cambricon:an instruction set architecture for neural networks [C ] // The 43rd International Symposium on Computer Architecture,June 18-22,2016,Seoul,Korea . New York:ACM Press , 2016 : 393 - 405 .

WEI R , SCHWARTZ L , ADVE V . DLVM:a modern compiler infrastructure for deep learning systems [J ] . Computer Science , 2017 :arXiv:1711.03016.

CHEN T , MOREAU T , JIANG Z , et al . TVM:end-to-end optimization stack for deep learning [J ] . Computer Science , 2018 :arXiv:1802.04799v1.

浏览量

1009

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于大数据技术的P2P网贷平台风险预警模型

大数据机器学习系统研究进展

面向大数据的可扩展正则采样并行排序算法

小样本数据下特种材料基因工程的数据扩充方法

基于大数据技术的甘肃智慧旅游系统