1. 复旦大学计算机科学技术学院 上海 201203
2. 上海市数据科学重点实验室(复旦大学) 上海 201203 451191
[ "朱扬勇,男,博士,复旦大学计算机科学技术学院教授、学术委员会主任,上海市数据科学重点实验室主任。1989年起从事数据领域研究,2008年提出数据资源保护和利用,2009年发表了数据科学论文“Data explosion, data nature and dataology”,并出版专著《数据学》,对数据科学进行了系统探讨和描述。2010年创办了“International Workshop on Dataology and Data Science”, 2014年和石勇、张成奇共同创办了“International Conference on Data Science”。第462次香山科学会议“数据科学与大数据的理论问题探索”的执行主席,《大数据技术与应用丛书》主编。目前研究兴趣为数据科学、大数据。" ]
[ "熊赟,女,博士,复旦大学计算机科学技术学院副教授。2004年起从事数据领域方面的研究工作,作为项目负责人主持国家自然科学基金、上海市科委发展基金以及企业合作项目。相关研究成果在本领域国际权威期刊或会议发表论文30余篇,出版专著2本。目前研究兴趣为数据科学、大数据。" ]
网络首发:2015-05,
纸质出版:2015-05-20
移动端阅览
朱扬勇, 熊赟. 大数据是数据、技术,还是应用[J]. 大数据, 2015,1(1):70-80.
Yangyong ZHU, Yun Xiong. Defining Big Data[J]. BIG DATA RESEARCH, 2015, 1(1): 70-80.
朱扬勇, 熊赟. 大数据是数据、技术,还是应用[J]. 大数据, 2015,1(1):70-80. DOI: 10.11959/j.issn.2096-0271.2015.01.007.
Yangyong ZHU, Yun Xiong. Defining Big Data[J]. BIG DATA RESEARCH, 2015, 1(1): 70-80. DOI: 10.11959/j.issn.2096-0271.2015.01.007.
通常认为大数据是一个现有技术难以处理的复杂而庞大的数据集,这将导致一个谬误的出现:大数据都不能被处理,能处理的都不是大数据。显然,如何定义大数据是一个问题。分析了已有的大数据定义和现象,发现数据、技术和应用是大数据的三要素,定义大数据是为决策提供服务的大数据集、大数据技术和大数据应用的总称。其中,大数据集是指一个决策问题所用到的所有可能的数据,而不是一个领域的所有数据。还给出了大数据应用遇到的问题及技术挑战,并指出大数据未来的研究方向。
Generally
big data is regarded as a term about data sets so large or complex that conventional data technologies cannot handle. This statement of big data leads to confusion: none of big data has been handled by existing data technologies;or none of current successful data applications can be called as big data. Therefore
what is the best way to define big data becomes a problem. Data
technology
and application were regarded as three associated key factors of big data by analyzing the state-of-the-art of big data. A comprehensive definition on big data was defined as the umbrella of big data set
big data technology
and big data application. Here
big data set means all data that can be acquired and were related to one decision-making application instead of all data in an area or an enterprise. In addition
the issues in big data applications and the main challenges in big data technologies were discussed. Finally
the future directions of big data research were presented including data science and the technologies of big data reservation and development.
Cox M , Ellsworth D . Application-controlled demand paging for out-of-core visualizetion . Proceedings of the 8th Conference on Visualization , Phoenix, AZ, USA , 1997 235 - 244
U. S. Government Big data research and development initiative . http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release_final_2.pdf,2012 http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release_final_2.pdf,2012 . 2012
Wikipedia . Big data http://en.wikipedia.org/wiki/Big_data,2015 http://en.wikipedia.org/wiki/Big_data,2015 . 2015
Mark B . Gartner says solving ‘big data’ challenge involves more than just managing volumes of data . http://www.gartner.com/newsroom/id/1731916,2011 http://www.gartner.com/newsroom/id/1731916,2011 . 2011
. What is big data http://www.villanovau.com/resources/bi/what-is-big-data/,2015 http://www.villanovau.com/resources/bi/what-is-big-data/,2015 . 2015
. 数据科学与大数据的科学原理及发展前景 . 第462次香山科学会议 , 北京, 中国 , 2013
The scientific principle and prospect of data science and big data Proceedings of the 462nd Xiangshan Science Conference , Beijing, China , 2013
World Economic Forum . Big data, big impact: new possibilities for international development . http://www3.weforum.org/docs/WEF_TC_MFS_BigDataBigImpact_Briefing_2012.pdf,2012 http://www3.weforum.org/docs/WEF_TC_MFS_BigDataBigImpact_Briefing_2012.pdf,2012 . 2012
Zhu Y Y , Zhong N , Xiong Y . data nature and dataology . Proceedings of International Conference on Brain Informatics , Beijing, China , 2009 : 147 ~ 158 .
朱扬勇 , 熊赟 . 数据学 . 上海 : 复旦大学出版社 , 2009
Zhu Y Y , Xiong Y . Dataology and Data Science Shanghai : Fudan University Press , 2009
CODATA中国全国委员会 大数据时代的科学活动 . 北京 : 科学出版社 , 2014
CODATA China National Committee . Scientific Discovery in Big Data Era Beijing : Science Press , 2014
Zhu Y Y , Xiong Y . Defining data science . http://arxiv.org/ftp/arxiv/papers/1501/1501.05039.pdf,2015 http://arxiv.org/ftp/arxiv/papers/1501/1501.05039.pdf,2015 . 2015
Google . Google flu trends . http://www.google.org/flutrends,2008 http://www.google.org/flutrends,2008 . 2008
Greg L , Brent S , Jeremy Y . com recommendations: item-to-item collaborative filtering . IEEE Internet Computing , 2003 , 7 ( 1 ): 76 ~ 80 .
Adrian C . Higgs boson positively identified . http://news.sciencemag.org/sciencenow/2013/03/higgs-boson-positively-identifie.html?ref=hp,2013 http://news.sciencemag.org/sciencenow/2013/03/higgs-boson-positively-identifie.html?ref=hp,2013 .
吴俊伟 , 朱扬勇 . 汇计划在行动 . 上海 : 上海科学技术出版社 , 2015
Wu J W Zhu Y Y Shanghai Big Data in Action Shanghai : Shanghai Scientific and Technical Publishers , 2015
Schonberger V M , Cukier K . Big Data: A Revolution That Will Transform How We Live Work and Think . London;Hodder Export , 2013
Reichman O J , Jones M B , Schildhauer M P . Challenges and opportunities of open data in ecology . Science , 2011 , 331 ( 6018 ): 703 ~ 705
McVean G A , Abecasis D M . An integrated map of genetic variation from 1092 human genomes . Nature , 2012 , 491 ( 7422 ): 56 - 65 .
Feigelson E D , Babu G J . Big data in astronomy http://astrostatistics.psu.edu/2012Significance.pdf,2012 http://astrostatistics.psu.edu/2012Significance.pdf,2012 .
0
浏览量
1187
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621