同济大学计算机科学与技术系 上海 201804
[ "袁书寒,男,同济大学博士生,主要研究方向为自然语言处理、深度学习、大数据分析。" ]
[ "向阳,男,同济大学教授、博士生导师,主要研究方向为大数据分析、云计算、语义计算、管理信息系统,主持和参与多项国家“973”计划、“863”计划、国家科技支撑计划、国家自然科学基金项目,近年来发表论文50余篇。" ]
[ "鄂世嘉,男,同济大学博士生,CCF学生会员,主要研究方向为云计算、知识图谱、大数据系统。" ]
网络首发:2015-06,
纸质出版:2015-06-20
移动端阅览
袁书寒, 向阳, 鄂世嘉. 基于特征学习的文本大数据内容理解及其发展趋势[J]. 大数据, 2015,1(3):65-74.
Shuhan Yuan, Yang Xiang, Shijia E. Text Big Data Content Understanding and Development Trend Based on Feature Learning[J]. BIG DATA RESEARCH, 2015, 1(3): 65-74.
袁书寒, 向阳, 鄂世嘉. 基于特征学习的文本大数据内容理解及其发展趋势[J]. 大数据, 2015,1(3):65-74. DOI: 10.11959/j.issn.2096-0271.2015030.
Shuhan Yuan, Yang Xiang, Shijia E. Text Big Data Content Understanding and Development Trend Based on Feature Learning[J]. BIG DATA RESEARCH, 2015, 1(3): 65-74. DOI: 10.11959/j.issn.2096-0271.2015030.
大数据中蕴含着重要的价值信息,文本大数据作为大数据的重要组成部分,是人类知识的主要载体。特征作为数据内在规律的反映,将文本大数据映射到反映数据本质的特征空间是文本大数据语义理解的重要手段。介绍了文本大数据的特征表示、特征学习,进而梳理了特征学习在文本大数据内容理解中的进展,最后阐述了基于特征学习的文本大数据内容理解未来的发展趋势。
Big data contains important value information.Text big data as an important part of big data is the main carrier of human knowledge.Feature represents the inherent law of the data.Mapping the text big data to its feature space which reflects the nature of data is an important method to understand the semantic meaning of the text.Text big data feature representations and feature learning were reviewed.Then the progress of feature learning used in text content understanding was presented.Finally
the future development trends of big text data content understanding were discussed.
Bengio Y . Deep learning: theoretical motivations . Presented at the Deep Learning Summer School , 2015
Bengio Y , Courville A , Vincent P , . Representation learning: a review and new perspectives . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2013 , 35 ( 8 ): 1798 ~ 1828
Bengio Y . Learning deep architectures for AI . Foundations and Trends in Machine Learning , 2009 , 2 ( 1 ): 1 ~ 127
Hinton G E , Osindero S . A fast learning algorithm for deep belief nets . Neural Computation , 2006 , 18 ( 7 ): 1527 ~ 1554
Srivastava N , Hinton G , Krizhevsky A , et al . Dropout: asimple way to prevent neural networks from overfitting . Journal of Machine Learning Research , 2014 , 15 ( 1 ): 1929 ~ 1958
Nair V , Hinton G E . Rectified linear units improve restricted boltzmann machines . Proceedings of the 27th International Conference on Machine Learning (ICML-10) , Haifa,Israel , 2010 : 807 ~ 814
Miller G A . WordNet: a lexical database for English . Communications of the ACM , 1995 , 38 ( 11 ): 39 ~ 41
董振东 , 董强 , 郝长伶 . 知网的理论发现 . 中文信息学报 , 2007 , 21 ( 4 ): 3 ~ 9
Dong Z D , Dong Q , Hao C L . Theoretical findings of HowNet . Journal of Chinese Information Proceeding , 2007 , 21 ( 4 ): 3 ~ 9
Collobert R , Weston J . A unified architecture for natural language processing : deep neural networks with multitask learning . Proceedings of the 25th International Conference on Machine Learning , Helsinki,Finland , 2008
Mikolov T , Corrado G , Chen K , et al . Efficient estimation of word representations in vector space . Proceedings of Workshop at ICLR , Florida,USA , 2013 : 1 ~ 12
Maaten L V D , Hinton G . Visualizing data using t-SNE . Journal of Machine Learning Research , 2008 ( 9 ): 2579 ~ 2605
Mitchell J , Lapata M . Composition in distributional models of semantics . Cognitive Science , 2010 , 34 ( 8 ): 1388 ~ 1429
Socheer R , Perelygin A , Wu J Y , et al . Recursive deep models for semantic compositionality over a sentiment treebank . Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2013) , Washington DC,USA , 2013 : 1631 ~ 1642
Bordes A , Usunier N , Garcia-Duran A , et al . Translating embeddings for modeling multi-relational data . Proceedings of Conference on Advances in Neural Information Processing Systems (NIPS) , South Lake Tahoe,Nevoda,US , 2013 : 2787 ~ 2795
Garcia-Durran A , Bordes A , Usunier N . Composing relationships with translations . Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2015) , Lisbon,Portugal , 2015 : 286 ~ 290
Sukhbaatar S , Szlam A , Weston J , et al . End-to-end memory networks . arXiv Preprint arXiv:1503.08895 , 2015
0
浏览量
402
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621