[ "陈思莹(2001- ),女,哈尔滨工业大学计算学部硕士生,主要研究方向为数据清洗。" ]
[ "张丹(1992- ),女,博士,哈尔滨工业大学计算学部研究员,主要研究方向为数据质量、以人为中心的人工智能。" ]
[ "丁小欧(1993- ),女,博士,哈尔滨工业大学计算学部助理教授,主要研究方向为数据清洗、时间数据质量管理、时间数据挖掘、工业数据清理和多元时间序列数据中的异常行为挖掘。" ]
[ "王宏志(1978- ),男,博士,哈尔滨工业大学计算学部教授,计算机科学与工程系主任,海量数据计算研究中心主任,黑龙江省大数据科学与工程重点实验室主任,主要研究方向为数据库和大数据。" ]
网络首发:2024-03,
纸质出版:2024-03-15
移动端阅览
陈思莹, 张丹, 丁小欧, 等. 基于数据质量的公平数据定价[J]. 大数据, 2024,10(2):54-67.
Siying CHEN, Dan ZHANG, Xiaoou DING, et al. Fair data pricing based on data quality[J]. Big data research, 2024, 10(2): 54-67.
陈思莹, 张丹, 丁小欧, 等. 基于数据质量的公平数据定价[J]. 大数据, 2024,10(2):54-67. DOI: 10.11959/j.issn.2096-0271.2024025.
Siying CHEN, Dan ZHANG, Xiaoou DING, et al. Fair data pricing based on data quality[J]. Big data research, 2024, 10(2): 54-67. DOI: 10.11959/j.issn.2096-0271.2024025.
随着数据的爆炸式增长,以数据为关键要素的数字经济进一步发展。在数据市场中,建立公平高效的定价交易系统变得尤为重要。针对数据市场的公平性,提出了基于数据质量的数据市场模型。首先,以用户的需求为目标,制定综合数据质量的定价策略。其次,为防止用户的恶意欺诈行为,设计了保障公平数据交易的市场机制。最后,在原始数据交易的基础上,讨论了与数据质量相关的数据清洗服务,通过游戏理论中的机制设计多用户清洗价值分配机制。通过实验证明了按照模型构建系统的效率和有效性,可以保证数据市场的公平性。
With the explosive growth of data
the digital economy
where data serves as a crucial element
continues to advance.In the context of data markets
establishing a fair and efficient pricing and trading system becomes paramount.Addressing fairness within data markets
this study introduces a data market model based on data quality.Firstly
aiming at user demands
a comprehensive pricing strategy based on data quality is formulated.Secondly
to mitigate malicious fraudulent behaviors from users
a market mechanism ensuring fair data transactions is designed.Lastly
building upon primary data transactions
data cleaning services related to data quality are discussed.A multi-user value allocation mechanism for cleaning is designed using principles from game theory.Experimental results demonstrate that constructing systems according to this model ensures both efficiency and fairness within data markets.
李兵兵 . 我国数据市场发展的理论基础与路径 [J ] . 社会科学动态 , 2022 ( 11 ): 34 - 37 .
LI B B . Theoretical foundation and path on the development of Chinese data market [J ] . Dynamics of Social Sciences , 2022 ( 11 ): 34 - 37 .
ZHANG M X , BELTRÁN F , LIU J M . A survey of data pricing for data marketplaces [J ] . IEEE Transactions on Big Data , 2023 , 9 ( 4 ): 1038 - 1056 .
尹传儒 , 金涛 , 张鹏 , 等 . 数据资产价值评估与定价:研究综述和展望 [J ] . 大数据 , 2021 , 7 ( 4 ): 14 - 27 .
YIN C R , JIN T , ZHANG P , et al . Assessment and pricing of data assets:research review and prospect [J ] . Big Data Research , 2021 , 7 ( 4 ): 14 - 27 .
任洪润 , 朱扬勇 . 基于数据市场类型的数据定价模型研究 [J ] . 大数据 , 2023 , 9 ( 4 ): 116 - 138 .
REN H R , ZHU Y Y . Research on data pricing model based on data market type [J ] . Big Data Research , 2023 , 9 ( 4 ): 116 - 138 .
李然辉 . 数据资产价值评估模型的理论研究与技术实现探讨 [Z ] . 2018 .
LI R H . Theoretical research and technical realization of data asset value evaluation model [Z ] . 2018 .
YANG J , ZHAO C C , XING C X . Big data market optimization pricing model based on data quality [J ] . Complexity,2019 , 2019 :5964068.
YU H F , ZHANG M X . Data pricing strategy based on data quality [J ] . Computers and Industrial Engineering , 2017 , 112 ( C ): 1 - 10 .
HECKMAN J R , BOEHMER E , PETERS E H , et al . A pricing model for data markets [J ] . iSchools , 2015 .
DING X O , WANG H Z , ZHANG D , et al . A fair data market system with data quality evaluation and repairing recommendation [C ] // Proceedings of AsiaPacific Web Conference . Cham:Springer , 2015 : 855 - 858 .
DEEP S , KOUTRIS P , BIDASARIA Y . QIRANA demonstration [J ] . Proceedings of the VLDB Endowment , 2017 , 10 ( 12 ): 1949 - 1952 .
PEI J . A survey on data pricing:from economics to data science [J ] . IEEE Transactions on Knowledge and Data Engineering , 2022 , 34 ( 10 ): 4586 - 4608 .
CHEN L J , KOUTRIS P , KUMAR A , et al . Model-based pricing for machine learning in a data marketplace [EB ] . arXiv preprint , 2018 ,arXiv:1805.11450.
DING X O , WANG H Z , SU J X , et al . Cleanits [J ] . Proceedings of the VLDB Endowment , 2019 , 12 ( 12 ): 1786 - 1789 .
丁小欧 , 王宏志 , 于晟健 . 工业时序大数据质量管理 [J ] . 大数据 , 2019 , 5 ( 6 ): 1 - 11 .
DING X O , WANG H Z , YU S J . Data quality management of industrial temporal big data [J ] . Big Data Research , 2019 , 5 ( 6 ): 1 - 11 .
丁小欧 , 王宏志 , 张笑影 , 等 . 数据质量多种性质的关联关系研究 [J ] . 软件学报 , 2016 , 27 ( 7 ): 1626 - 1644 .
DING X O , WANG H Z , ZHANG X Y , et al . Association relationships study of multi-dimensional data quality [J ] . Journal of Software , 2016 , 27 ( 7 ): 1626 - 1644 .
SIDI F , SHARIAT PANAHY P H , AFFENDEY L S , et al . Data quality:a survey of data quality dimensions [C ] // Proceedings of the 2012 International Conference on Information Retrieval & Knowledge Management . Piscataway:IEEE Press , 2012 : 300 - 304 .
WANG R Y , STOREY V C , FIRTH C P . A framework for analysis of data quality research [J ] . IEEE Transactions on Knowledge and Data Engineering , 1995 , 7 ( 4 ): 623 - 640 .
STRONG D M , LEE Y W , WANG R Y . Data quality in context [J ] . Communications of the ACM , 1997 , 40 ( 5 ): 103 - 110 .
张小伟 , 江东 , 袁野 . 基于博弈论和拍卖的数据定价综述 [J ] . 大数据 , 2021 , 7 ( 4 ): 61 - 79 .
ZHANG X W , JIANG D , YUAN Y . A survey of game theory and auction-based data pricing [J ] . Big Data Research , 2021 , 7 ( 4 ): 61 - 79 .
SHAPLEY L S . A value for n-person games [M ] . Santa Monica : RAND Corporation , 1952 .
0
浏览量
311
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621