科学数据复用研究的演化、知识体系与方法工具——兼论第四科研范式的影响

黄欣卓 米加宁 章昌平 巩宜萱

科研管理 ›› 2022, Vol. 43 ›› Issue (8) : 100-108.

PDF(786 KB)
PDF(786 KB)
科研管理 ›› 2022, Vol. 43 ›› Issue (8) : 100-108.
论文

科学数据复用研究的演化、知识体系与方法工具——兼论第四科研范式的影响

  • 黄欣卓1,米加宁1,章昌平1,2,巩宜萱1
作者信息 +

The evolution, knowledge system and method tools of scientific data reuse——A concurrent discussion of the influence of the fourth research paradigm

  • Huang Xinzhuo1, Mi Jianing1, Zhang Changping1,2, Gong Yixuan1#br#
Author information +
文章历史 +

摘要

科学数据的开放共享推动了学术界对于其公共学术价值的认识和利用,大数据、科研基础设施和科研环境信息化使得科学研究向第四研究范式转型,科学数据复用为新的科学发现和知识创新提供了有效途径。科学数据复用研究受到学术界的关注,相关研究成果在近20年来日益丰富,但该学科领域的知识体系尚未建立。本研究以Web of Science核心合集数据库作为数据采集来源,运用HistCite和CiteSpace软件绘制知识图谱并结合文本内容分析,梳理了科学数据复用研究的发展态势、演进过程及研究结构,研究发现:科学数据复用研究经历了萌芽阶段(2006年前)、发展阶段(2007—2014年)和爆发阶段(2015年至今),主要包括基本内涵、共享与复用关系、数据复用影响因素、学科领域研究以及数据复用伦理等五个方面的研究主题。基于此,本研究从保障平台、理论基础、研究分支和方法工具四个层面构建科学数据复用研究的知识体系,并提出科学数据公共学术价值、科学数据复用行为及机制、科学数据复用评价及影响力、科学数据复用政策和领域科学数据复用研究等几个亟须深入开展的研究主题。本研究为今后开展科学数据复用的相关研究提供理论和实践指导。
〖HT5”H〗

Abstract

   Data reuse, the reuse of scientific data to solve new research problems, accepts both the new interpretation of data explored by other researchers and the new test of original research data by researchers using other analysis technologies. Although big data, research infrastructure and informatization of the research environment are transforming scientific research into the fourth research paradigm, data reuse has provided an effective way for new scientific discovery and knowledge innovation. Its public value increases daily as a strategic resource of national scientific and technological innovation and scientific research infrastructure. The research of data reuse has received much attention in the past 20 years, but the knowledge system in this subject area has not yet been established and lacks proper planning and forward-looking prediction.
   This study comprehensively uses the bibliometric methods and knowledge map analysis tools (such as HistCite and CiteSpace) to process and analyze the large-scale research literature data objectively and intuitively. Using the Web of Science database as the source of literature collection, we utilize the "data reuse", "data re-use", "data reusing", "reusing data", "reusing of data", "secondary data use", and "data re-usability" as the keywords and the deadline of data collection was March 20, 2021. This study involves 364 papers in sum finally.
    The main findings and theoretical contributions of this study are as follows:
    (1) The existing research on data reuse presents the development path, evolution process, driving factors, and research structure of "two main lines", "three stages", "four forces" and "five core fields". From the perspective of the development path, data reuse is mainly carried out along two main lines, which run through three evolutionary stages: germination (before 2006), development (2007-2014) and outbreak (2015-). From the keyword co-occurrence analysis, data reuse research has five core fields: basic theoretical research, data sharing and reuse relationship, user behavior and scientific research management, data reuse ethics, and data reuse in various disciplines.
    (2) The knowledge system of data reuse research consists of four levels, including the guarantee platform layer, theoretical foundation layer, research branch layer and method tool layer. The development of digital scientific research and data infrastructure, the change of data behavior, scientific research evaluation, and the development of big data technology are the frontiers and growth points of developing four levels of knowledge systems and methods and tools. They also constitute the four driving forces for the in-depth development of scientific data reuse: the needs of big scientific research and the formation of a digital scientific research environment, the development of the data-intensive scientific discovery, the recognition of scientific data achievements, and the development of digital technology.
    (3) The subsequent research on data reuse has an opportunity window for academic research in five aspects: public academic value of scientific data, behavior and mechanism of data reuse, influence of data reuse, policy of data reuse, and data reuse in the different fields. We expect the academic community to follow up continuously on these research topics and provide theoretical supports for practically improving scientific data reuse.

关键词

科学数据 / 数据复用 / 第四研究范式 / 引文分析 / 知识图谱


Key words

scientific data / data reuse / fourth research paradigm / citation analysis / knowledge map

引用本文

导出引用
黄欣卓 米加宁 章昌平 巩宜萱. 科学数据复用研究的演化、知识体系与方法工具——兼论第四科研范式的影响[J]. 科研管理. 2022, 43(8): 100-108
Huang Xinzhuo, Mi Jianing, Zhang Changping, , Gong Yixuan . The evolution, knowledge system and method tools of scientific data reuse——A concurrent discussion of the influence of the fourth research paradigm[J]. Science Research Management. 2022, 43(8): 100-108

参考文献

[1] 中华人民共和国科学技术部. SDS/T 1003-2004 科学数据共享工程技术标准[S]. 2005.
[2] AS Zimmerman. New knowledge from old data[J]. ence Technology & Human Values, 2008,33(5): 631-652.
[3] A Y. Data reuse and users' trust judgments: toward trusted data curation[D]. University of North Carolina at Chapel Hill Graduate School, 2015.
[4] SHEN Y. Data Sustainability and Reuse Pathways of Natural Resources and Environmental Scientists[J]. The new review of academic librarianship, 2018,24(2): 136-156.
[5] European Commission Expert Group on FAIR Data. TURNING FAIR INTO REALITY[R].European Union, 2018.
[6] 傅天珍, 郑江平. 国外面向科研人员的科学数据共享探析[J]. 图书馆论坛, 2015,35(02): 76-81.
[7] PIWOWAR H A, VISION T J, WHITLOCK M C. Data archiving is a good investment[J]. Nature, 2011,473(7347): 285.
[8] Kathleen Marie Fear. Measuring and anticipating the impact of data reuse[EB/OL]. (2013-10-30)[2020-10-10]. https://deepblue.lib.umich.edu/bitstream/handle/2027.42/102481/kfear_1.pdf?sequence=1&isAllowed=y.
[9] 孙玉伟, 成颖, 谢娟. 科研人员数据复用行为研究:系统综述与元综合[J]. 中国图书馆学报, 2019,45(03): 110-130.
[10] 张莹, 戚景琳, 孙玉伟. 管理学科研人员数据复用行为特征探析[J]. 信息资源管理学报, 2020,10(4): 79-87.
[11] WALLIS J C, ROLANDO E, BORGMAN C L, et al. If We Share Data, Will Anyone Use Them? Data Sharing and Reuse in the Long Tail of Science and Technology[J]. PloS one, 2013,8(7): e67332.
[12] FEDERER L M, LU Y L, JOUBERT D J, et al. Biomedical Data Sharing and Reuse: Attitudes and Practices of Clinical and Scientific Research Staff[J]. PLoS One, 2015,10(6): e129506.
[13] H C. Understanding and using archaeological topographic surveys: COMPUTER APPLICATIONS AND QUANTITATIVE METHODS IN ARCHAEOLOGY[C], 2001.
[14] COENEN A, MCNEIL B, BAKKEN S, et al. Toward comparable nursing data: American Nurses Association criteria for data sets, classification systems, and nomenclatures[J]. Comput Nurs, 2001,19(6): 240-246, 246-248.
[15] FANIEL I M, JACOBSEN T E. Reusing Scientific Data: How Earthquake Engineering Researchers Assess the Reusability of Colleagues’ Data[J]. Computer supported cooperative work, 2010,19(3-4): 355-375.
[16] GUNST R F, BASU S, BRUNELL R. Defining and estimating global mean temperature anomalies[J]. Journal of Climate, 1993,6(7): 1368-1374.
[17] J N J, U N. Knowledge and data reuse in ship system design and engineering[J]. Proceedings of the 8th International Design Conference, 2004,Vols 1-3: 441-446.
[18] WHITE H. A Reality Check for Data Snooping[J]. Econometrica, 2000,68(5): 1097-1126.
[19] ZIMMERMAN A. Not by metadata alone: the use of diverse forms of knowledge to locate data for reuse[J]. International journal on digital libraries, 2007,7(1): 5-16.
[20] FRANK R D, YAKEL E, FANIEL I M, et al. Destruction/reconstruction: preservation of archaeological and zoological research data[J]. Archival Science, 2015,15(2): 141-167.
[21] BISHOP, LIBBY. Ethical sharing and reuse of qualitative data[J]. The Australian journal of social issues, 2009,44(3): 255-272.
[22] BISHOP L. Using archived qualitative data for teaching: practical and ethical considerations[J]. International journal of social research methodology, 2012,15(4): 341-350.
[23] CHATFIELD A T, REDDICK C G. A longitudinal cross-sector analysis of open data portal service capability: The case of Australian local governments[J]. Government information quarterly, 2017,34(2): 231-243.
[24] ABELLA A, ORTIZ-DE-URBINA-CRIADO M, DE-PABLOS-HEREDERO C. A model for the analysis of data-driven innovation and value generation in smart cities' ecosystems[J]. Cities, 2017,64: 47-53.
[25] TEMPINI N. Till data do us part: Understanding data-based value creation in data-intensive infrastructures[J]. Information and Organization, 2017,27(4): 191-210.
[26] YOON A, KIM Y. Social scientists' data reuse behaviors: Exploring the roles of attitudinal beliefs, attitudes, norms, and data repositories[J]. Library & Information Science Research, 2017,39(3): 224-233.
[27] 章昌平, 米加宁, 李大宇. 数据科学研究在社会科学中的应用前景[J]. 社会科学, 2018(09): 78-88.
[28] KANSA S W. Using Linked Open Data to Improve Data Reuse in Zooarchaeology[J]. Ethnobiology letters, 2015,6(2): 224-231.
[29] NICHOLS B N, POHL K M. Neuroinformatics Software Applications Supporting Electronic Data Capture, Management, and Sharing for the Neuroimaging Community[J]. Neuropsychol Rev, 2015,25(3): 356-368.
[30] HEY T, TANSLEY S, TOLLE K. The fourth paradigm: data-intensive scientific discovery[J]. proceedings of the ieee, 2009,99(8): 1334-1337.
[31] WILKINSON M D, DUMONTIER M, JAN A I, et al. Addendum: The FAIR Guiding Principles for scientific data management and stewardship[J]. Sci Data, 2019,6(1): 6.
[32] TENOPIR C, DALTON E D, ALLARD S, et al. Changes in Data Sharing and Data Reuse Practices and Perceptions among Scientists Worldwide[J]. PloS one, 2015,10(8): e134826.
[33] 邓君, 宋文凤. 科学数据价值鉴定研究进展[J]. 情报科学, 2012,30(06): 942-946.
[34] L T P, LAURIAULT B, CRAIG D R, et al. Today's Data are Part of Tomorrow's Research: Archival Issues in the Sciences[J]. Archivaria, 2007(64): 123-179.

基金

国家社会科学基金重大项目:“数据科学对社会科学转型的重大影响研究”(17ZDA030);中央高校基本科研业务费专项资金资助项目:“开放科学数据的学术价值及其影响力测度:一项社会调查数据来源的研究”(HIT.HSS.201841);广西壮族自治区科协资助青年科技工作者专项课题:“大数据环境下广西科技工作者数据素养测评及培育策略研究”(桂科协〔2019〕ZB-13);广西哲学社会科学规划研究课题:“大数据驱动下面向科研第四范式的高校图书馆应对策略研究”(17FTQ004)。

PDF(786 KB)

Accesses

Citation

Detail

段落导航
相关文章

/