科研管理 ›› 2024, Vol. 45 ›› Issue (2): 176-188.DOI: 10.19571/j.cnki.1000-2995.2024.02.018

• 论文 • 上一篇    下一篇

基于LDA与共现网络动态分析的技术机会识别

王金凤1,2,张芷芯1,冯立杰1,3,张珂4   

  1. 1.郑州大学管理学院,河南 郑州450001;
    2.上海海事大学中国(上海)自贸区供应链研究院,上海201306;
    3.上海海事大学物流工程学院,上海201306;
    4.郑州大学信息管理学院,河南 郑州450001

  • 收稿日期:2022-05-10 修回日期:2023-05-15 出版日期:2024-02-20 发布日期:2024-01-23
  • 通讯作者: 冯立杰
  • 基金资助:
    国家科技部创新方法工作专项:“‘端端驱动、融合赋能’创新方法新系统研究与应用示范”(2019IM020200,2019—2023);NSFC—河南联合基金重点项目:“煤矿重大灾害主动预控体系”(U1904210-4,2020—2023);上海市科技计划项目:“元易创新方法在港航物流工程与海洋装备关键技术领域的应用研究”(20040501300,2020—2023);国家重点研发专项:“风电机群服役全周期质量评估与调控技术研究”(2022YFF0608700,2022—2026);河南兴文化工程文化研究专项项目:“创新驱动战略下河南企业创新管理的机制及路径研究”(2022XWH082,2022—2024);河南省高校人文社会科学研究一般项目:“科技创新起高峰情境下河南本土企业创新方法理论及应用研究” (2023-ZZJH-039,2022—2024)。

Identification of technology opportunities based on the LDA model and co-occurrence network dynamic analysis

Wang Jinfeng1,2, Zhang Zhixin1, Feng Lijie1,3, Zhang Ke4   

  1. 1. School of Management, Zhengzhou University, Zhengzhou 450001, Henan, China;
    2. China (Shanghai) Institute of FTZ Supply Chain, Shanghai Maritime University, Shanghai 201306, China; 
    3. School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China;
    4. Information Management School, Zhengzhou University, Zhengzhou 450001, Henan,China
  • Received:2022-05-10 Revised:2023-05-15 Online:2024-02-20 Published:2024-01-23
  • Supported by:
    Innovation Method Fund of China;Joint Funds of the National Natural Science Foundation of China;Shanghai Science and Technology Program

摘要:     直面未来科技发展的高度不确定性以及技术复杂性不断叠加的外部环境,亟须采用科学高效的分析方法识别技术创新机会。为此,本文从专利文本挖掘与关键词共现网络融合的动态分析视角构建了技术机会识别路径。首先,进行专利数据的获取及预处理,进而运用LDA模型提取具象领域的技术主题及关键词,并引入TF-IDF指标进行重要度分析;其次,分别构建技术关键词整体共现网络和基于技术主题的包含时间窗口的关键词共现子网络;再次,基于共现强度分析及共现网络动态分析探究技术主题及关键词的演化过程,以此将关键词划分为持续型、衰退型、新兴型和舍弃型等4种类型,进而识别出具有发展潜力的技术机会;最后,以无人船为例,识别了动力技术主题、姿态测量技术主题和定位导航技术主题中路径规划、传感器等持续型的技术机会和激光雷达、惯性导航等新兴型的技术机会。本文不仅弥补了静态共现网络难以揭示技术领域动态演化过程的局限性,而且避免了在创新过程中掩盖或误判一些技术机会的问题,为企业高效识别技术创新机会提供了可资借鉴的决策参考依据。

关键词: 技术机会识别, LDA模型, 共现网络, 动态分析, 无人船

Abstract:    The external environment for future technological development is characterized by high uncertainty and increasing technological complexity. It is crucial to adopt a scientific and efficient analytical approach to identify technological innovation opportunities. In the era of data science, scientific knowledge is experiencing explosive growth, making it increasingly difficult to evaluate and predict technological trends. Previous studies relying on qualitative or static analysis is no longer sufficient to accurately identify technical opportunities. In order to make informed decisions about scientific development policies, mitigate investment risks, and accurately grasp the direction of scientific development, it is necessary to increase research efforts on the scientific knowledge network and attempt to mine potential knowledge through the co-occurrence network. Therefore, this paper has constructed a path of technology opportunity identification from a dynamic analysis perspective that integrates patent text mining and keyword co-occurrence networks. Firstly, this paper obtained and preprocessed patent data, and then applied the LDA model to extract technology topics and keywords from the patent data of specific technology domains. In particularly, the LDA model operation was performed using the jieba word splitting tool in Python software and the scikit-learn library. This requires setting the parameters α and β separately and obtaining the optimal topic number K by calculating the perplexity. In the meanwhile, the TF-IDF indicator was also introduced for importance analysis. Secondly, the paper constructed the overall co-occurrence matrix of technology keywords and the co-occurrence matrix based on technology topics with time windows. Subsequently, the overall co-occurrence network of technology keywords and the keyword co-occurrence sub-network based on technology topics were generated based on the co-occurrence matrix. In particularly, this paper systematically investigated and judged the more important and influential technologies in a specific technology domain by constructing an overall co-occurrence network of technology keywords and calculating the co-occurrence intensity of technology keywords. In the meanwhile, the trends in the evolution of technical topics and keywords over time were further analyzed through co-occurrence sub-networks. This requires the generation of a dynamically changing co-occurrence sub-network using the well-established keyword lexicon and the co-occurrence matrix containing time windows. It visualized the keywords and their linkage relationships across years in the same three-dimensional coordinate system.Thirdly, high-frequency technology keywords were identified based on the overall co-occurrence network, while the evolution process of technology topics and keywords were analyzed based on the co-occurrence sub-network. And then, technology keywords were classified into four types: sustained, declining, emerging, and abandoned. Among them, sustained keywords are those with high co-occurrence intensity and a stable upward trend over time; declining keywords are those with high co-occurrence intensity but a decreasing trend over time; emerging keywords are those with low co-occurrence intensity but a stable upward trend over time; and abandoned` keywords are those with low co-occurrence intensity and a decreasing trend over time. In turn, sustained and emerging were identified from these four types as technology opportunities with growth potential. Finally, using unmanned ships as an example, the technology opportunities contained in the power supply technology topic, attitude measurement technology topic, and positioning and navigation technology topics were identified. For example, sustained technology opportunities such as path planning and sensors, emerging technology opportunities such as lidar and inertial navigation were identified. This paper not only addressed the limitations of static co-occurrence networks in revealing the dynamic evolution process of technological domains, but also avoided the problem of covering up or misjudging some technology opportunities in the innovation process. This will provide a useful decision-making reference for companies to efficiently identify technological innovation opportunities.

Key words: technology opportunity identification, LDA model, co-occurrence network, dynamic analysis, unmanned ship