Science Research Management, 2022, Vol. 43, Issue (3): 183-191.


Research on identifying potential standard essential patents based on semantic features

Zhai Dongsheng1, Jin Yuanyuan1, Xu Shuo1, He Xijun1, Hu Hanqing2, Zhen Liulin1

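  1. School of Economics and Management, Beijing University of Technology, Beijing 100124, China;
  2. School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China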

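  • Received: 2021-01-09 Revised: 2021-07-30 Online: 2022-03-20 Published: 2022-03-16
  • Corresponding author: Xu Shuo
  • Funding:
    National Natural Science Foundation of China project: "Research on methods for scanning and anticipating weak signals of emerging technologies from the perspective of science-technology linkage" (72074014, 2020.12 to 2023.12).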

Identification of potential standard essential patents based on semantic features

Zhai Dongsheng1, Jin Yuanyuan1, Xu Shuo1, He Xijun1, Hu Hanqing2, Zhen Liulin1

  1. School of Economics and Management, Beijing University of Technology, Beijing 100124, China;
  2. School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China

  • Received: 2021-01-09 Revised: 2021-07-30 Online: 2022-03-20 Published: 2022-03-16

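Summary:    Potential standard essential patents (SEPs) carry very high strategic and economic value in future markets. Identifying them ahead of competitors matters for building an innovation-driven country, optimizing corporate patent portfolios, accelerating technological innovation, strengthening industry position, and avoiding patent hold-up. Research on automatically identifying potential SEPs nevertheless remains scarce. From the perspective of extracting the semantic features of SEPs, this paper proposes a Bert-CNN network model that uses context to extract both the implicit global semantic features and the high-dimensional hierarchical semantic features of known SEPs. Potential SEPs are identified from the extracted features, and the standard to which each potential SEP may correspond is predicted by computing Bert vector similarity. In the empirical part, a validation data set built from SEPs published by ETSI (the European Telecommunications Standards Institute) is used to verify the model's performance. The results show that, in experiments on large-scale patent data, the model's precision, recall, and F1 score are better than those of existing studies.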
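The three evaluation metrics named above can be illustrated with a small calculation. The following is a minimal sketch, assuming binary labels where 1 marks an SEP and 0 a non-SEP; the function name and the toy labels are hypothetical and do not come from the paper.

```python
def precision_recall_f1(y_true, y_pred):
    """Compute precision, recall and F1 for the positive class (label 1).

    y_true and y_pred are equal-length sequences of 0/1 labels.
    Labels and data here are illustrative assumptions, not the paper's data.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # SEPs correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-SEPs wrongly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # SEPs that were missed

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy example: three true SEPs followed by three non-SEPs.
example_true = [1, 1, 1, 0, 0, 0]
example_pred = [1, 1, 0, 1, 0, 0]
example_scores = precision_recall_f1(example_true, example_pred)
```

F1 is the harmonic mean of precision and recall, so it rewards a classifier only when both are high, which is why it is commonly reported alongside them.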

关键词: 潜在标准必要专利识别, Bert, CNN, 语义特征

Abstract:    Against the backdrop of innovation-led development and the strategy of rejuvenating the country through science and education, the standard essential patent (SEP), as a main carrier of technological innovation, is an important means for a country to master core industrial technology. Holding many SEPs for key technologies in advance is a concentrated expression of national innovation capability and an important way for enterprises to secure technological dominance. SEPs have extremely high strategic and economic value: by cross-licensing SEPs, enterprises can dominate an entire market, charge high license fees, and erect market barriers against competitors. Although standardization committees have disclosed many SEPs, more patents will be incorporated into standards as product technology develops. At the same time, enterprises file patents from multiple angles around the technical points of a standard and keep building patent portfolios around existing standards. Exploring potential SEPs therefore has practical significance for building an innovative country, optimizing enterprises' patent portfolios, enhancing the competitiveness of China's core enterprises, increasing license income, and avoiding business risks.
This paper proposes a model that identifies potential SEPs from the perspective of semantic features, using both the global semantic features and the higher-dimensional semantic features of patents. First, a patent sample set is constructed: data on declared standard essential patents are collected, including patent number, claims, title, abstract, and other information, and these patents are labeled as SEPs. An equal number of non-SEP patents are randomly sampled as negative samples and mixed with the SEPs to complete the sample set. Second, the Bert model extracts structured implicit global semantic features from the context of the patent claims, title, and abstract, and outputs high-dimensional semantic vectors. Third, a CNN extracts high-dimensional semantic features from the vectors output by Bert, and potential SEPs are identified from the feature extraction results. Finally, a predicted standard code is output for each potential SEP according to a vector-based semantic similarity measure.
The findings are as follows. First, when the amount of data is large, identifying potential SEPs from the semantic features of the claims is more accurate and more consistent than identifying them from the title and abstract, which indicates that the claims carry richer semantic information. Second, in the comparison with the Doc2Vec-RF model on potential SEP prediction, the larger the number of patents to be predicted, the better the prediction. For prediction of the corresponding standard, the performance of this method is nearly 10% higher than that of previous studies, and the test results are stable. Finally, when the amount of test data is varied, the experimental results fluctuate little, indicating good robustness.
The conclusions offer a reference for innovation strategy and have practical significance for management. On the one hand, the method helps patent holders analyze their own patents and exploit their development potential. On the other hand, enterprises can use the method to uncover competitors' undisclosed standard essential patents, giving early warning of potential patent hold-up by competitors and supporting market competition.
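The last step of the method, assigning a predicted standard code by vector similarity, can be sketched as a nearest-neighbour lookup. This is a minimal illustration under stated assumptions: the abstract does not name the similarity measure, so cosine similarity is assumed here; the standard codes `STD-A` and `STD-B`, the three-dimensional toy vectors, and both function names are hypothetical stand-ins for real Bert embeddings and real standards.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict_standard(patent_vec, standard_vecs):
    """Return the code of the standard whose vector is most similar to the patent.

    standard_vecs maps a standard code to its embedding. Cosine similarity is
    an assumption; the paper's abstract only says a vector similarity is used.
    """
    return max(
        standard_vecs,
        key=lambda code: cosine_similarity(patent_vec, standard_vecs[code]),
    )


# Toy vectors standing in for high-dimensional Bert outputs (hypothetical data).
toy_standards = {
    "STD-A": [0.9, 0.1, 0.1],
    "STD-B": [0.0, 1.0, 0.0],
}
toy_patent = [1.0, 0.0, 0.2]
predicted_code = predict_standard(toy_patent, toy_standards)
```

In practice the vectors would have hundreds of dimensions and the candidate set would cover every standard in the data, but the lookup itself stays the same.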

Key words: potential standard essential patent identification, Bert, CNN, semantic feature