专利作为世界上最大的技术信息源,受到企业的日益重视。本文提出了一种基于中文分词的专利挖掘分析过程,首先进行专利信息的检索、提取和清洗,然后利用中文分词对专利名称进行关键词组的提取,细化专利名称、摘要等专利信息,最后在此基础上挖掘出专利的技术发展路线,不同技术之间的关联关系以及相似专利簇等。该过程方法在空调行业专利数据中得到了应用,有助于企业进行专利地图绘制、技术研发和专利战略实施。
Abstract
As the world's largest source of the technical information, patents have received increasing attention by the enterprise. A patent mining and analysis process is presented based on Chinese word segmentation. First, the patent information is searched, extracted, and cleaned. Then some keywords have been extracted from patent title based on Chinese word segmentation, and the patent title, abstracts, and other patent information are refined. Last, patent technology roadmaps, the association between different technologies, similar patent cluster, etc. have been mined based on the previous research. This process and method has been applied to the patent of air-conditioning industry. It helps the enterprise in the patent mapping, technical development, and implementation of patent strategy.
关键词
专利挖掘 /
中文分词 /
关键词 /
空调
Key words
patent mining /
Chinese word segmentation /
keyword /
air conditioning
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 邱洪华,余翔. 基于k-means聚类算法的专利地图制作方法研究[J]. 科研管理,2009,30(2):70-76.Qiu Hong-hua, Yu Xiang. Research on a method for building up a patent map based on k-means clustering algorithm[J]. Science Research Management, 2009,30(2):70-76.
[2] Yuen-Hsien Tseng, Chi-Jen Lin b, Yu-I Lin. Text mining techniques for patent analysis[J]. Information Processing and Management, 2007,43(5): 1216-1247.
[3] 叶作亮,高千惠,陈国海. 专利的相关性检索与集成应用研究[J]. 科研管理,2009,30(5):40-46.Ye Zuo-liang,Gao Qian-hui, Chen Guo-hai. Research on the correlative search of patent and its integrative applications[J]. Science Research Management, 2009,30(5):40-46.
[4] 暴海龙,李金林. 专利技术关联性分析方法研究[J]. 科研管理,2004,25(增):3-8.Bao Hai-long,Li Jin-lin. Study of patent technology association[J]. Science Research Management, 2004,25:3-8.
[5] 陈国海.企业专利管理分析系统中若干关键技术的研究和实现 . 浙江大学, 2007.Chen Guo-hai. Research on several essential technologies of enterprises patent management & analysis system . Zhejiang university, 2007.
[6] 王永红. 定量专利分析的样本选取与数据清洗[J]. 情报理论与实践,2007,30(1):93-96.Wang Yong-hong. Sample selection and data cleansing for quantitative analysis of patents[J]. Information Studies: Theory & Application,2007,30(1):93-96.
[7] 李保利,陈玉忠,俞士汶. 信息抽取研究综述[J]. 计算机工程与应用,2003,39(10):1-5.LI Bao-Li, CHEN Yu-Zhong, YU Shi-Wen. Research on information extraction:a survey[J]. Computer Engineering and Applications,2003,39(10):1-5.
基金
国家自然科学基金项目(60974083);国家高技术研究发展专项(2009AA04Z146)。