科研管理 ›› 2011, Vol. 32 ›› Issue (7): 138-142.

• 论文 • 上一篇    下一篇

基于中文分词的专利挖掘分析方法研究

徐河杭1, 顾新建1, 陈国海1, 王海军2, 张玉梅2   

  1. 1. 浙江大学现代制造工程研究所,浙江 杭州 310027;
    2. 海尔集团公司,山东 青岛 266103
  • 收稿日期:2009-10-27 修回日期:2010-04-21 出版日期:2011-07-27 发布日期:2011-07-14
  • 作者简介:徐河杭(1981-),女,浙江余姚人,博士生,从事网络化制造和数据挖掘研究。
  • 基金资助:

    国家自然科学基金项目(60974083);国家高技术研究发展专项(2009AA04Z146)。

The patent mining analysis method based on Chinese word segmentation

Xu Hehang1, Gu Xinjian1, Chen Guohai1, Wang Haijun2, Zhang Yumei2   

  1. 1. Institute of Manufacturing Engineering, Zhejiang University, Hangzhou 310027,China;
    2. Haier Enterprise,Qingdao 266103,China
  • Received:2009-10-27 Revised:2010-04-21 Online:2011-07-27 Published:2011-07-14

摘要: 专利作为世界上最大的技术信息源,受到企业的日益重视。本文提出了一种基于中文分词的专利挖掘分析过程,首先进行专利信息的检索、提取和清洗,然后利用中文分词对专利名称进行关键词组的提取,细化专利名称、摘要等专利信息,最后在此基础上挖掘出专利的技术发展路线,不同技术之间的关联关系以及相似专利簇等。该过程方法在空调行业专利数据中得到了应用,有助于企业进行专利地图绘制、技术研发和专利战略实施。

关键词: 专利挖掘, 中文分词, 关键词, 空调

Abstract: As the world's largest source of the technical information, patents have received increasing attention by the enterprise. A patent mining and analysis process is presented based on Chinese word segmentation. First, the patent information is searched, extracted, and cleaned. Then some keywords have been extracted from patent title based on Chinese word segmentation, and the patent title, abstracts, and other patent information are refined. Last, patent technology roadmaps, the association between different technologies, similar patent cluster, etc. have been mined based on the previous research. This process and method has been applied to the patent of air-conditioning industry. It helps the enterprise in the patent mapping, technical development, and implementation of patent strategy.

Key words: patent mining, Chinese word segmentation, keyword, air conditioning

中图分类号: