Science Research Management ›› 2015, Vol. 36 ›› Issue (3): 111-117.

Previous Articles     Next Articles

Patent content analysis method based on LDA topic model

Wang Bo, Liu Shengbo, Ding Kun, Liu Zeyuan   

  1. WISE Lab, Dalian University of Technology, Dalian 116024, Liaoning, China
  • Received:2013-10-12 Revised:2014-06-19 Online:2015-03-25 Published:2015-03-20

Abstract: Topic model is an effective modeling method for extracting implied themes in large-scale text. In this paper, Latent Dirichlet Allocation(LDA) topic model is introduced to patent content analysis for patent theme extraction, which solves the problems in previous patent subject classification, such problems as too rough classifications, lack of time-effectiveness and scientific nature, etc. And then, based on the original LDA model, an extended institution-topic model is developed in this paper. By joint modeling of patent subject and object, the internal relationships between patent themes and corresponding institutions is identified. Finally, a case study is carried out in the LTE technology of the telecommunication industry, and it is found out that these models can be used effectively for patent subject classification and identifying competition situation of patent subjects under each corresponding theme.

Key words: topic model (LDA), LDA institution-topic model, patent content analysis, LTE

CLC Number: