In this research, the development of a `concept-clumping algorithm' designed to improve the clustering of technical concepts is demonstrated . The algorithm developed first identifies a list of technically relevant noun phrases from a cleaned extracted list and then applies a rule-based algorithm for identifying synonymous terms based on shared words in each term. An assessment of the algorithm found that the algorithm has an 89—91% precision rate, was successful in moving technically important terms higher in the term frequency list, and improved the technical specificity of term clusters.
Journal of Information Science
Trumbach, CC., Payne, D., (2007) “Identifying Synonymous Terms in Preparation for Technology Mining” Journal of Information Science, 33(6), pp 660-677.