文章詳目資料

Journal of Computers EIMEDLINEScopus

  • 加入收藏
  • 下載文章
篇名 Topic Analysis in LDA Based on Keywords Selection
卷期 32:4
作者 Bing-Xin DuGuo-Ying Liu
頁次 001-012
關鍵字 inter-topic distanceLatent Dirichlet Allocation model perplexitytopic coherence measuretopic modelEIMEDLINEScopus
出刊日期 202108
DOI 10.53106/199115992021083204001

中文摘要

英文摘要

The Latent Dirichlet Allocation model in text analysis has weak generalization ability and poor interpretability of the topic words. In this paper, we address these issues using a topic analysis framework for Latent Dirichlet Allocation based on keyword selection. Our proposed solution extracts the keywords from scientific research articles and builds a keywords list according to filter rules. Then several words are selected in the abstracts of the articles based on the keywords list and the LDA model is used to analyze the topics of the selected words. To evaluate the performance of our proposed approach, journal articles in the field of educational technology are selected as data sources, and two types of comparative analysis are performed. Firstly, “verb”, and “verb + noun” word selection strategies are adopted to conduct a comparative study from aspects including domain expert analysis, model perplexity, topic coherence measure, and inter-topic distance analysis. Secondly, Hierarchical Dirichlet Process, Correlated Topic Models, and LDA-Word2Vec models are used to conduct a study from model perplexity and predictive log-likelihood aspects. The experimental results confirm that the topic analysis based on the keywords selection method overperforms others in both types of comparisons.

本卷期文章目次

相關文獻