文章詳目資料

圖書資訊學刊 CSSCIScopusTSSCI

  • 加入收藏
  • 下載文章
篇名 集成式人工神經網絡模型於分類實務之可行性:以社群媒體之健康消費者資訊分類為例
卷期 20:1
並列篇名 Practicability of Ensemble Artificial Neural Network Models for a Classification Task: An Optimal Approach for Reproducing Classification Practices of Health Consumers Generated Text on Social Media
作者 Sukjin YouMin Sook ParkSoohyung Joo
頁次 001-030
關鍵字 自動分類深度學習人工神經網絡集成分類模型知識組織Automated ClassificationDeep LearningArtificial Neural NetworkEnsemble Classification ModelKnowledge OrganizationTSSCI
出刊日期 202206
DOI 10.6182/jlis.202206_20(1).001

中文摘要

本文運用人工神經網絡(Artificial Neural Network, ANN)模型,再現社群媒體中健康資訊分類實務之準確性。本研究透過Yahoo!Answers健康類別之問答,提取健康資訊術語,並輔以醫學主題詞表(MeSH terms),訓練並比較數種類型的ANN模型和集成式模型的效能。研究顯示,ANN模型分類準確率約90%;其中,深度神經網絡(Deep Neural Network, DNN)與卷積神經網絡(Convolutional Neural Network, CNN)和長短期記憶模型(long short-term memory, LSTM)相比,分類表現更佳。基於分類結果的集成模型不僅優於以基於異質ANN結構的集成模型,也優於單一深度學習模型;本研究也發現問題和最佳答案的組合是最有效的訓練集,並可以建構準確的預測模型。研究結果顯示,ANN模型可有效輔助分類健康消費者以自然語言生成之線上健康資訊。

英文摘要

This paper reports the classification accuracy of artificial neural network (ANN) models in reproducing health consumers’ classification practices in social media. Social media have driven the growth of unstructured text data across domains including health, which motivates researchers to reconsider the epistemological approach to automated classification. This study compared the performance of several types of ANN models and ensemble models based on classification results and the integration of multiple ANN structures. To train these models, two dictionaries were employed: health consumers’ terms extracted from questions and answers in the health categories of Yahoo!Answers and MeSH terms. All three types of individual classifiers demonstrated accuracies of around 90%. In particular, the fully connected ANN with two layers produced relatively higher classification performances than a convolutional neural network and long short-term memory. Ensemble models based on classification results outperformed not only the ensemble models based on the integration of heterogeneous ANN structures but also individual deep-learning models. The combination of questions and best answers were found to be most effective as a training dataset to build an accurate prediction model. The findings suggest that ANN models can be an effective assistive tool in classifying online health resources generated by health consumers in natural language.

相關文獻