文章詳目資料

International Journal of Applied Science and Engineering Scopus

  • 加入收藏
  • 下載文章
篇名 Prediction of RNA Polymerase Binding Sites Using Purine-Pyrimidine Encoding and Hybrid Learning Methods
卷期 2:2
作者 Cheng-Jian LinChun-Cheng PengChi-Yung Lee
頁次 177-188
關鍵字 E. colipromoter predictionpurine-pyrimidineexpectation maximization algorithmlearning vector quantization networksScopus
出刊日期 200407

中文摘要

英文摘要

Escherichia coli (E. coli) K12 was sequenced in 1997. The 4,639,221-base pair DNA sequence consists of 4288 annotated protein-coding genes, 38 percent of which have no attributed function. One of the major problems in predicting prokaryotic promoters is locating the spacers between the -35 box and -10 box and between the -10 box and transcription start site. In this paper, we use the adopted expectation maximization (EM) algorithm to accurately find the localizations of the promoter regions. A brand new purine-pyrimidine encoding method is proposed to reduce the dimensions of the training data. The heavy demand on systems for both computation and memory space can then be avoided through the choice of coding factor. The most representative features are used for training learning vector quantization networks. The simulation results of the proposed coding approach reveal that the precision of promoter prediction using the proposed approach is approximately the same as the precision using the traditional encoding method.

相關文獻