Untitled Document
主页   |   SRS   |   最新消息   |   镜像站点 |   在线教程   |   关于CBI  |   招生信息  
国外站点
  NCBI
  EBI
  APBionet
  More>>

国内站点
 上海生物信息中心
 华大基因组信息中心
 天大生物信息中心
 计算所生物信息中心
 复旦理论生物中心
  More>> new!

友情链接
  国家教育科研网
 国家学术文献系统
 教育部网上合作中心
 北京大学
 北大图书馆
 北大耶鲁合作中心

搜索引擎
  Google
  北大天网
 
 

A Class of Edit Kernels for SVMs to Predict Translation Initiation Sites in Eukaryotic mRNAs

 

Dr Tao Jiang
Dept of Computer Science
UC Riverside

Room 610, CBI
2:00PM, Tuesday, 6 July 2004


The prediction of translation initiation sites (TISs)
in eukaryotic mRNAs has been a challenging problem in computational molecular biology. In this paper, we present a new algorithm to recognize TISs with a very high accuracy. Our algorithm includes two novel ideas. First, we introduce a class of new sequence-similarity kernels based on string edit, called the edit kernels, for use with support vector machines
(SVMs) in a discriminative approach to predict TISs. SVMs are among the most powerful and popular machine learning techniques. The edit kernels are simple and have significant biological and probabilistic interpretations. Second, we convert the region of an input mRNA sequence downstream to a putative TIS into an amino acid sequence before applying SVMs to avoid the high redundancy in the genetic code. The algorithm has been implemented and tested on previously published data. Our experimental results on real mRNA data show that both ideas improve the prediction accuracy greatly and our method performs significantly better than those based on neural networks and SVMs with polynomial kernels or Salzberg kernel.

This is joint work with Haifeng Li.

 
 

©北大生物信息中心 版权所有。意见建议请与cbi@pku.edu.cn联系!
211工程公共服务体系CERNET高速地区网和重点学科信息服务体系建设分子生物信息资源系统