ebook img

Protein Sequences Classification : HMM based Kernels PDF

23 Pages·2014·0.47 MB·English
by  
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Protein Sequences Classification : HMM based Kernels

Marginalized kernels for biological sequences Koji Tsuda, Taishin Kin and Kiyoshi Asai AIST,2-41-6AomiKoto-ku,Tokyo,Japan Presented by Shihai Zhao May 8, 2014 KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 1/23 Overview 1 Introduction kernel functions hidden Markov model 2 Methods new kernel connections to the Fisher kernel 3 Results and Conclusion KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 2/23 kernel functions In kernel methods such as S.V.M., a kernel function should be determined a priori. Supervised learning Objective function is clear. Kernels are designed to optimize the function. Unsupervised learning The choice of kernel is subjective. It is determined to reflect the user’s notion of similarity. KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 3/23 kernel functions for sequences Texts Count features, which represent the number of each symbol contained in a sequence Biological sequences Count does not work ’out of the box’ primary due to frequent context change KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 4/23 A DNA sequence with hidden context information. Suppose the hidden variable (’h’) indicates coding/noncoding regions. KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 5/23 New way to design a kernel Visible Joint & Marginalized HMM Hidden KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 6/23 HMM A hidden Markov model (HMM) is a statistical Markov model in which the system being modeled is assumed to be a Markov process with unobserved states. KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 7/23 Example of HMM A limited number of sequences whose structures are known. We want to train the four HMMs of secondary structures to make the prediction Helix Sheet Turn Other KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 8/23 block diagram KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 9/23 HMMs of secondary structures Combined HMM for prediction KojiTsuda,TaishinKinandKiyoshiAsai Marginalizedkernels May8,2014 10/23

Description:
Objective function is clear. Kernels are designed to . 84 amino acid sequences from 5 genera in Actinobacteria. The number of sequences in each
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.