Learning from uncertain kernel in bioinformatics

Journal of Computer Science & Systems Biology

ISSN: 0974-7230

Open Access

Learning from uncertain kernel in bioinformatics

6th International Conference on Bioinformatics & Systems Biology

August 22-23, 2016 Philadelphia, USA

Sahely Bhadra

Aalto University, Finland

Posters & Accepted Abstracts: J Comput Sci Syst Biol

Abstract :

In this talk I will present the problem of designing machine learning methods when the biological data samples are contaminated with noise or there are missing values. Automatic kernelized classification techniques by studying structural or metabolic similarities among samples are popular for deriving a variety of insights, such as the ancestral links, the shared regulation or function of biochemical components. However, experimental measurements of biological processes and structures are subject to substantial noise stemming from both technical and biological variability. Again, different biological data sets provide information on different features and samples. Considering such multiple data sets for better learning poses the challenge that information for each point are not available in all data sets creating missing entries. First part of this talk is about the problem of designing SVM classifiers when the similarity measure or the kernel matrix, K, is affected by uncertainty, i.e., an expert is not fully certain about the value. In this work the uncertainty in the kernel matrix is modeled by a convex and bounded uncertain set which encompasses possible realizations of K. This new approach leads to an optimization problem which could be cast as a saddle point (minimax) problem. Due to favourable conditions satisfied by the problem, we are able to use a efficient algorithm which uses only the first order information, yet proved to decrease the initial error by a factor of O(1/T2), after T iterations. A comprehensive empirical study on real-world protein structure data sets show that the proposed formulations achieve the desired robustness to structural noise. The second part will discuss about a novel technique to complete missing rows and columns in the kernel matrix with the help of other available informations in case of multiple data sources. The proposed approach predicts missing (rows as well as columns) entries by modeling both within-view and between-view relationships among kernel values.

Biography :


Google Scholar citation report
Citations: 2279

Journal of Computer Science & Systems Biology received 2279 citations as per Google Scholar report

Journal of Computer Science & Systems Biology peer review process verified at publons

Indexed In

arrow_upward arrow_upward