Universiti Teknologi Malaysia Institutional Repository

Neural network for prediction of cysteine disulphide bridge connectivity in proteins

Bostan, Hamed (2010) Neural network for prediction of cysteine disulphide bridge connectivity in proteins. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems.


Abstract

The goal of this thesis is to develop a computational method, based on machine learning techniques, for predicting the disulphide-bonding state of cysteine residues in proteins, a sub-problem of the larger and still unsolved problem of protein structure prediction. First, we preprocessed the datasets from the Protein Data Bank (PDB) and filtered out mutations and low-resolution files. We then studied a number of descriptors in two-dimensional (2D) protein sequences. These descriptors are based on local feature values of the amino acids adjacent to each cysteine residue, namely the encoded, propensity value and averaged propensity value descriptors. We used an Artificial Neural Network (ANN) as the machine learning technique for our prediction method. The network was trained with the ‘trainlm’, ‘trainrp’ and ‘trainscg’ training functions, and 5-fold validation was implemented. Our results show that the state of cysteine disulphide bond formation can be predicted. The propensity value descriptor combined with the ‘trainscg’ training function performed better for cysteine bond state prediction than the other training functions and descriptors in this study. The prediction accuracy was 80.85% on a propensity value descriptor dataset trained with ‘trainscg’ on more than 400 thousand protein patterns. The results of this work have direct implications for site-directed mutational studies of proteins, protein engineering and the problem of protein folding.
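
The descriptor and validation steps described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the window size (`flank`), the padding symbol, the one-hot reading of the "encoded" descriptor, and the contiguous fold layout are all assumptions made for illustration, and the function names are hypothetical.

```python
from typing import List, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def cysteine_windows(sequence: str, flank: int = 3, pad: str = "-") -> List[Tuple[int, str]]:
    """Return (position, window) for every cysteine in the sequence.

    Each window holds the cysteine plus `flank` residues on either side.
    The window size and the padding symbol are assumptions, not thesis values.
    """
    padded = pad * flank + sequence + pad * flank
    windows = []
    for i, residue in enumerate(sequence):
        if residue == "C":
            # Position i in the sequence sits at i + flank in the padded string.
            windows.append((i, padded[i:i + 2 * flank + 1]))
    return windows


def one_hot(window: str) -> List[int]:
    """Encode each window position as a 20-element indicator vector.

    Padding or unknown symbols become all-zero blocks. This is one possible
    reading of an "encoded" descriptor, assumed here for illustration.
    """
    vector: List[int] = []
    for residue in window:
        vector.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vector


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split sample indices into k contiguous folds of (train, test) index lists."""
    fold_sizes = [n_samples // k + (1 if f < n_samples % k else 0) for f in range(k)]
    folds = []
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_samples) if i < start or i >= start + size]
        folds.append((train, test))
        start += size
    return folds
```

Under these assumptions, each cysteine in a sequence yields one input pattern: `cysteine_windows` extracts the local context, `one_hot` turns it into a numeric vector for the network, and `k_fold_indices` provides the five train/test partitions over the resulting patterns.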

Item Type: Thesis (Masters)
Additional Information: Supervisor: Prof. Dr. Naomie Salim; Thesis (Sarjana Sains (Sains Komputer)) - Universiti Teknologi Malaysia, 2010
Uncontrolled Keywords: neural networks, computer science
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science and Information System (Formerly known)
ID Code: 18275
Deposited By: Kamariah Mohamed Jong
Deposited On: 16 Jan 2012 08:48
Last Modified: 16 Jan 2012 08:52