Universiti Teknologi Malaysia Institutional Repository

A study on biomolecular sequence alignment using machine learning techniques

Othman, Muhamad Razib and Salim, Naomie and Abdul Jalil, Rozita and Deris, Safaai and Mat Yatim, Safie and Md. Illias, Rosli (2004) A study on biomolecular sequence alignment using machine learning techniques. Project Report. Faculty of Computer Science and Information System, Skudai, Johor. (Unpublished)

PDF (Full Text)
2687Kb

Abstract

Pairwise sequence alignment compares nucleotide or protein sequences with the aim of inferring structural, functional and evolutionary relationships. The main goal of sequence alignment is to find an optimal alignment. The most widely used methods in research, and the ones that have been shown to produce an optimal alignment, are dynamic programming methods such as Smith-Waterman for local alignment. According to previous research, the scoring schemes used in dynamic programming can be improved by using substitution matrices and by introducing gaps into the alignment with a gap penalty function. The purpose is to optimize alignment results while preserving biological concepts such as evolutionary changes in molecular structure caused by mutation. At present, no general theory guides the selection of substitution matrices and gap penalties for local sequence alignment. For this reason, this project will implement the Smith-Waterman dynamic programming method with different substitution matrices and gap penalty functions in its scoring schemes. The substitution matrices used will be BLOSUM45, BLOSUM62 and BLOSUM80. The gap penalty will be either linear, with the parameter d ranging from 1 to 10, or affine, with the gap-opening parameter d ranging from 1 to 12 and the gap-extension parameter e ranging from 1 to 5. An intensive comparison will be carried out to test efficiency and to determine effective substitution matrices and gap penalty parameters for sequence alignment. Twenty-seven sets of protein sequence data, categorized by length and percentage similarity (identity), will be used for sequence alignment. The results will provide a guideline for selecting effective substitution matrices and gap penalty parameters for sequence alignment.
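
As an illustration of the approach the abstract describes, the following is a minimal sketch of Smith-Waterman local alignment scoring with a linear gap penalty. It is not the report's implementation. The function names `smith_waterman` and `toy_score` are hypothetical, and `toy_score` (match = 2, mismatch = -1) is an assumed stand-in for a real substitution matrix such as BLOSUM62, which would be looked up per residue pair instead.

```python
def toy_score(x, y):
    """Assumed stand-in for a substitution matrix lookup (not BLOSUM)."""
    return 2 if x == y else -1


def smith_waterman(a, b, score, d):
    """Return the best local alignment score of sequences a and b.

    score(x, y) gives the substitution score for a residue pair;
    d is the (positive) linear gap penalty charged per gap position.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # H[i][j] = best score of a local alignment ending at a[i-1], b[j-1]
    H = [[0] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])  # align pair
            up = H[i - 1][j] - d      # gap in b
            left = H[i][j - 1] - d    # gap in a
            # The 0 option lets a local alignment restart anywhere.
            H[i][j] = max(0, diag, up, left)
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    # Sweep the linear gap penalty over the range named in the abstract.
    for d in range(1, 11):
        print(d, smith_waterman("ACGT", "ACT", toy_score, d))
```

The sweep at the end mirrors the kind of parameter comparison the abstract proposes: the same pair of sequences is scored under each gap penalty, so the effect of the penalty on the optimal local score can be observed directly. An affine penalty would need additional matrices to track gap opening and extension separately, which this sketch omits.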

Item Type: Monograph (Project Report)
Uncontrolled Keywords: Pairwise sequence alignment, scoring schemes, machine learning techniques
Subjects: Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4050 Electronic information resources
Divisions: Computer Science and Information System (Formerly known)
ID Code: 4400
Deposited By: Azrin Ariffin
Deposited On: 31 Oct 2007 09:38
Last Modified: 01 Jun 2010 03:17
