NextBio

Khader Shameer
positions
current
Senior Research Fellow in Prof. R. Sowdhamini's Lab, National Centre for Biological Sciences - Tata Institute of Fundamental Research, Bangalore, India
education
Master of Science, Bioinformatics, Mahatma Gandhi University, Kottayam, India
Nov 2002-Dec 2004
Bachelor of Science, Physics, University of Calicut, Calicut, India
Aug 1999-Aug 2002
awards and honors
MSc Bioinformatics, Mahatma Gandhi University, Gold Medal(2004)
research summary
I am working on large scale sequence and structural analysis of therapeutically important protein families and protein structural features. I am using a variety of computational approaches including machine learning and data mining for such analysis. I am also working on the design and development of tools and database to help such large scale analysis. My other interests are in data integration, Web 2.0 technologies, semantic approaches to bioinformatics.
research projects
HARMONY : Protein Structure Validation
Protein structure validation is an important step in computational modeling and structure determination. Stereochemical assessment of protein structures examine internal parameters such as bond lengths and Ramachandran (varphi,psi) angles. Gross structure prediction methods such as inverse folding procedure and structure determination especially at low resolution can sometimes give rise to models that are incorrect due to assignment of misfolds or mistracing of electron density maps. Such errors are not reflected as strain in internal parameters. HARMONY is a procedure that examines the compatibility between the sequence and the structure of a protein by assigning scores to individual residues and their amino acid exchange patterns after considering their local environments. Local environments are described by the backbone conformation, solvent accessibility and hydrogen bonding patterns. We are now providing HARMONY through a web server such that users can submit their protein structure files and, if required, the alignment of homologous sequences. Scores are mapped on the structure for subsequent examination that is useful to also recognize regions of possible local errors in protein structures. HARMONY server is located at http://caps.ncbs.res.in/harmony/
PURE : Prediction of Unassigned Regions
Protein domains are the structural and functional units of proteins. The ability to parse proteins into different domains is important for effective classification, understanding of protein structure, function, and evolution and is hence biologically relevant. Several computational methods are available to identify domains in the sequence. Domain finding algorithms often employ stringent thresholds to recognize sequence domains. Identification of additional domains can be tedious involving intense computation and manual intervention but can lead to better understanding of overall biological function. In this context, the problem of identifying new domains in the unassigned regions of a protein sequence assumes a crucial importance. We had earlier demonstrated that accumulation of domain information of sequence homologues can substantially aid prediction of new domains. In this work, we propose a computationally intensive, multi-step bioinformatics protocol as a web server named as PURE (Prediction of Unassigned REgions in proteins) for the detailed examination of stretches of unassigned regions in proteins. Query sequence is processed using different automated filtering steps based on length, presence of coiled-coil regions, transmembrane regions, homologous sequences and percentage of secondary structure content. Later, the filtered sequence segments and their sequence homologues are fed to PSI-BLAST, cd-hit and Hmmpfam. Data from the various programs are integrated and information regarding the probable domains predicted from the sequence is reported. We have implemented PURE protocol as a web server for rapid and comprehensive analysis of unassigned regions in the proteins. This server integrates data from different programs and provides information about the domains encoded in the unassigned regions.
publications

The expressions of proteins in the cell are carefully regulated by transcription factors that interact with their downstream targets in specific signal transduction cascades. Our understanding of the regulation of functional genes responsive to stress sign…
Bioinformation. 2008

BACKGROUND: Protein domains are the structural and functional units of proteins. The ability to parse proteins into different domains is important for effective classification, understanding of protein structure, function, and evolution and is hence biolog…
BMC bioinformatics. 2008

Rapid increase in protein sequence information from genome sequencing projects demand the intervention of bioinformatics tools to recognize interesting gene-products and associated function. Often, multiple algorithms need to be employed to improve accurac…
Bioinformation. 2007

Protein structure validation is an important step in computational modeling and structure determination. Stereochemical assessment of protein structures examine internal parameters such as bond lengths and Ramachandran (varphi,psi) angles. Gross structure …
Nucleic acids research. 2006 Jul 1