Analysing Drug Targets Using Ligand Similarity

Analysing Drug Targets Using Ligand Similarity

Advisor: 

Arzucan Ozgur

Assigned to: 

Hakime Ozturk

Type: 

Year: 

2014

Status: 

Summary:

Analysis of the interactions between target proteins and drugs is crucial not only for drug discovery, but also for a better understanding of the possible evolutionary pressure that the drugs exert on the proteins. Based on the hypothesis that similar proteins bind to similar ligands, ligand similarity is utilized with two different approaches. We first introduce ligand-centric network models to analyse the relationships of protein family members via the drugs that they bind to. We build three different types of networks in which the proteins are represented as nodes, and two proteins are connected by an edge with a weight that depends on the number of shared identical or similar ligands. As a test case, we focus on Beta-lactamases and Penicillin-Binding Proteins. The use of ligand sharing information to cluster proteins results in modules comprising proteins both with sequence and functional similarity. Consideration of ligand similarity not only enhances the clustering of the target proteins, but also highlights some interactions that were not detected in the identical ligand network. In the second part, we follow a machine learning approach for predicting protein-ligand interactions using Support Vector Machines (SVM) where we focus on comparing different ligand similarity kernels. For this task, a larger data set of GPCR and ion channels is examined. Among the 15 different ligand kernels we experiment with, LINGO based TF-IDF cosine similarity achieves a 0.009 better AUC score than the widely used 2D Fingerprint Tanimoto model on the GPCR data set.

Özet:

Analysis of the interactions between target proteins and drugs is crucial not only for drug discovery, but also for a better understanding of the possible evolutionary pressure that the drugs exert on the proteins. Based on the hypothesis that similar proteins bind to similar ligands, ligand similarity is utilized with two different approaches. We first introduce ligand-centric network models to analyse the relationships of protein family members via the drugs that they bind to. We build three different types of networks in which the proteins are represented as nodes, and two proteins are connected by an edge with a weight that depends on the number of shared identical or similar ligands. As a test case, we focus on Beta-lactamases and Penicillin-Binding Proteins. The use of ligand sharing information to cluster proteins results in modules comprising proteins both with sequence and functional similarity. Consideration of ligand similarity not only enhances the clustering of the target proteins, but also highlights some interactions that were not detected in the identical ligand network. In the second part, we follow a machine learning approach for predicting protein-ligand interactions using Support Vector Machines (SVM) where we focus on comparing different ligand similarity kernels. For this task, a larger data set of GPCR and ion channels is examined. Among the 15 different ligand kernels we experiment with, LINGO based TF-IDF cosine similarity achieves a 0.009 better AUC score than the widely used 2D Fingerprint Tanimoto model on the GPCR data set.

Bize Ulaşın

Bilgisayar Mühendisliği Bölümü, Boğaziçi Üniversitesi,
34342 Bebek, İstanbul, Türkiye

  • Telefon: +90 212 359 45 23/24
  • Faks: +90 212 2872461
 

Bizi takip edin

Sosyal Medya hesaplarımızı izleyerek bölümdeki gelişmeleri takip edebilirsiniz