Exploring evolution of brain genes involved in microcephaly through phylogeny and synteny analysis
© Rauf and Mir; licensee BioMed Central Ltd. 2013
Received: 8 June 2013
Accepted: 16 October 2013
Published: 22 October 2013
Human brain development is a complicated process. When the normal growth and development of the brain or central nervous system is impaired, neurodevelopmental disorders (NDDs) result. Autosomal Recessive Primary Microcephaly (MCPH) is one such disorder, for which seven loci (MCPH1-MCPH7) with the corresponding genes (MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, CENPJ, and STIL) have been reported so far. An important field of study is the diversity among organisms that arises through evolution. How species are related to each other can be inferred by establishing evolutionary relationships between organisms in the form of ancestors and descendants.
MEGA5 was used for phylogenetic tree reconstruction. Pairwise and multiple alignments were built with the ClustalW algorithm. The neighbor-joining (NJ) and maximum parsimony (MP) methods were used for tree reconstruction. Bootstrap analysis was performed to check the reliability of the trees. Synteny analysis was performed using the Ensembl synteny view in the Ensembl database and the genome synteny viewer (GSV).
Evolutionary time for the single-gene trees showed that CENPJ (0.02) is evolving rapidly, while CDK5RAP2 (0.1) is evolving at the lowest rate compared with the other genes. All trees reconciled with the species divergence time. Chimpanzee was inferred to be the closest species to Human. In the combined MCPH tree, five duplications were observed: four before and one after the divergence of vertebrates and invertebrates. Two genes, MCPH1 and WDR62, were closely related to each other. Synteny analysis indicated that Human shows maximum conservation with Chimpanzee. Highly conserved synteny with no deletion was observed between Human and Chimpanzee for CENPJ.
We hypothesize that, because of this close relationship, mutations may affect Chimpanzee in the same way as they affect Human. Conservation shows that, apart from sequence similarity, the function of MCPH genes is also the same in closely related species; mutation disrupts this function and leads to the diseased state. The large volume of genomic and proteomic data available today enables in silico analysis. Our cost- and time-effective analysis offers insights into understanding the disease and should provide a route towards accurate diagnosis.
Evolution is the change that leads to diversity. This diversity can arise at any biological level, including species and organisms, and also at the molecular level, i.e. DNA and proteins. An important kind of evolutionary study is the reconstruction of phylogenetic trees, which estimates the evolutionary relationships between organisms. Trees can be reconstructed from genetic sequence data using many different techniques. The relationships are represented as a branching, tree-like diagram showing ancestors and their evolved descendants.
Exhaustive-search methods, which examine all possible trees or a large number of them and then select the best one according to a certain criterion or threshold; examples are the maximum-parsimony (MP) method, the Fitch-Margoliash (FM) method, the maximum-likelihood (ML) method and the Bayesian approach.
Stepwise clustering methods, which construct the best tree step by step after examining the local topological relationships of a tree; an example of this category is the neighbor-joining (NJ) method.
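To give a sense of why examining all possible trees quickly becomes impractical, the short sketch below counts the distinct unrooted bifurcating topologies for a given number of taxa, using the standard double-factorial formula (2n-5)!!. The function name is ours and is purely illustrative; it is not part of MEGA5 or of the analysis reported here.

```python
def count_unrooted_trees(n_taxa: int) -> int:
    """Number of distinct unrooted bifurcating tree topologies for n_taxa labelled taxa.

    Uses the standard formula (2n - 5)!! = 3 * 5 * ... * (2n - 5), valid for n >= 3.
    """
    if n_taxa < 3:
        raise ValueError("at least three taxa are needed for an unrooted tree")
    count = 1
    # Multiply the odd factors 3, 5, ..., 2n - 5 (empty product for n = 3).
    for factor in range(3, 2 * n_taxa - 4, 2):
        count *= factor
    return count


if __name__ == "__main__":
    for n in (4, 5, 10):
        print(n, count_unrooted_trees(n))
```

The count grows from 3 topologies for four taxa to 2,027,025 for ten taxa, which is why a true exhaustive search is feasible only for small data sets.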
NJ seems to be a method of choice because it performs well in obtaining the correct tree. Under the assumption of a constant rate of nucleotide substitution, the ML method proves slightly inferior to NJ, but it is slightly better than the other two methods (MP and FM) when the evolutionary rate varies drastically among branches. The neighbor-joining method (a distance method) reconstructs the phylogenetic tree from evolutionary distance data. It works by finding neighbors, i.e. pairs of operational taxonomic units (OTUs), and joining them into a cluster. Maximum parsimony is another widely used method for phylogenetic tree reconstruction; it is a character-based method that works directly on the sequences.
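The pair-finding step of neighbor joining can be illustrated with a minimal sketch. It assumes the usual formulation of the method, in which the pair of OTUs minimizing Q(i, j) = (n - 2) d(i, j) - r_i - r_j is joined, where r_i is the sum of distances from OTU i to all others. The function name and the toy five-taxon distance matrix are illustrative assumptions; they are not the MCPH data and not the MEGA5 implementation.

```python
from itertools import combinations


def nj_pick_pair(dist):
    """One neighbor-joining step: choose the pair of OTUs to join.

    dist is a symmetric matrix (list of lists) of pairwise evolutionary distances.
    Returns ((i, j), (len_i, len_j)): the indices of the chosen neighbors and the
    branch lengths from each of them to the new internal node.
    """
    n = len(dist)
    if n < 3:
        raise ValueError("neighbor joining needs at least three OTUs")
    row_sums = [sum(row) for row in dist]

    best_pair, best_q = None, None
    for i, j in combinations(range(n), 2):
        # Q criterion: small values mark pairs close to each other
        # and far from the rest of the OTUs.
        q = (n - 2) * dist[i][j] - row_sums[i] - row_sums[j]
        if best_q is None or q < best_q:
            best_pair, best_q = (i, j), q

    i, j = best_pair
    len_i = 0.5 * dist[i][j] + (row_sums[i] - row_sums[j]) / (2 * (n - 2))
    len_j = dist[i][j] - len_i
    return best_pair, (len_i, len_j)


if __name__ == "__main__":
    # Hypothetical distances between five OTUs (a, b, c, d, e).
    toy = [
        [0, 5, 9, 9, 8],
        [5, 0, 10, 10, 9],
        [9, 10, 0, 8, 7],
        [9, 10, 8, 0, 3],
        [8, 9, 7, 3, 0],
    ]
    print(nj_pick_pair(toy))
```

In a full reconstruction this step is repeated: the joined pair is replaced by a new node, the distance matrix is updated, and the procedure continues until the tree is fully resolved.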
An alteration in one or more genes or chromosomes is the root cause of any genetic disease. Individuals born of a consanguineous union have homozygous segments in their genomes, because they inherit identical ancestral genomic segments through both parents. One consequence is an increased incidence of recessive diseases within these sibships. An important example of such diseases is Autosomal Recessive Primary Microcephaly (MCPH), a rare neurodevelopmental disorder, or neurogenic mitosis disorder. During embryonic neurogenesis, a reduced number of cerebral cortical neurons is generated. For this reason the brain of an MCPH patient is reduced to 1/3 of its normal volume.
Seven loci (MCPH1-MCPH7) with the corresponding genes (MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, CENPJ, and STIL) have been discovered so far in different world populations. It has been proposed that the disease phenotype can be produced by mutations in any of the MCPH genes. Mutations in ASPM and WDR62 account for more than 50% of MCPH worldwide. WDR62 has been identified as the second most common causative gene of MCPH (after ASPM).
Computational approaches aim to enhance the understanding of biological mechanisms, with a primary focus on creating and applying computationally intensive techniques. Phylogenetic analysis and synteny analysis are two of the most important types of analysis in this discipline. The current study applies both to the seven reported Human MCPH genes in order to determine evolutionary relationships and conservation, respectively, with respect to selected ortholog species.
Materials and methods
Synteny analysis was performed using the Ensembl synteny view in the Ensembl database, and the visual analysis of conserved regions was carried out using the web-based genome synteny viewer GSV. Only four ortholog species of Human were considered for this analysis.
The neighbor-joining tree for MCPH1 is shown in Figure 3. This tree was reconstructed after deleting three sequences (Opossum, Frog and Platypus) from the original tree; these sequences did not agree with the time of divergence and were therefore removed. The reconstructed tree (Figure 3) reconciles with the species divergence time, except for Mouse, which shows immediate divergence from Human. According to the tree, Human and Chimpanzee form one cluster with a bootstrap value of 99, while Macaque is close to the Human/Chimpanzee cluster with a bootstrap value of 100. The invertebrates Ciona intestinalis and Fruitfly form the outgroup in this tree. The evolutionary time for the tree is 0.05.
The tree was initially constructed using fourteen ortholog species of Human. Two orthologs, Anole Lizard and Opossum, have divergent sequences compared with the rest of the species and were therefore excluded. Chicken, Frog, Guinea Pig and Mouse did not agree with the time of divergence; these four orthologs were deleted and the tree was reconstructed. The reconstructed tree is shown in Figure 4 and reconciles with the species divergence time. According to this tree, Human is closely related to the Macaque/Chimpanzee cluster, with a bootstrap value of 100. The invertebrates Ciona intestinalis and Fruitfly form one cluster with a bootstrap value of 100. The evolutionary value of the tree is 0.05.
The neighbor-joining tree for CDK5RAP2 is shown in Figure 5. Frog and Chicken were removed from the initial tree because they did not agree with the time of divergence, and the tree was reconstructed after deleting these two orthologs. The reconstructed tree (Figure 5) shows the same Human/Chimpanzee cluster, with a bootstrap value of 91. The evolutionary time for the tree is 0.1. According to the tree, the invertebrates Ciona intestinalis and Fruitfly form one cluster and serve as the outgroup. Zebrafish and Fugu form one cluster with a bootstrap value of 99. Similarly, Opossum and Platypus form a cluster with a bootstrap value of 97.
In the initial tree, Frog, Platypus, Mouse and Guinea Pig did not agree with the time of divergence and were therefore deleted. The tree reconstructed after deleting these sequences is shown in Figure 6 and reconciles with the species divergence time. According to this tree, the invertebrates Ciona intestinalis and Fruitfly form the outgroup. Human and Chimpanzee form one cluster (sharing a common ancestor) with a bootstrap value of 99, indicating the reliability of this cluster. Macaque and the ancestor of Human/Chimpanzee descend from the same ancestor; the Macaque branch has a bootstrap value of 100 and is closely related to the Human/Chimpanzee cluster. Zebrafish and Fugu form one cluster with a bootstrap value of 100. Similarly, Chicken/Anole Lizard and Dog/Megabat form clusters with bootstrap values of 74 and 88, respectively. The evolutionary time for the tree is 0.05.
Anole Lizard, Fruitfly, Frog and Mouse were deleted from the initially constructed tree because they did not agree with the time of divergence. The tree reconstructed after deleting these four orthologs is shown in Figure 7 and has a rate of evolution of 0.05. In this tree, Human clusters with Macaque instead of Chimpanzee. The Chimpanzee branch has a bootstrap value of 100 and is close to the Human/Macaque cluster. Opossum and Platypus form a cluster with a bootstrap value of 87.
Two orthologs, Chicken and Macaque, have divergent sequences compared with the rest of the species and were therefore excluded. Megabat was deleted because it did not agree with the time of divergence. The tree reconstructed after these deletions is shown in Figure 8 and has an evolutionary time of 0.02. According to this tree, the invertebrates Ciona intestinalis and Fruitfly form the outgroup. Human clusters with Chimpanzee with a bootstrap value of 100, indicating the reliability of the cluster; in the original tree the Human/Chimpanzee cluster had a bootstrap value of 99. Zebrafish/Fugu and Guinea Pig/Mouse form clusters with bootstrap values of 100 and 75, respectively.
Frog, Platypus, Guinea Pig and Mouse were deleted from the initial tree, and the tree reconstructed after deleting these four orthologs is shown in Figure 9. The rate of evolution is 0.05. According to this tree, the invertebrates Ciona intestinalis and Fruitfly form one cluster and serve as the outgroup. Zebrafish and Fugu form a cluster with a bootstrap value of 92. In this tree, Human and Chimpanzee form one cluster with a bootstrap value of 97.
Combined tree for seven MCPH genes
An overall tree for all seven genes, constructed with the MP method, is shown in Additional file 1. In this tree, ASPM Ciona intestinalis and CEP152 Ciona intestinalis fall in one cluster, which indicates two copies of the same gene; hence one copy, ASPM Ciona intestinalis, was deleted. All STIL sequences were deleted because they appear too divergent. The sequences MCPH1 Opossum, MCPH1 Fruitfly, CEP152 Fruitfly, CENPJ Megabat, CENPJ Opossum, CENPJ Platypus, CENPJ Chicken, CDK5RAP2 Fruitfly, and CDK5RAP2 Ciona intestinalis did not agree with the time of divergence and were therefore also deleted from the tree.
We reconstructed the tree after removing all of the sequences mentioned above. The resulting tree, which reconciles with the species divergence time, is given in Additional file 2.
Genome synteny analysis
To find genomic elements that are functionally conserved, we identified sets of genomic features (genes or loci) that are conserved in the same relative order on a set of homologous chromosomes (of Human and its four orthologs). We studied the conservation of 15 Human genes (both upstream and downstream of the seven MCPH genes) against the genes of the four orthologs. The data were collected from the Ensembl synteny view in the Ensembl database, and a summary is given in Additional file 3.
According to Additional file 4, two deletions relative to Human are common to all four orthologs in the case of MCPH1, namely DEFA6 and SPAG11B. In the case of CEP152, only one deletion is common to the four orthologs, namely RP11-90J19.1, while no common deletions were found for the remaining five genes. All seven MCPH genes (MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, CENPJ, and STIL) are present in the four ortholog species (Chimpanzee, Mouse, Dog and Chicken), except WDR62, which is deleted in Chicken only. This shows the importance of the MCPH genes in these species.
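The kind of comparison described here can be sketched in a few lines. The example below assumes that the gene order around a focal gene has already been exported as simple lists, one per species; the function names, the placeholder gene labels (G1, G2, ...) and the species orders are all hypothetical and do not reproduce the Ensembl data or the contents of the additional files.

```python
def summarize_synteny(reference_order, ortholog_order):
    """Compare gene order in a reference region with one ortholog species.

    Both arguments are lists of gene names (assumed unique within each list).
    Returns the reference genes found in the ortholog, those missing from it,
    and whether the shared genes keep the same relative order (an inverted
    block, i.e. fully reversed order, is also treated as collinear).
    """
    ortholog_pos = {gene: k for k, gene in enumerate(ortholog_order)}
    conserved = [g for g in reference_order if g in ortholog_pos]
    missing = [g for g in reference_order if g not in ortholog_pos]
    positions = [ortholog_pos[g] for g in conserved]
    collinear = (positions == sorted(positions)
                 or positions == sorted(positions, reverse=True))
    return {"conserved": conserved, "missing": missing, "collinear": collinear}


def common_deletions(reference_order, orthologs):
    """Reference genes that are absent from every ortholog species."""
    missing_sets = [
        set(summarize_synteny(reference_order, order)["missing"])
        for order in orthologs.values()
    ]
    return set.intersection(*missing_sets) if missing_sets else set()


if __name__ == "__main__":
    # Hypothetical neighborhood of a focal gene in the reference species.
    reference = ["G1", "G2", "FOCAL", "G3", "G4"]
    orthologs = {
        "species_A": ["G1", "G2", "FOCAL", "G3", "G4"],  # fully conserved
        "species_B": ["G4", "G3", "FOCAL", "G1"],        # inverted, G2 missing
        "species_C": ["G1", "G3", "FOCAL", "G4"],        # rearranged, G2 missing
    }
    for name, order in orthologs.items():
        print(name, summarize_synteny(reference, order))
    print(common_deletions(reference, orthologs))
```

A gene counts as a common deletion only if it is absent from every ortholog considered, which is why adding a fully conserved species to the comparison empties the result in this toy example.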
An important research area in computational biology is phylogenetic analysis, which aims to study and estimate the evolutionary relationships between organisms. Evolution is the change that leads to diversity. This diversity can arise at any biological level, including species and organisms, and also at the molecular level, i.e. DNA and proteins.
Trees for the seven MCPH genes (MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, CENPJ and STIL) were constructed with the NJ method and show the evolutionary relationships between Human and its orthologs. From these relationships we determined how closely each species is related to, or how far it has deviated from, Human. The rate of evolution for the constructed trees showed that CENPJ (0.02) is evolving rapidly compared with the rest of the genes. CDK5RAP2, with the maximum evolutionary rate value (0.1), is evolving at the lowest rate compared with the other MCPH genes. All MCPH trees reconcile with the species divergence time. The bootstrap values in all trees helped to validate the clusters and clearly indicate their reliability. In WDR62, Human is closely related to the Macaque/Chimpanzee cluster (bootstrap value of 100). Similarly, Human clusters with Chimpanzee in MCPH1, CDK5RAP2, CEP152, CENPJ and STIL with bootstrap values of 99, 91, 99, 100, and 97, respectively, while it clusters with Macaque in ASPM with a bootstrap value of 50. The Chimpanzee branch has a bootstrap value of 100 in the ASPM tree and is close to the Human/Macaque cluster. The function of the MCPH genes in the Human ortholog species is the same as that of the Human MCPH genes, and this function remains intact in these species unless a mutation occurs. The only difference lies in the gene sequences (which also lead to phenotypic changes), and our results demonstrate the level of this difference. Our results show how close each ortholog species is to the query (Human) with respect to each MCPH gene. For every MCPH gene, the ortholog species in or near the Human cluster were the most similar to Human (with fewer sequence differences) compared with those positioned further from Human in the tree. In the combined tree of the MCPH genes, five duplications were observed, dividing the ancestral gene into descendant genes.
The two genes MCPH1 and WDR62 were found to be closely related; they evolved last, as a result of the fifth duplication, and fall in one cluster. Four duplications were observed before the divergence of vertebrates and invertebrates, and only one duplication, the first, took place after it.
The syntenic relationships for all MCPH genes indicated that Human shows maximum conservation with Chimpanzee in six genes (MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, and CENPJ) and with Mouse in the case of STIL. Highly conserved synteny with no deletion was observed between Human and Chimpanzee for CENPJ.
The current study shows that CENPJ is evolving rapidly compared with the other genes. CDK5RAP2 has the maximum evolutionary rate value, which leads us to hypothesize that it is evolving at the lowest rate compared with the others. In WDR62, Human is closely related to the Macaque/Chimpanzee cluster. Similarly, Human clusters with Chimpanzee in MCPH1, CDK5RAP2, CEP152, CENPJ and STIL, while it clusters with Macaque in ASPM. Chimpanzee was found to be the closest species to Human in our analysis, since most genes place the two in one cluster and hence indicate a direct relationship between these ortholog species. According to our understanding, there are five duplication events in the tree: four take place before the divergence of vertebrates and invertebrates and only one after it. The duplication events showed that MCPH1 and WDR62 are closely related to each other and evolved last compared with the other genes. According to the synteny analysis, Human shows maximum conservation with Chimpanzee in MCPH1, WDR62, CDK5RAP2, CEP152, ASPM, and CENPJ, and with Mouse in the case of STIL.
From our present results we hypothesize that, because of this close relationship, mutations may affect Chimpanzee (the closest Human relative according to our results) in the same way as they affect Human and may lead to microcephaly. The results also show that the microcephaly genes in the closest relative species (Human/Chimpanzee) have maximum sequence similarity and share a close syntenic relationship. Conservation shows that, apart from sequence similarity, the function of MCPH genes is also the same in closely related species; mutation disrupts this function and leads to the diseased state.
We are thankful to the Department of Bioinformatics and Biotechnology, International Islamic University, Islamabad, for providing us with a working platform. We are also thankful for the online tools, databases and software that helped bring this research to a conclusion.
- Hall KB, Hallgrímsson B: Strickberger’s evolution. 2008, Jones and Bartlett Publishers
- Brinkman LSF, Leipe DD: Phylogenetic Analysis. Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins. Volume 43. 2002, 323-358. 2
- Saitou N, Imanishi T: Relative efficiencies of the Fitch-Margoliash, maximum-parsimony, maximum-likelihood, minimum-evolution, and neighbor-joining methods of phylogenetic tree construction in obtaining the correct tree. Mol Biol Evol. 1989, 6: 514-525.
- Fitch MW: On the problem of discovering the most parsimonious tree. American Naturalist. 1977, 111: 223-257.
- Fitch MW, Margoliash E: Construction of phylogenetic trees. Science. 1967, 155: 279-284.
- Felsenstein J: Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol. 1981, 17: 368-376.
- Holder M, Lewis PO: Phylogeny estimation: traditional and Bayesian approaches. Nat Rev Genet. 2003, 4: 275-84.
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.
- Mount WD: Maximum Parsimony Method for Phylogenetic Prediction. 2008, Cold Spring Harbor Protocols
- Woods GC, Cox J, Springel K, Hampshire JD, Mohamed DM, McKibbin M, Stern R, Raymond LF, Sandford R, Sharif MS, Karbani G, Ahmed M, Bond J, Clayton D, Inglehearn FC: Quantification of homozygosity in consanguineous individuals with autosomal recessive disease. Am J Hum Genet. 2006, 78: 889-896.
- Mahmood S, Ahmad W, Hassan JM: Autosomal recessive primary microcephaly (MCPH): clinical manifestations, genetic heterogeneity and mutation continuum. Orphanet J Rare Dis. 2011, 6: 39.
- Nicholas KA, Khurshid M, Desir J, Carvalho PO, Cox JJ, Thornton G, Kausar R, Ansar M, Ahmad W, Verloes A, Passemard S, Misson PJ, Lindsay S, Gergely F, Dobyns BW, Roberts E, Abramowicz M, Woods GC: WDR62 is associated with the spindle pole and is mutated in human microcephaly. Nat Genet. 2010, 42: 1010-1014.
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739.
- Efron B, Tibshirani JR: An introduction to the bootstrap. 1993, CRC
- Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden LT: NCBI BLAST: a better web interface. Nucleic Acids Res. 2008, 36: W5-W9.
- Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T, Durbin R, Eyras E, Gilbert J, Hammond M, Huminiecki L, Kasprzyk A, Lehvaslaiho H, Lijnzaad P, Melsopp C, Mongin E, Pettett R, Pocock M, Potter S, Rust A, Schmidt E, Searle S, Slater G, Smith J, Spooner W, Stabenau A, Stalker J, Stupka E, Ureta-Vidal A, Vastrik I, Clamp M: The Ensembl genome database project. Nucleic Acids Res. 2002, 30: 38-41.
- Revanna VK, Chiu CC, Bierschank E, Dong Q: GSV: a web-based genome synteny viewer for customized data. BMC Bioinformatics. 2011, 12: 316.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.