- Open Access
Theoretical study of the Usutu virus helicase 3D structure, by means of computer-aided homology modelling
Theoretical Biology and Medical Modellingvolume 6, Article number: 9 (2009)
Usutu virus belongs to the Flaviviridae viral family and constitutes an important pathogen. The viral helicase is an ideal target for inhibitor design, since this enzyme is essential for the survival, proliferation and transmission of the virus.
Towards a drug-design approach, the 3D model of the Usutu virus helicase structure has been designed, using conventional homology modelling techniques and the known 3D-structure of the Murray Valley Encephalitis virus helicase, of the same viral family, as template. The model was then subjected to extended molecular dynamics simulations in a periodic box, filled with explicit water molecules for 10 nanoseconds. The reliability of the model was confirmed by obtaining acceptable scores from a variety of in silico scoring tools, including Procheck and Verify3D.
The 3D model of the Usutu virus helicase exhibits in silico all known structural characteristics of the Flaviviridae viral family helicase enzymes and could provide the platform for further de novo structure-based design of novel anti-Usutu agents.
The viral family Flaviviridae comprises the genera Flavivirus, Pestivirus and Hepacivirus and includes numerous important human and animal pathogens . The small, enveloped virions of the different members of the Flaviviridae family contain a single-stranded, positive-sense RNA genome of about 9.5–12.5 kb. The genome consists of a single, long open reading frame (ORF), which is flanked by untranslated regions (UTRs) at the 5' and 3' ends.
Recent studies on sub-genomic Pestivirus and Flavivirus RNA replicons have revealed that the non-structural (NS) proteins, which are encoded by the C-terminal part of the polyprotein, play a crucial role in viral RNA replication [2, 3]. Accordingly, these proteins are assumed to form replication complexes in conjunction with genomic RNA and possibly with other cellular factors.
Sequence alignments of the Usutu viral helicase identified several conserved sequence motifs that are important for biological functions. So far, the crystal structures of helicases from various RNA viruses have been determined, including the helicases from hepatitis C virus, Dengue virus, Yellow Fever Virus and Kunjin virus .
Usutu virus is the cause of one of the most important arthropod-borne viral (arbovirus) diseases, transmitted mainly by the Culex pipiens mosquito . Usutu virus was first detected in Africa, and four decades later it emerged in Austria. Usutu virus is closely related to the more common West Nile virus, Japanese Encephalitis virus and Yellow Fever virus. Usutu virus caused mass mortalities of birds, especially blackbirds (Turdus merula) and great grey owls (Strix nebulosa) initially in Austria, but then spread to other central European countries such as Hungary, Switzerland, and Northern Italy (in chronological order) .
In the present work, the three-dimensional structure of the helicase enzyme of Usutu virus was modelled by applying homology modelling techniques and using the crystal structure of the Murray Valley Encephalitis virus helicase of the Flaviviridae family as template.
The amino acid sequence of Usutu viral helicase was obtained from the GenBank database (accession no.: NC_006551, entry name: Usutu virus, complete genome). Using the Gapped-BLAST  through NCBI . the homologous structure of Murray Valley Encephalitis virus helicase was identified, which was used as template for the homology modelling of the Usutu viral helicase. The sequence alignment was done using the online version of ClustalW .
Secondary structure prediction
Secondary structure predictions were performed using the NPS (Network Protein Sequence Analysis) web-server http://npsa-pbil.ibcp.fr and the GeneSilico MetaServer http://genesilico.pl/ in order to confirm that the chosen Murray Valley Encephalitis virus helicase template is the most appropriate one .
The homology modelling of the Usutu viral helicase was carried out using the MOE (Molecular Operating Environment, version: 2004.03) package and its built-in homology modelling application . The RCSB entry 2V80, which corresponds to the crystal structure of the Murray Valley Encephalitis virus helicase was used as template. The sequence alignment between the raw sequence of the Usutu and the sequence of the Murray Valley Encephalitis virus template revealed almost 90% identity, which allows for a conventional homology modelling approach to be considered. The homology model method of MOE comprises the following steps: First an initial partial geometry specification, where an initial partial geometry for each target sequence is copied from regions of one or more template chains. Secondly, the insertions and deletions task, where residues that still have no assigned backbone coordinates are modelled. Those residues may be in loops (insertions in the model with respect to the template), they may be outgaps (residues in a model sequence which are aligned before the C-terminus or after the N-terminus of its template) or may be deletions (regions where the template has an insertion with respect to the model). For this study though outgaps have not been included in the homology modelling process. Third step is the loop selection and sidechain packing, where a collection of independent models is created. Last step is the final model selection and refinement one, where the final models are scored and ranked, after they have been stereochemically tested and evaluated with the built-in module "Protein Geometry" for errors.
Molecular electrostatic potential (MEP)
Electrostatic potential surfaces were calculated by solving the nonlinear Poisson-Boltzmann equation using finite difference method as implemented in the PyMOL Software . The potential was calculated on grid points per side (65, 65, 65) and the grid fill by solute parameter was set to 80%. The dielectric constants of the solvent and the solute were set to 80.0 and 2.0, respectively. An ionic exclusion radius of 2.0Å, a solvent radius of 1.4Å and a solvent ionic strength of 0.145 M were applied. Amber99  charges and atomic radii were used for this calculation.
Energy minimisation was done in MOE initially using the Amber99  forcefield implemented into the same package, up to a RMSd gradient of 0.0001 to remove the geometrical strain. The model was subsequently solvated with SPC water using the truncated octahedron box extending to 7Å from the model and molecular dynamics were performed after that at 300 K, 1 atm with 2 fsecond step size and for a total of ten nanoseconds, using the NVT ensemble in a canonical environment. NVT stands for Number of atoms, Volume and Temperature that remain constant throughout the calculation. The results of the molecular dynamics simulation were collected into a database by MOE and can be further analysed.
The produced models were initially evaluated within the MOE package by a residue packing quality function, which depends on the number of buried non-polar side chain groups and on hydrogen bonding. Moreover, the suite PROCHECK  was employed to further evaluate the quality of the produced Usutu virus helicase model. Finally, Verify3D [15, 16] was used to evaluate whether the model of Usutu virus helicase is similar to known protein structures.
Results and discussion
The NS3 domain of Flaviviridae contains both the protease and the helicase coding regions. For this study, the Usutu virus helicase sequence was aligned with the Murray Valley Encephalitis virus helicase template sequence and all the major helicase motifs, characteristic of the Flaviviridae family were found to be conserved (Figure 1).
The Murray Valley Encephalitis virus helicase structure has been established by X-ray crystallography at 1.9 Å resolution . The Murray Valley Encephalitis virus helicase is an appropriate template because the virus belongs to the same viral family (Flaviviridae). Among the available helicase structureswithin this family, the Murray Valley Encephalitis virus enzyme presented the highest identity percentage to the Usutu virus helicase. That is also supported by the analysis of the helicase structures available for the whole Flaviviridae family that revealed that the overall 3D structure for the Murray Valley Encephalitis virus helicase ishighly conserved [17, 18]. Moreover using a fold recognition approach (FR), the Murray Valley Encephalitis virus helicase was confirmed as the most appropriate model. The secondary structure prediction for the Usutu virus helicase turned out to be very similar to the actual structure of the Murray Valley Encephalitis virus helicase. Methods of protein fold recognition attempt to detect similarities between protein 3D structure that are not accompanied by any significant sequence similarity. There are many approaches, but the unifying theme is to try and find folds that are compatible with a particular sequence. Unlike sequence-only comparison, these methods take advantage of the extra information made available by 3D structure information. In effect, the turn the protein folding problem on it's head: rather than predicting how a sequence will fold, they predict how well a fold will fit a sequence [19–22].
The model was firstly structurally superimposed and subsequently compared to its template, where exhibited an alpha-carbon RMSD less than 0,65 angstroms. Furthermore it was evaluated with MOE and PROCHECK for its geometry [see additional file 1] and then subjected to the Verify3D algorithm for a more thorough evaluation. Verify3D assessed the compatibility of the 3D model of Usutu virus helicase with its own amino acid sequence. Based on location and environment, a structural class is assigned for each residue. A collection of reference structures is used as a control in order to calculate a score for each residue. The Usutu virus helicase model ranged from +0.32 to +0.84. That confirmed that the model is of high quality, since it is Verify3D scores below +0.1 that are indicative of serious problems in the model . A more important criterion was the compliance of the model with the known unique characteristics of the helicases that belong to the Flaviviridae family of viruses. The Murray Valley Encephalitis virus, Hepatitis C virus, Dengue virus, West Nile virus and Usutu virus helicases all belong to the superfamily II of helicases and share seven common motifs within their domains, all of which have been structurally conserved on the Usutu virus helicase model .
Description of the Usutu virus helicase model
As expected from the sequence alignment (Figure 1) and the secondary structure prediction (data not shown), the Usutu virus helicase model exhibited the structural features of known Flaviviridae helicases. Namely, the three distinct domains of helicases as well as the various motifs (as reported by Kim  – Figure 1) were structurally conserved in the Usutu virus helicase model (Figures 2 and 3). One of the most crucial motifs in Flaviviridae helicases is the GxGKT/S motif in domain 1, which is conserved to the same loop in kinases. It is a Walker A motif and its role involves binding of the β-phosphate of ATP . Site directed mutagenesis studies of that motif have reported that the mutant protein is inactive. Furthermore another crucial motif for the helicase is the DExH motif. The DExH motif is responsible for the binding of the Mg2+-ATP substrate. Studies in adenylate and thymidine kinases revealed that an aspartate (Asp170) binds the Mg2+ and helps to establish the optimum orientation of ATP for nucleophilic attack . Finally, another crucial motif is the QRxGRxGR motif. The role of this motif is exceptionally crucial to the Flaviviridae helicase function as it is involved in nucleic acid binding [27–29].
The model shared a very similar topology (shape and size) to its template (Figures 2 and 3). The Flaviviridae helicases, in general, have three domains in total, which are separated by two channels. The first and third domains are interacting much more together than they do with domain two. Domain two is supposed to undergo significant movements compared to the other two domains, during the unwinding of double-stranded nucleic acids. It is the channel between domain 3 and 1–2 that accommodates the ssRNA during the viral unwinding process. RNA binds to the helicase at the arginine-rich site of the 2nd domain. The ATP and ssRNA sites were found to have been conserved on the Usutu virus helicase model (Figures 4 and 5) .
ssRNA – ATP substrates and MD simulations
In order to evaluate the substrate binding site, the model was subjected to energy minimization and molecular dynamics (MD) simulations in the presence of the helicase substrates. The coordinates of the ssRNA and ADP/Mn++ were transferred to the model from the Hepatitis C (1A1V) and the Dengue virus helicase structure (2JLS) respectively, for this purpose. Invariant residues of various motifs in the vicinity of either substrate in the Murray Valley Encephalitis virus template structure were conserved structurally in the Usutu virus helicase model (Figures 2 and 4). Most interactions between the ssRNA fragment and the Usutu helicase enzyme are established with the backbone of the ssRNA fragment, which is typical for non-specific protein – nucleic acid interactions. The bases in the middle of the ssRNA do not seem to interact much with the receptor (Figure 2D). The receptor's contacts emerge mostly from domains one and two of the Usutu helicase and most specifically, by loops between secondary structure elements of the latter domains. More detailed (i.e. per residue) comparison of the ssRNA interaction pattern between the Usutu virus helicase model and the Murray Valley Encephalitis virus helicase structure has been drawn using LigPlot, which is a built-in module of MOE, and is shown in Figure 2D.
The 10 ns MD simulations revealed that soon the Usutu virus helicase model reached a conformational equilibrium similar to that of Murray Valley Encephalitis virus helicase structure (Figure 5B and 5C). Taken together these observations illustrated the viability of the homology modelling of the Usutu virus helicase model. To evaluate further the Usutu virus helicase model, the latter was compared with its template structure by calculating the root mean square deviations (RMSd) between equivalent atoms for the full MD course. Large RMSd values are indicative of systems of poor quality. The Cα RMSd of the Usutu virus helicase model from the equivalent domains of the template structures was less than 0.65Ǻ. The low RMSd value indicated that this model remains conformationally close to the template structure upon the minimization and the molecular dynamics simulation course that followed, reflecting its high quality.
Molecular surface analysis
In order to analyze the molecular surface of the produced Usutu virus helicase model, the electrostatic potential surface was calculated. For direct comparison with the template structures used in this study, electrostatic potential surfaces were also calculated for the Murray Valley Encephalitis virus helicase (Figure 6). The two helicases exhibited almost identical electrostatic surfaces, sharing common features such as a negatively charged ssRNA entrance to the helicase tunnel. This observation verified the validity of the model, which was found to share similar electrostatic surface to its X-ray crystal structure template.
The 3D model of the Usutu virus helicase was designed using the homologous X-ray crystal structure of the Murray Valley Encephalitis viral helicase as template. The model was successfully evaluated both in terms of its geometry, fold recognition and compliance to the criteria required as a member to the Flaviviridae viral family. It is therefore proposed that the Usutu virus helicase model will be suitable for further in silico structure-based de novo drug design experiments. These computer-based methodologies are now becoming integral part of the drug discovery process that may eventually lead to the development of potential inhibitor structures of the Usutu virus helicase enzyme in the future.
Calisher CH, Gould EA: Taxonomy of the virus family Flaviviridae. Adv Virus Res. 2003, 59: 1-19.
Behrens S-E, Grassmann CW, Thiel H-J, Meyers G, Tautz N: Characterization of an autonomous subgenomic pestivirus RNA replicon. Journal of Virology. 1998, 72: 2364-2372.
Khromykh AA, Westaway EG: Subgenomic replicons of the flavivirus Kunjin: construction and applications. Journal of Virology. 1997, 71: 1497-1505.
Wu J, Bera AK, Kuhn RJ, Smith JL: Structure of the flavivirus helicase: implications for catalytic activity, protein interactions, and proteolytic processing. J Virol. 2005, 79: 10268-10277.
Bakonyi T, Erdélyi K, Ursu K, Ferenczi E, Csörgo T, Lussy H, Chvala S, Bukovsky C, Meister T, Weissenböck H, Nowotny N: Emergence of Usutu virus in Hungary. J Clin Microbiol. 2007, 45: 3870-3874.
Bakonyi T, Gould EA, Kolodziejek J, Weissenböck H, Nowotny N: Complete genome analysis and molecular characterization of Usutu virus that emerged in Austria in 2001: comparison with the South African strain SAAR-1776 and other flaviviruses. Virology. 2004, 328 (2): 301-310.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402.
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic Acids Res. 2007, 35: 21-25.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.
Kurowski MA, Bujnicki JM: GeneSilico protein structure prediction meta-server. Nucleic Acids Res. 2003, 31: 3305-3307.
MOE, Chemical Computing Group, 1010 Sherbrooke St. West, Suite 910, Montreal, Canada, H3A 2R.
Wang J, Cieplak P, Kollman PA: How well does a restrained electrostatic potential (resp) model perform in calculating conformational energies of organic and biological molecules. J Comp Chem. 2000, 21: 1049-1074.
Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM: AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR. 1996, 8: 477-486.
Eisenberg D, Lüthy R, Bowie JU: VERIFY3D: assessment of protein models with three-dimensional profiles. Methods Enzymol. 1997, 277: 396-404.
Lüthy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature. 1992, 356: 83-85.
Mancini EJ, Assenberg R, Verma A, Walter TS, Tuma R, Grimes JM, Owens RJ, Stuart DI: Structure of the Murray Valley encephalitis virus RNA helicase at 1.9 Angstrom resolution. Protein Sci. 2007, 16 (10): 2294-300.
Sampath A, Padmanabhan R: Molecular targets for flavivirus drug discovery. Antiviral Res. 2009, 81 (1): 6-15.
Wodak SJ, Rooman MJ: Generating and testing protein folds. Current Opinion in Structural Biology. 1993, 3: 247-259.
Jones D, Thornton J: Protein fold recognition. Journal of Computer Aided Molecular Design. 1993, 7: 439-456.
Bowie JU, Eisenberg D: Inverted protein structure prediction. Current Opinion in Structural Biology. 1993, 3: 437-444.
Lemer C, Rooman MJ, Wodak SJ: Protein Structure Prediction By Threading Methods: Evaluation Of Current Techniques. Proteins. 1996, 23 (3): 337-355.
Bujnicki Janusz, Rychlewski Leszek, Fischer Daniel: Fold-recognition detects an error in the Protein Data Bank. Bioinformatics. 2002, 18 (10): 1391-1395.
Kim JL, Morgenstern KA, Griffith JP, Dwyer MD, Thomson JA, Murcko MA, Lin C, Caron PR: Hepatitis C virus NS3 RNA helicase domain with a bound oligonucleotide: the crystal structure provides insights into the mode of unwinding. Structure. 1998, 6: 89-100.
Saraste M, Sibbald PR, Wittinghofer A: The P-loop – a common motif in ATP- and GTP-binding proteins. Trends Biochem Sci. 1990, 15: 430-434.
Ruff M, Krishnaswamy S, Boeglin M, Poterszman A, Mitschler A, Podjarny A, Rees B, Thierry JC, Moras D: Class II aminoacyl transfer RNA synthetases: crystal structure of yeast aspartyl-tRNA synthetase complexed with tRNA(Asp). Science. 1991, 252: 1682-1689.
Lohman TM, Bjornson KP: Mechanisms of helicase-catalyzed DNA unwinding. Annu Rev Biochem. 1996, 65: 169-214.
Pause A, Sonenberg N: Mutational analysis of a DEAD box RNA helicase: the mammalian translation initiation factor elF-4A. EMBO J. 1992, 11: 2643-2654.
Gross CH, Shuman S: The QRxGRxGRxxxG motif of the vaccinia virus DExH box RNA helicase NPH-II is required for ATP hydrolysis and RNA unwinding but not for RNA binding. J Virol. 1996, 70: 1706-1713.
Luo D, Xu T, Watson RP, Scherer-Becker D, Sampath A, Jahnke W, Yeong SS, Wang CH, Lim SP, Strongin A, Vasudevan SG, Lescar J: Insights Into RNA Unwinding and ATP Hydrolysis by the Flavivirus Ns3 Protein. EMBO J. 2008, 27: 3209-3219.
The author declares that they have no competing interests.
DV is the sole author of this paper and is responsible for developing the concepts and for writing and revising the manuscript.
Electronic supplementary material
Additional file 1: Extended Procheck Results for both the Usutu virus helicase model and the Murray Valley Encephalitis virus helicase template. Extended Procheck Results for both the Usutu virus helicase model (LEFT COLUMN) and the Murray Valley Encephalitis virus helicase template (RIGHT COLUMN, X-ray structure: 2V80). A: Ramachandran plot, B: Bond Length Plot, C: Bond Angles plot, D: Dihedrals plot, E: Rotamers plot and F: Contact Energies Plot. It is therefore concluded that the Usutu virus helicase model has inherited all structural characteristics of its Murray Valley Encephalitis virus helicase template. (JPEG 270 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.