Open Access
IMGT/Collier-de-Perles: a two-dimensional visualization tool for amino acid domain sequences
Theoretical Biology and Medical Modelling, volume 10, Article number: 14 (2013)
IMGT/Collier-de-Perles is a tool that allows the user to analyze and draw two-dimensional graphical representations (or IMGT Colliers de Perles) of protein domains (e.g., hydropathy plots). IMGT/Collier-de-Perles specializes in immunoglobulins (IG) or antibodies, T cell receptors (TR) and major histocompatibility (MH) proteins of human and other vertebrate species, as well as other proteins of the immunoglobulin superfamily (IgSF) and of the major histocompatibility superfamily (MhSF) and related proteins of the immune system of vertebrates and invertebrates.
Amino acids can be defined and classified in a number of ways, depending on the perspective from which they are examined. For example, they can be categorized according to the functional groups of their side chains, which determine their physicochemical characteristics.
Proteins, which are made of amino acids, are a structural component of all living organisms, so a method or tool that can seamlessly manipulate amino acid sequence data is of great practical value. Indeed, scientists have been using computational tools that enable them to compare and examine amino acid sequences in a number of ways.
Among the different classes of amino acid properties, hydrophobicity determines how strongly an amino acid is attracted to or repelled by water. A series of different hydrophobicity scales has been developed [2–8]. The higher the index value in a scale, the more hydrophobic the amino acid. Differences between the scales mainly depend on the method or the algorithm used to measure or define hydrophobicity [6, 9–12]. Hydrophobicity scales are commonly used to predict the leader region (or signal peptide) or the transmembrane region of proteins. When hydrophobicity is measured along the consecutive amino acids of a protein, fluctuations in value can indicate hydrophobic regions potentially located inside the membrane lipid layer or contributing to the hydrophobic core of a protein. Hydropathy and other amino acid properties are key to a better understanding of protein interactions and domain structures.
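As an illustration of how a hydrophobicity scale is applied along a sequence, the following minimal sketch computes a sliding-window hydropathy profile using the Kyte and Doolittle index values [4]. The function name `hydropathy_profile` and the default window of 9 residues are assumptions made for this example; this is not how IMGT/Collier-de-Perles itself is implemented.

```python
# Kyte-Doolittle hydropathy index: higher values are more hydrophobic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of every full window along the sequence.

    `window` is a hypothetical default; the appropriate size depends on
    the feature being sought (longer windows are typically used for
    transmembrane regions).
    """
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    values = [KYTE_DOOLITTLE[aa] for aa in sequence.upper()]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


if __name__ == "__main__":
    # Arbitrary example sequence, not taken from any database.
    profile = hydropathy_profile("MKTLLILAVVAAALAQESRDNK", window=9)
    for start, mean in enumerate(profile, start=1):
        print(f"window starting at residue {start}: {mean:+.2f}")
```

A stretch of consecutive windows with high mean values suggests a hydrophobic region; the threshold used to call such a region depends on the scale and window chosen.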
The IMGT/Collier-de-Perles tool
The IMGT/Collier-de-Perles tool was created by LIGM (Université Montpellier 2, CNRS) and is part of IMGT®, the international ImMunoGeneTics information system® [15, 16] (IMGT®, http://www.imgt.org), which is acknowledged as the global reference in immunogenetics and immunoinformatics.
IMGT/Collier-de-Perles can provide, upon selection, three types of display: the hydropathy plot with 3 classes (hydrophobic, neutral, hydrophilic), the volume plot with 5 classes, and the physicochemical plot with 11 IMGT physicochemical classes, which is the most informative one [1, 17] (Figure 1A). The eleven IMGT physicochemical classes of the 20 common amino acids are defined by the physicochemical properties of their side chains, taking into account hydropathy, volume and chemical characteristics. These standardized classes are used in the IMGT/Collier-de-Perles tool.
IMGT Colliers de Perles can currently be drawn for three domain types: the variable (V) domain and the constant (C) domain of immunoglobulins (IG) or antibodies, of T cell receptors (TR) and of immunoglobulin superfamily (IgSF) proteins other than IG and TR, and the groove (G) domain of the major histocompatibility (MH) proteins and of MH superfamily (MhSF) proteins other than MH [18–21]. For an IMGT Collier de Perles to be created, each sequence has to be gapped according to the IMGT unique numbering [22–25], using IMGT/DomainGapAlign [26–28].
IMGT/DomainGapAlign creates gaps in the user’s V, C or G domain amino acid sequence by aligning it to the corresponding IMGT domain reference directory. For an IG or TR V domain, it identifies the closest germline V-REGION and J-REGION; for all other cases (V domain of IgSF other than IG or TR, C domain and G domain), it identifies the closest V, C or G domain of the reference gene and/or allele. The IMGT Collier de Perles is then obtained from the gapped sequence. Amino acids which differ from the closest reference sequence are highlighted in the IMGT Collier de Perles (pink border, online), and the IMGT amino acid change characteristics are detailed in accompanying tables [26–28].
The resulting IMGT Colliers de Perles (Figure 1B) help determine which amino acids are important for the 3D structural configuration and, for the IG and TR V domain, delineate the standardized framework regions (FR-IMGT) (formed by the nine antiparallel beta strands) and complementarity determining regions (CDR-IMGT) (formed by the three loops binding the antigen). The lengths of the strands, loops and turns in IMGT Colliers de Perles provide critical information for the characterization of each V, C or G domain [21, 25].
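The 3-class hydropathy and 5-class volume displays amount to a lookup from each amino acid to a class. The sketch below shows one way to express that lookup. The class memberships are written from memory of the IMGT Aide-mémoire on amino acids [1] and are an assumption of this example: they should be checked against the IMGT web site before any real use. The names `HYDROPATHY_CLASSES`, `VOLUME_CLASSES` and `classify` are hypothetical.

```python
# ASSUMPTION: class memberships recalled from the IMGT Aide-memoire;
# verify against http://www.imgt.org before relying on them.
HYDROPATHY_CLASSES = {
    "hydrophobic": "ACFILMVW",
    "neutral": "GHPSTY",
    "hydrophilic": "DEKNQR",
}

VOLUME_CLASSES = {
    "very small": "AGS",
    "small": "CDNPT",
    "medium": "EHQV",
    "large": "IKLMR",
    "very large": "FWY",
}


def invert(classes):
    """Turn {class name: member letters} into {letter: class name}."""
    return {aa: name for name, members in classes.items() for aa in members}


def classify(sequence, classes):
    """Return the class name of each residue of an ungapped sequence."""
    lookup = invert(classes)
    return [lookup[aa] for aa in sequence.upper()]


if __name__ == "__main__":
    for aa, hydro, vol in zip(
        "QVQLVE",
        classify("QVQLVE", HYDROPATHY_CLASSES),
        classify("QVQLVE", VOLUME_CLASSES),
    ):
        print(aa, hydro, vol)
```

A drawing tool would then map each class name to a colour; the 11 physicochemical classes could be handled with a third table of the same shape.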
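Once the user sequence and its closest reference are gapped to the same numbering, finding the residues to highlight reduces to a position-by-position comparison. The following sketch illustrates that idea only. It assumes both sequences are already gapped with "." and have the same length, so that the 1-based string index stands in for the position number; the function name and the example strings are hypothetical and are not IMGT/DomainGapAlign output.

```python
def differences_from_reference(user, reference, gap="."):
    """List (position, reference_aa, user_aa) for substituted residues.

    Both sequences must be gapped to the same length. Positions where
    either sequence has a gap are skipped, so insertions and deletions
    are not reported by this simplified comparison.
    """
    if len(user) != len(reference):
        raise ValueError("sequences must be gapped to the same length")
    changes = []
    for position, (ref_aa, user_aa) in enumerate(zip(reference, user), start=1):
        if ref_aa == gap or user_aa == gap:
            continue
        if ref_aa != user_aa:
            changes.append((position, ref_aa, user_aa))
    return changes


if __name__ == "__main__":
    # Short made-up fragments, for illustration only.
    reference = "EVQL.QS"
    user = "QVQL.ES"
    for position, ref_aa, user_aa in differences_from_reference(user, reference):
        print(f"position {position}: {ref_aa} in reference, {user_aa} in user sequence")
```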
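The delineation of FR-IMGT and CDR-IMGT follows directly from the position numbers of a gapped V domain. The sketch below splits a gapped sequence into regions and reports the three CDR-IMGT lengths. It assumes the standard V domain delimitations of the IMGT unique numbering (FR1-IMGT 1-26, CDR1-IMGT 27-38, FR2-IMGT 39-55, CDR2-IMGT 56-65, FR3-IMGT 66-104, CDR3-IMGT 105-117, FR4-IMGT 118-128), an input of exactly 128 characters gapped with ".", and no additional positions for long CDR3-IMGT loops. The function names are hypothetical.

```python
# V domain regions as (name, first position, last position), 1-based inclusive.
V_DOMAIN_REGIONS = [
    ("FR1-IMGT", 1, 26),
    ("CDR1-IMGT", 27, 38),
    ("FR2-IMGT", 39, 55),
    ("CDR2-IMGT", 56, 65),
    ("FR3-IMGT", 66, 104),
    ("CDR3-IMGT", 105, 117),
    ("FR4-IMGT", 118, 128),
]


def split_v_domain(gapped, gap="."):
    """Return {region name: ungapped residues} for a 128-position V domain."""
    if len(gapped) != 128:
        raise ValueError("expected a sequence gapped to 128 positions")
    return {
        name: gapped[start - 1:end].replace(gap, "")
        for name, start, end in V_DOMAIN_REGIONS
    }


def cdr_lengths(gapped):
    """Return the three CDR-IMGT lengths in bracket notation, e.g. [8.8.8]."""
    regions = split_v_domain(gapped)
    names = ("CDR1-IMGT", "CDR2-IMGT", "CDR3-IMGT")
    return "[" + ".".join(str(len(regions[name])) for name in names) + "]"


if __name__ == "__main__":
    # Synthetic sequence built region by region; not a real V domain.
    synthetic = (
        "A" * 26 + "GFTF....SSYA" + "M" * 17 + "ISGS..GGST"
        + "Y" * 39 + "ARDY.....FDYV" + "G" * 11
    )
    print(cdr_lengths(synthetic))
```

Handling CDR3-IMGT loops longer than 13 residues would require a position-aware input (for example a list of position labels) rather than a fixed-length string.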
How can we use IMGT Colliers de Perles?
The first among numerous features of the IMGT Collier de Perles concerns the way in which the domains of an antibody or T cell receptor are characterized. Each domain is described by the lengths of its strands, loops and turns, and of its helix (for the G domain) [21, 25]. As a result, the usual but confusing distinction made in the literature and in generalist databases between the C1, C2 and I1 sets, which is often inappropriate in the absence of structural data, can be ignored.
Unlike generalist databases such as UniProt/Swiss-Prot, IMGT standardization defines the different domains by comparing amino acid or cDNA sequences with genomic sequences, thereby identifying the splicing sites and giving a more accurate delimitation of the existing domains.
Another strong feature of the IMGT Collier de Perles is that it produces a standardized graphical representation in which the differences between protein domains can be visualized and localized, whatever the species and even when 3D data are unavailable. This makes it a valuable tool for molecular engineers. Antibody humanization, in particular, benefits greatly from an interface that seamlessly displays and compares FR-IMGT and CDR-IMGT among several species [30–33].
IMGT Collier de Perles can also be used to compare a given amino acid sequence against an IMGT reference sequence, in order to facilitate the identification of potential immunogenic residues at certain positions of humanized antibodies or to assess the immunogenicity of therapeutic antibodies (Figure 1B). The reference sequence is in essence a statistical profile created for the sets of human IG heavy, kappa and lambda expressed variable domains. Like the physicochemical plots, it is based on the 11 IMGT amino acid physicochemical classes (which take into account the hydropathy, volume and chemical characteristics of the 20 common amino acids).
Besides being a 2D visualization tool, IMGT/Collier-de-Perles can also take advantage of 3D structures when those are available [26, 34], by displaying IMGT Colliers de Perles on two layers with hydrogen bonds between amino acids of V or C domains of the antibody (Figure 1C). In Figure 1C, the FR-IMGT, made up of 9 strands (arrows) and turns, is in green, and the 3 CDR-IMGT are in red, orange and purple, respectively (http://www.imgt.org, The IMGT Biotechnology page, Antibody engineering, FR-IMGT and CDR-IMGT).
Additional information such as atom contact types and categories can be provided for each amino acid separately, by clicking on each amino acid in the IMGT Collier de Perles.
Based on the IMGT unique numbering concept generated from the NUMEROTATION axiom of IMGT-ONTOLOGY, the IMGT/Collier-de-Perles tool is a user-friendly tool which allows users to create visual representations of their own amino acid sequences for V, C and G domains [21, 25]. The tool can be used on its own or as an output functionality of IMGT/DomainGapAlign [26–28]. It has also been integrated in IMGT/3Dstructure-DB (http://www.imgt.org), the IMGT three-dimensional (3D) structure database [26, 34]. Users can thus compare their own IMGT Colliers de Perles with those provided in the database for the analysis of interactions, e.g., those of the V domains of IG and TR in complex with their antigen (query on IG/Ag and TR/pMH) or those of the C or G domains of the FcR in complex with the IG Fc. IMGT Colliers de Perles are of great help in understanding the relations between sequences and structures and their functional properties (specificity, affinity, immunogenicity, allotype expression, etc.) in the design of therapeutic monoclonal antibodies (antibody engineering and humanization) [30, 31, 33], and more generally in characterizing the V, C and G domains of the proteins belonging to the IgSF and MhSF superfamilies of all vertebrates and invertebrates.
IMGT classes of the 20 common amino acids. IMGT Education> IMGT Aide-mémoire> Amino acids. http://www.imgt.org (11 November 2012, date last accessed)
Janin J: Surface and inside volumes in a globular protein. Nature. 1979, 277: 491-492. 10.1038/277491a0.
Wolfenden R, Andersson L, Cullis P: Affinities of amino acid side chains for solvent water. Biochemistry. 1981, 20: 849-855. 10.1021/bi00507a030.
Kyte J, Doolittle RF: A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982, 157 (1): 105-132. 10.1016/0022-2836(82)90515-0.
Rose G, Geselowitz A, Lesser G: Hydrophobicity of amino acid residues in globular proteins. Science. 1985, 229: 834-838. 10.1126/science.4023714.
Engelman DM, Steitz TA, Goldman A: Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. Annu Rev Biophys Biophys Chem. 1986, 15: 321-353. 10.1146/annurev.bb.15.060186.001541.
Cornette JL, Cease KB, Margalit H: Hydrophobicity scales and computational techniques for detecting amphipathic structures in proteins. J Mol Biol. 1987, 195 (3): 659-685. 10.1016/0022-2836(87)90189-6.
Wimley WC, White SH: Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat Struct Biol. 1996, 3 (10): 842-848. 10.1038/nsb1096-842.
Charton M, Charton BJ: The structural dependence of amino acid hydrophobicity parameters. J Theor Biol. 1982, 99: 629-644. 10.1016/0022-5193(82)90191-6.
Eisenberg D: Three-dimensional structure of membrane and surface proteins. Ann Rev Biochem. 1984, 53: 595-623. 10.1146/annurev.bi.53.070184.003115.
Rose GD, Wolfenden R: Hydrogen bonding, hydrophobicity, packing, and protein folding. Annu Rev Biophys Biomol Struct. 1993, 22: 381-415. 10.1146/annurev.bb.22.060193.002121.
Biswas KM, DeVido DR, Dorsey JG: Evaluation of methods for measuring amino acid hydrophobicities and interactions. J Chromatogr A. 2003, 1000: 637-655. 10.1016/S0021-9673(03)00182-1.
Clements JD, Martin RE: Identification of novel membrane proteins by searching for patterns in hydropathy profiles. Eur J Biochem. 2002, 269: 2101-2107. 10.1046/j.1432-1033.2002.02859.x.
Ehrenmann F, Giudicelli V, Duroux P: IMGT/Collier de Perles: IMGT Standardized Representation of Domains (IG, TR, and IgSF Variable and Constant Domains, MH and MhSF Groove Domains). Cold Spring Harb Protoc. 2011, 6: 726-736.
Lefranc M-P, Giudicelli V, Ginestoux C: IMGT®, the international ImMunoGeneTics information system®. Nucl Acids Res. 2009, 37: D1006-D1012. 10.1093/nar/gkn838.
Lefranc M-P: IMGT, the International ImMunoGeneTics Information System. Cold Spring Harb Protoc. 2011, 6: 595-603.
Pommié C, Levadoux S, Sabatier R: IMGT standardized criteria for statistical analysis of immunoglobulin V-REGION amino acid properties. J Mol Recognit. 2004, 17: 17-32. 10.1002/jmr.647.
Ruiz M, Lefranc M-P: IMGT gene identification and Colliers de Perles of human immunoglobulin with known 3D structures. Immunogenetics. 2002, 53: 857-883. 10.1007/s00251-001-0408-6.
Kaas Q, Lefranc M-P: IMGT Colliers de Perles: standardized sequence-structure representations of the IgSF and MhcSF superfamily domains. Curr Bioinforma. 2007, 2: 21-30. 10.2174/157489307779314302.
Kaas Q, Ehrenmann F, Lefranc M-P: IG, TR, MHC, IgSf and MhcSF: what do we learn from the IMGT Colliers de Perles?. Brief Funct Genomic Proteomic. 2007, 6: 253-264.
Lefranc M-P: IMGT Collier de Perles for the Variable (V), Constant (C), and Groove (G) Domains of IG, TR, MH, IgSF, and MhSF. Cold Spring Harb Protoc. 2011, 6: 643-651.
Lefranc M-P, Pommié C, Ruiz M: IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Dev Comp Immunol. 2003, 27: 55-77. 10.1016/S0145-305X(02)00039-3.
Lefranc M-P, Pommié C, Kaas Q: IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains. Dev Comp Immunol. 2005, 29: 185-203. 10.1016/j.dci.2004.07.003.
Lefranc M-P, Duprat E, Kaas Q: IMGT unique numbering for MHC groove G-DOMAIN and MHC superfamily (MhcSF) G-LIKE-DOMAIN. Dev Comp Immunol. 2005, 29: 917-938. 10.1016/j.dci.2005.03.003.
Lefranc M-P: IMGT Unique Numbering for the Variable (V), Constant (C), and Groove (G) Domains of IG, TR, MH, IgSF, and MhSF. Cold Spring Harb Protoc. 2011, 6: 633-642.
Ehrenmann F, Kaas Q, Lefranc M-P: IMGT/3Dstructure-DB and IMGT/ DomainGapAlign: A database and a tool for immunoglobulins or antibodies, T cell receptors, MHC, IgSF and MhcSF. Nucl Acids Res. 2010, 38: D301-D307. 10.1093/nar/gkp946.
Ehrenmann F, Lefranc M-P: IMGT/DomainGapAlign: IMGT Standardized Analysis of Amino Acid Sequences of Variable, Constant, and Groove Domains (IG, TR, MH, IgSF, MhSF). Cold Spring Harb Protoc. 2011, 6: 737-749.
Ehrenmann F, Lefranc M-P: IMGT/DomainGapAlign: the IMGT® tool for the analysis of IG, TR, MHC, IgSF and MhcSF domain amino acid polymorphism. Immunogenetics, chap 33. Edited by: Tait B, Christiansen F. 2012, New York, USA: Humana Press, Springer, 605-633. Methods Mol Biol, 882.
Garapati VP, Lefranc M-P: IMGT Colliers de Perles and IgSF domain standardization for T cell costimulatory activatory (CD28, ICOS) and inhibitory (CTLA4, PDCD1 and BTLA) receptors. Dev Comp Immunol. 2007, 31: 1050-1072. 10.1016/j.dci.2007.01.008.
Lefranc M-P: Antibody databases and tools: The IMGT® experience. Therapeutic monoclonal antibodies: from Bench to Clinic, Volume chap 4. Edited by: An Z. 2009, Hoboken, New Jersey, USA: John Wiley & Sons, Inc, 91-114.
Ehrenmann F, Duroux P, Giudicelli V: Standardized sequence and structure analysis of antibody using IMGT®. Antibody engineering, Vol. 2, chap. 2. Edited by: Kontermann R, Dübel S. 2010, Berlin Heidelberg: Springer-Verlag, 11-31.
Lefranc M-P, Lefranc G: Human Gm, Km and Am allotypes and their molecular characterization: a remarkable demonstration of polymorphism. Immunogenetics, chap 34. Edited by: Tait B, Christiansen F. 2012, New York, USA: Humana Press, Springer, 635-680. Methods Mol Biol, 882.
Lefranc M-P, Ehrenmann F, Ginestoux C: Use of IMGT® databases and tools for antibody engineering and humanization. In: P. Chames (Ed.), Antibody engineering, chap 1, Humana Press, Springer, New York, USA. Methods Mol Biol. 2012, 907: 3-37.
Ehrenmann F, Lefranc M-P: IMGT/3Dstructure-DB: Querying the IMGT Database for 3D Structures in Immunology and Immunoinformatics (IG or Antibodies, TR, MH, RPI, and FPIA). Cold Spring Harb Protoc. 2011, 6: 750-761.
This research has been co-financed by the European Union (European Social Fund – ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: Thales. Investing in knowledge society through the European Social Fund.
The authors declare that they have no competing interests.
DV and SK conceived, coordinated, supervised and designed the study. DV, CF, VM and SK carried out the study and drafted the manuscript. All authors read and approved the final manuscript.