ABSTRACT A natural mutant of human lysozyme, D67H, causes hereditary systemic nonneuropathic amyloidosis, which can be fatal. In this disease, insoluble beta-stranded fibrils (amyloids) are found in tissues stemming from the aggregation of partially folded intermediates of the mutant. In this study, we specifically compare the conformation and properties of the structures adopted from the induced unfolding, at elevated temperature, using molecular dynamics, To increase the sampling of the unfolding conformational landscape, three 5 ns trajectories are performed for each of the wild-type and mutant D67H proteins resulting in a total of 30 ns simulation. Our results show that the mutant unfolds slightly faster than the wild-type with both wild-- type and mutant proteins losing most of their native secondary structure within the first 2 ns. They both develop random transient beta-strands across the whole polypeptide chain. Clustering analysis of all the conformations shows that a high population of the mutant protein conformations have a distorted beta-domain. This is consistent with experimental results suggesting that this region is pivotal in the formation of conformations prone to act as "seeds" for amyloid fiber formation.
INTRODUCTION
Human lysozyme is a 130-residue protein found in secretions (e.g., saliva, sweat, and mucus) and more generally in leukocytes and kidneys. It is an enzyme that hydrolyzes preferentially the beta-1,4 glucosidic linkages between N-acetylmuramic acid and N-acetylglucosamine that occur in the mucopeptide cell wall structure of certain microorganisms (Chipman and Sharon, 1969). The wild-type human lysozyme has been crystallized and its structure elucidated by Artymiuk and Blake (1981) at 1.5 A resolution (Fig. 1, left). Its native structure consists of two domains: an alpha-domain that has four a-helices (A-D) and one 3^sub 10^ helix, and a beta-domain, which consists mainly of an antiparallel beta-sheet and a long loop. The active site is located in the cleft that is formed between these two domains. The protein contains four disulphide bonds of which two are located in the alpha-domain, one in the long loop of the beta-domain, and one that connects the two domains.
There are two known natural mutations of the human lysozyme: D67H and I56T (Pepys et al., 1993). They both cause autosomal dominant hereditary nonneuropathic systemic amyloidosis. This is a condition whereby there is tissue deposition in viscera and other body cavities of normally soluble autologous proteins as insoluble fibrils called amyloids. The core of these fibrils structure consists of beta-sheet with the strands perpendicular to the long axis of the fiber (Pepys, 1996). Amyloids can be formed from proteins of diverse sequence, fold, and function and are known to lead to serious medical conditions such as Alzheimer's disease and spongiform encephalopathies (Kelly, 1998). Although the mechanism of fibrilogenesis is not clear, it has been suggested that it is related to changes in stability and tendency to aggregate due to mutations. More specifically, Booth et al. (1997) proposed that a partially folded transient population of the amyloidogenic proteins, which lacks global cooperativity, undergoes structural transformation (a helix-- to-sheet transition) and creates the first template, the "seed", for further protein deposition and fibril formation. In a recent paper, Morozova-Roche et al. (2000) show that the presence of these "seeds" for both wild-type and the two natural mutants of human lysozyme facilitate the formation of fibrils.
We focus on the behavior of the D67H variant. The crystallized structure of this mutant (Booth et al., 1997) at 1.75 A resolution is quite similar to that of the wild-type (Fig. 1, right). However, because of the mutation, the network of hydrogen bonds that stabilizes the Bi-domain is destroyed. This results in a large concerted movement of the beta-sheet and the long loop within the beta-domain. This distortion of the beta-sheet propagates partially downstream to Ile56, the residue at the interface of the two domains, for which increased B-factors have been found.
To understand the behavior of the partially folded structures that may trigger amyloid formation, we explore the conformations adopted during the induced unfolding of the wild-type and mutants at high temperatures (500 K) using molecular dynamics (MD) simulations. The temperature denaturation of proteins using MD is considered as one of the most straightforward computational experiments (Brooks, 1998). The great advantigation of unfolding simulations is that they allow the investigation of the conformational properties at every point along the unfolding pathway (Li and Daggett, 1994). There are many examples of their use (Daggett, 2000) and these have led to detailed insight of experimental results of protein unfolding (Li and Daggett, 1998; Alonso and Daggett, 2000) and the study of the folding pathway (Karplus and Sali, 1995). Lysozyme from hen egg white has been extensively investigated by means of unfolding simulations (Mark and van Gunsteren, 1992; Hulnenberger et al., 1995; Williams et al., 1997; Kazmirski and Daggett, 1998; Gilquin et al., 2000) with the aim to investigate several issues such as the stability and folding of the protein. The human and hen forms of lysozyme have 60% sequence similarity but very similar 3D structure.
In our study, we attempt to examine and compare the structure and properties of the partially folded intermediates obtained from the induced unfolding of the wild-type and the D67H form.
MODEL AND SIMULATION DETAILS
The crystal structures of the wild-type (IREX) and the mutant (1LYY) lysozyme are the starting point of the simulations. Hydrogen atoms are added and the final system includes solvent molecules that are represented by the SPC216 model (Berendsen et al., 1981). All atoms are explicitly represented. The solvated model is contained in a rectangular box (70 X 70 X 70 A) using periodic boundaries conditions (Allen and Tildesley, 1987). The building of the protein models and all their simulations are carried out with the GROMACS suite of programs (Berendsen et al., 1995). The GROMOS 96 force field is used to describe the atomic interactions (van Gunsteren et al., 1996). For the correct treatment of long-range electrostatics, we make use of the particle mesh Ewald summation algorithm (Darden et al., 1993). The high frequency degrees of freedom from the covalent bonds of hydrogen atoms are constrained using the LINCS algorithm (Hess et al., 1997) and thus the time step is increased from 1 to 2 fs. The system is coupled to an external temperature bath with a separate bath for the solvent and the solute (Berendsen et al., 1984). The regulation of the pressure is achieved by means of a pressure bath. Data on the trajectories are saved every 0.2 ps. For these simulations, we use an in-house multiprocessor Origin 2000 with four CPUs. The total central processing unit time for all simulations was ~48 days.
To minimize the initial system, we use a combination of the conjugate gradients and steepest descent methods in which after every 50 steps of conjugate gradients, one step of steepest descent is performed. The minimization is terminated when the overall force of the system is 100 N or after 5000 steps. This protocol is first used to minimize hydrogen atom and water molecules positions and then extended to the whole system. To further optimize the arrangement of the solvent around the protein and alleviate high energy regions (hot spots), the water molecules only are assigned initial velocities from a Gaussian distribution generated from a random seed and then warmed up from 50 K to 300 K over 10 ps of MD. This is followed by equilibration of the water molecules at 300 K for 30 ps. Then the whole system (including the previously constrained protein) is assigned initial velocities from a Gaussian distribution generated by a random seed for 50 K, warmed up to 300 K for 10 ps, and finally equilibrated at this temperature for 1000 ps. The 1000 ps trajectory at 300 K serves as a control simulation. The last conformation of the system obtained from this trajectory is used as the starting structure for the unfolding simulation in which the system is warmed up to 500 K for 10 ps and is subsequently maintained at this temperature. To enhance the sampling of the unfolding pathway, long multiple trajectories are performed by assigning velocities generated from different seed numbers. Three 5000 ps high temperature (500 K) and three 1080 ps control (300 K) simulations are performed for each lysozyme form.
We thank the Biotechnology and Biological Sciences Research Council for computer hardware and the Engineering and Physical Sciences Research Council for a studentship to G.M.
This work has been carried out within the Biotechnology and Biological Sciences Research Council-sponsored Bloomsbury Centre for Structural Biology.
REFERENCES
Allen, M. P., and D. J. Tildesley. 1987. Computer Simulations of Liquids. Clarendon, Oxford.
Alonso, D. 0. V., and V. Daggett. 2000. Staphylococcal protein A: unfolding pathways, unfolded states, and differences between the B and E domains. Biophysics. 97:133-138.
Artymiuk, P. J., and C. C. F. Blake. 1981. Refinement of human lysozyme at 1.5 A resolution. Analysis of non-bonded and hydrogen-bond interactions. J. Mol. BioL 152:737-762.
Berendsen, H. J. C., J. P. M. Postma, W. F. van Gunsteren, and J. Her-mans. 1981. Interaction models for waters in relation to protein hydration. In Intermolecular Forces. B. Pullman, editor. D. Reidel, Dordrecht, The Netherlands. 331-342.
Berendsen, H. J. C., J. P. M. Postma, W. F. van Gusteren, A. Di Nola, and J. R. Haak. 1984. Molecular dynamics with coupling to an external bath. J. Chem . Phys. 81:3684-3690.
Berendsen, H. J. C., D. van der Spoel, and R. van Drunen. 1995. GROMACS: a message-passing parallel molecular dynamics implementation. Comp. Phys. Comm. 91:43-56.
Booth, D. R., M. Sund, V. Bellotti, C. V. Robinson, W. L. Hutchinson, P. E. Fraser, P. N. Hawkins, C. M. Dobson, S. E. Radford, C. C. F. Blake, and M. B. Pepys. 1997. Instability, unfolding and aggregation of human lysozyme variants underlying arnyloid fibrillogenesis. Nature. 385.787793.
Brooks, C. L., 3rd. 1998. Simulations of protein folding and unfolding. Curr. Op. Struct. Biol. 8:222-226.
Canet, D., A. M. Last, P. Tito, M. Sunde, A. Spencer, D. B. Archer, C. Redfield, C. V. Robinson, and C. M. Dobson. 2002. Local cooperativity in the unfolding of an amyloidogenic variant of human lysozyme. Nat. Struct. Biol. 9:308-315.
Canet, D., M. Sunde, A. M. Last, A. Miranker, A. Spencer, C. V. Robinson, and C. M. Dobson. 1999. Mechanistic studies of the folding of human lysozyme and the origin of amyloidogenic behavior in its disease-related variants. Biochemistry. 38:6419-27.
Chipman, D. M., and N. Sharon. 1969. Mechanism of lysozyme action. Science. 165:454-465.
Daggett, V. 2000. Long timescale simulations. Curr. Op. Struct. Biol. 10: 160-164.
Darden, T., D. York, and L. Pedersen. 1993. Particle mesh Ewald: A N-log(N) method for Ewald sums in large systems. J. Chem. Phys. 98:10089-10092.
Dyson, H. J., J. R. Sayre, G. Merutka, H. C. Shin, R. A. Lerner, and P. E. Wright. 1992. Folding of peptide fragments comprising the complete sequence of proteins models for initiation of protein folding. 11. Plastocyanin. J. Mol. Biol. 226:819-835.
Everitt, B. S. 1993. Cluster Analysis. John Wiley & Sons, New York. Gilquin, B., C. Guilbert, and D. Perahia. 2000. Unfolding of hen egg lysozyme by molecular dynamics simulations at 300K: insight into the role of the interdomain interface. Proteins. 41:58-74.
Hess, B., H. Bekker, H. J. C. Berendsen, and J. G. E. M. Fraaije. 1997. LINCS: a linear constraint solver for molecular simulations. J. Comp. Chem. 18:1463-1472.
Hunenberger, P. H., A. E. Mark, and W. F. van Gunsteren. 1995. Computational approaches to study protein unfolding: hen egg white lysozyme as a case study. Nature: Struct. Funct. Genet. 21:196-213.
Kabsch, W., and C. Sander. 1983. Dictionary of protein secondary structure: pattern-recognition of hydrogen-bonded and geometrical features. Biopolymers. 22:2577-2637.
Karplus, M., and A. Sali. 1995. Theoretical studies of protein folding and unfolding. Curr. Opin. Struct. Biol. 5:58-73.
Kazmirski, S. L., and V. Daggett. 1998. Non-native interactions in protein folding intermediates: molecular dynamics simulations of hen lysozyme. J. Mol. Biol. 284:793-806.
Kazmirski, S. L., A. Li, and V. Daggett. 1999. Analysis methods for comparison of multiple molecular dynamics trajectories: applications to protein unfolding pathways and denatured ensembles. J. Mol. Biol. 290: 283-304.
Kelly, J. W. 1998. The alternative conformations of amyloidogenic proteins and their multi-step assembly pathways. Curr. Opin. Struct. Biol. 8:101-106.
Li, A., and V. Daggett. 1994. Characterization of the transition state of protein unfolding by use of molecular dynamics: chymotrypsin inhibitor 2. Proc. Natl. Acad. Sci. USA. 91:10430-10434.
Li, A., and V. Daggett. 1998. Molecular dynamics simulation of the unfolding of barnase: characterization of the major intermediate. J. Mol. Biol. 275:677-694.
Mahalanobis, P. C. 1930. On tests and measures of groups divergence I. Journal of the Asiatic Society of Bengal. 26:541.
Mark, A. E., and W. F. van Gunsteren. 1992. Simulation of the thermal denaturation of HEW-lysozyme: trapping the molten globule state. Biochemistry. 31:7745-7748.
Miranker, A., S. E. Radford, M. Karplus, and C. M. Dobson. 1991. Demonstration by NMR of folding domains in lysozyme. Nature. 349:633-636.
Miranker, A., C. V. Robinson, S. E. Radford, R. T. Alpin, and C. M. Dobson. 1993. Detection of transient protein folding populations by mass spectroscopy. Science. 262:896-900.
Morozova-Roche, L. A., J. Zurdo, A. Spencer, W. Noppe, V. Receveur, D. B. Archer, M. Joniau, and C. M. Dobson. 2000. Amyloid fibril formation and seeding by wild-type human lysozyme and its disease-related mutational variants. J. Struct. Biol. 130:339-351.
Otto, M. 1998. Chemometrics: Statistics and Computer Application in Analytical Chemistry. Weinheim, Cambridge.
Pepys, M. B., P. N. Hawkins, D. R. Booth, D. M. Viguishin, G. A. Tennent, A. K. Soutar, N. Tony, 0. Nguyen, C. C. F. Blake, C. J. Terry, T. G. Feest, A. M. Zalin, and J. J. Hsuan. 1993. Human lysozyme gene mutations cause hereditary systemic amyloidosis. Nature. 206:553-557.
Pepys, M. B. 1996. Amyloidosis. In The Oxford Textbook of Medicine, Vol. 2. D. J. Weatherall, J. G. G. Ledingham, and D. A. Warell, editors. Oxford University Press, Oxford.
Schormann, N., J. R. Murrell, and M. D. Benson. 1998. Tertiary structures of amyloidogenic and non-amyloidogenic transthyretin variants: new model for amyloid fibril formation. Amyloid. 5:175-187.
Skolnick, J., and A. Kolinski. 1991. Dynamic Monte Carlo simulations of a new lattice model of globular protein folding, structure and dynamics. J. Mol. Biol. 221:499-531.
Spath, H. 1980. Cluster Analysis Algorithms for Data Reduction and Classification of Objects. Ellis Horwood, Chichester, UK.
Tobias, D. J., S. F. Sneddon, and C. L. Brooks. 1990. Reverse turns in blocked dipeptides are intrinsically unstable in water. J. Mol. Biol. 216:783-796.
van Gunsteren, W. F., S. R. Billeter, A. A. Eising, P. H. Hunenberger, P. Kruger, A. E. Mark, W. R. 0. Scott, and I. G. Tironi. 1996. Biomolecular Simulation: The GROMOS96 Manual and User Guide. Hochschulverlag AG an der ETH, Zurich, Switzerland.
Venables, W, N., and B. D. Ripley. 1999. Modern Applied Statistics with S-PLUS, 3rd ed. Springer, New York.
Williams, M. A., J. M. Thornton, and J. M. Goodfellow. 1997. Modelling protein unfolding: hen egg-white lysozyme. Protein Eng. 10:895-903. Worth, G. A., F. Nardi, and R. C. Wade. 1998. Use of multiple molecular
dynamics trajectories to study biomolecules in solution: the YTGP peptide. J. Phys. Chem. B. 102:6260-6272.
Yang, A., B. Hitz, and B. Honig. 1996. Free energy determinants of secondary structure formation: III. /-toms and their role in protein folding. J. Mol. Biol. 259:873-882.
Zimmerman, S. S., and H. A. Scheraga. 1977. Local interactions in bends of proteins. Proc. Natl. Acad. Sci. USA. 74:4126-4129.
George Moraitakis and Julia M. Goodfellow
School of Crystallography, Birkbeck College, University of London, London WC1E 7HX, United Kingdom
Submitted March 18, 2002, and accepted for publication September 18, 2002.
Address reprint requests to Julia M. Goodfellow, School of Crystallography, Birkbeck College, University of London, Malet Street, London WCIE 7HX, UK. Tel.: 44-01793-413208; E-mail: julia.goodfellow@bbsrc.ac.uk.
(C) 2003 by the Biophysical Society
0006-3495/03/04/2149/10 $2.00
Copyright Biophysical Society Apr 2003
Provided by ProQuest Information and Learning Company. All rights Reserved