Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MODEL FOR MUTUALLY EXCLUSIVE DOMAIN FOLDING MOLECULAR SWITCH
Document Type and Number:
WIPO Patent Application WO/2004/085611
Kind Code:
A2
Abstract:
The present invention is a model for a mutually exclusive domain folding molecular switch comprising a two-domain, bifunctional fusion protein wherein the free energy released by the folding of a first domain of the fusion protein drives an unfolding of a second domain of the fusion protein, and vice versa. The molecular structure of the fusion protein is engineered so that, at any time, the folding of the first domain necessarily unfolds the other domain, and vice versa, thereby making the folded and unfolded states of the first and second domains mutually exclusive. This is accomplished by the insertion of an exemplary insert protein into a surface loop of an exemplary target protein subject to a novel structural design criterion wherein the N-C terminal length of the exemplary insert protein is at least two-times greater than the C&agr -C&agr length of the surface loop of the exemplary target protein.

Inventors:
LOH STEWART (US)
JEUNG-HOI HA (US)
BUTLER MARK C (US)
RADLEY TRACEY I (US)
Application Number:
PCT/US2004/008087
Publication Date:
October 07, 2004
Filing Date:
March 17, 2004
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV NEW YORK STATE RES FOUND (US)
LOH STEWART (US)
JEUNG-HOI HA (US)
BUTLER MARK C (US)
RADLEY TRACEY I (US)
International Classes:
C12N9/22; G01N33/48; G06F19/00; G06G7/48; C12N; (IPC1-7): C12N/
Foreign References:
US6037523A2000-03-14
US5877012A1999-03-02
US6090933A2000-07-18
Attorney, Agent or Firm:
Rabin, Sander (2 Irving Place Troy, NY, US)
Download PDF:
Claims:
We Claim :
1. A model for a mutually exclusive folding domain molecular switch including a fusion protein comprising at least one insert protein having an insert domain lying between an amino terminal and a carboxyl terminal of said at least one insert protein, said insert domain being associated with a first quantity of free energy; and, a target protein having at least one surface loop that begins at an alpha carbon of a first surface loop amino acid and terminates at an alpha carbon of a second surface loop amino acid, said at least one surface loop comprising a target domain of said target protein, said target domain being associated with a second quantity of free energy, wherein, said at least one insert protein is operatively inserted within said at least one surface loop between said alpha carbon of said first surface loop amino acid and said alpha carbon of said second surface loop amino acid such that an aminocarboxyl length extending between an alpha carbon of said amino terminal of said at least one insert protein and an alpha carbon of said carboxyl terminal of said at least one insert protein is at least twotimes greater than an alphacarbonalphacarbon length extending between said alpha carbon of said first surface loop amino acid and said alpha carbon of said second surface loop amino acid.
2. The fusion protein of claim 1, wherein said insert domain exists in either a folded or unfolded conformation and said target domain exists in either a folded or unfolded conformation, said insert domain and said target domain comprising a cooperative and reversible conformational equilibrium such that if said insert domain is in its folded conformation, said at least one target domain is in its unfolded conformation and vice versa.
3. The fusion protein of claim 2, wherein all or part of said first quantity of free energy is made available to drive a folding of said target domain from its unfolded conformation by means of a first controllable effector signal, and all or part of said second quantity of free energy is made available to drive a folding of said insert domain from its unfolded conformation by means of a second controllable effector signal.
4. The fusion protein of claim 3, wherein said first controllable effector signal is selected from the group comprising ligand binding, pH, temperature, chemical denaturants, or mutations in either said insert domain or said target domain.
5. The fusion protein of claim 3, wherein said second controllable effector signal is selected from the group comprising ligand binding, pH, temperature, chemical denaturants, or mutations in either said insert domain or said target domain.
6. The fusion protein of claim 2, wherein said insert domain and said target domain are disenabled from simultaneously coexisting in their respective folded conformations.
7. The fusion protein of claim 2, wherein said insert domain and said target domain are disenabled from simultaneously coexisting in their respective unfolded conformations.
8. The fusion protein of claim 2, wherein any excess of said first quantity of free energy of said insert domain that is not necessary to stabilize said insert domain in its folded conformation is spontaneously transferred, through the structure of said fusion protein, to said target domain to unfold it from its folded conformation.
9. The fusion protein of claim 2, wherein any excess of said second quantity of free energy of said target domain that is not necessary to stabilize said target domain in its folded conformation is spontaneously transferred, through the structure of said fusion protein, to said insert domain to unfold it from its folded conformation.
10. The fusion protein of claim 2, wherein said at least one insert protein comprises human ubiquitin, said insert domain comprises a regulatory domain of human ubiquitin, said target protein comprises barnase, said at least one target domain comprises a cytotoxic domain of bamase, said aminocarboxyl length is about 38 A, said first surface loop amino acid comprises proline in the number 64 position ("Pro64"), said second surface loop amino acid comprises threonine in the number 70 position ("Thr70"), and said alphacarbonalphacarbon length is about 10.4 A.
11. The fusion protein of claim 10 wherein said regulatory domain of human ubiquitin and said cytotoxic domain of barnase comprise a cooperative and reversible conformational equilibrium, that may be determined by said controllable first and second effector signals.
12. A method for the production of a protein comprising the steps of : a. selecting a linker containing first and second restriction sites between a Lys66 and a Ser67 codon of a barnase gene; b. using said first and second restriction sites of said linker to operationally insert a ubiquitin gene between two aminoacid codons of said linker, thereby creating a ubiquitinbarnase fusion gene; c. fully sequencing said ubiquitinbarnase fusion gene to verify its integrity; d. using enzymes to operationally insert said ubiquitinbarnase fusion gene into any plasmid of a BL21 (DE3) family, thereby creating an interim ubiquitin barnase fusion expression plasmid; e. operationally inserting a gene for barstar and its natural promoter from Bacillus amyloliquifaciesls into said interim ubiquitinbarnase fusion expression plasmid, thereby creating a ubiquitinbarnase fusionbarstar complex plasmid ; f. cloning said gene for barstar into a T7 promotercontaining plasmid conferring resistance to an antibiotic other than ampicillin onto cells transformed by said T7 promotercontaining plasmid, thereby creating a barstar plasmid; g. transforming E. coli BL21 (DE3) cells grown at about 20 to 37 degrees C in any medium compatible with E. coli growth using both said barstar plasmid and said ubiquitinbarnase fusionbarstar complex plasmid, and inducing said E. coli BL21 (DE3) cells with about 100 mg/L isopropyl bDthiogalactopyranoside; h. harvesting said transformed E. coli cells by centrifugation after about 2 to 12 hours ; after said induction; i. placing said harvested E. coli cells in 10 mM sodium phosphate at a pH of 7.5, thereby creating a solution of harvested E. coli cells; j. lysing said solution of harvested E. coli cells by repeated freezethaw cycles in the presence of about 10mg/liter lysozyme, thereby creating a lysate; k. adding about 10 mg/liter DNase I to reduce the viscosity of said lysate; 1. centrifuging said reduced viscosity lysate to remove insolubles, thereby forming a supernatant; m. adding about 8 M urea to said supernatant to dissociate bound barstar; n. Removing said dissociated barstar from said supernatant by passing said supernatant through an anion exchange chromatography resin to yield a solution ; o. loading said solution onto a cation exchange column; p. washing said solution with about 10 mM sodium phosphate (pH about 7.5) and about 6 M urea; q. eluting said solution using a 0 to 0.2 M NaCl gradient; r. Removing said urea from said dilution by dialysis against doubledistilled water to yield barnaseubiquitin fusion protein.
Description:
MODEL FOR MUTUALLY EXCLUSIVE DOMAIN FOLDING MOLECULAR SWITCH

Background of the Invention 1. Technical Field The invention relates generally to a model for a molecular switch to modulate the bioactivity of proteins 2. Related Art Ribonucleases are hydrolase enzymes that break linkages between nucleotides in ribonucleic acid. They are accordingly highly cytotoxic. A major problem with their use as therapeutic agents, for example, as pharmacologic agents in the treatment of cancer, is that their cytotoxicity is indiscriminate. Currently available ribonuclease pharmacologic agents kill normal as well as neoplastic cells, and the side effects of their use can be severe. Additionally, currently available ribonuclease agents demonstrate poor bioavailability owing to their rapid degradation by the liver and their difficulty in passing through both normal and neoplastic cell membranes.

Summary of the Invention The present invention comprises a model for a mutually exclusive folding domain molecular switch (and a method for the creation thereof), which model includes a fusion protein comprising at least one insert protein having an insert domain lying between an amino terminal and a carboxyl terminal of the at least one insert protein, the insert domain being associated with a first quantity of free energy; and, a target protein having at least one surface loop that begins at an alpha carbon of

a first surface loop amino acid and terminates at an alpha carbon of a second surface loop amino acid, the at least one surface loop comprising a target domain of the target protein, the target domain being associated with a second quantity of free energy, wherein, the at least one insert protein is operatively inserted within the at least one surface loop between the alpha carbon of first surface loop amino acid and the alpha carbon of second surface loop amino acid such that an amino-carboxyl length extending between an alpha carbon of the amino terminal of the at least one insert protein and an alpha carbon of the carboxyl terminal of the at least one insert protein is at least two-times greater than an alpha-carbon-alpha-carbon length extending between the alpha carbon of the first surface loop amino acid and the alpha carbon of the second surface loop amino acid.

Brief Description of the Drawings FIG. 1A is a schematic illustration of a domain of an exemplary insert protein in an unfolded conformation.

FIG. 1B is a schematic illustration o f a domain of an exemplary insert protein in an folded conformation.

FIG. 1 C is a schematic illustration of a conformation of an exemplary target protein having a folded domain in the shape of a wedge and having a surface loop.

FIG. 1D is a schematic illustration of a conformation of an exemplary target protein having an unfolded domain in the shape of a straight line and having a surface loop.

FIG. 1E is a schematic illustration of a exemplary fusion protein capable of existing in two mutually exclusive conformations.

FIG. 1F is a schematic illustration of a exemplary fusion protein capable of existing in two mutually exclusive conformations, in which an equilibrium state has been influenced by the binding of a ligand.

FIG. 2A is a schematic illustration of a human ubiquitin molecule.

FIG. 2B is a schematic illustration a barnase molecule.

FIG. 3 is a graph of circular dichroism spectra of a ubiquitin-barnase fusion protein as a function of temperature, showing temperature-induced conformational change in the structure of ubiquitin-barnase fusion protein.

FIG. 4 is a graph of the conversion an ubiquitin-barnase fusion protein from the barnase conformation to the ubiquitin conformation, monitored by ellipticity at 230 nm (open triangles, solid line) and barnase enzymatic activity (closed triangles, dashed line). Lines are for illustrative purpose only.

FIG. 5 is a graph showing the denaturation of the bamase domain (10 °C, circles) and the ubiquitin domain (40 °C, squares) induced by urea and guanidine hydrochloride ("GdnHCl"), respectively.

FIG. 6 is a graph of a circular dichroism spectral recording at 40 degrees C, showing barstar-induced folding of the barnase domain and unfolding of the ubiquitin domain.

FIG. 7 is a graph of a circular dichroism spectral recording at 15 degrees C, showing barstar-induced folding of the barnase domain and unfolding of the ubiquitin domain.

Detailed Description of the Invention The model for a mutually exclusive domain folding molecular switch is a novel and powerful approach to understanding the fundamental mechanisms of

allosteric switching in molecular biology and for the developing diagnostic and therapeutic proteins with novel capabilities, possessing the following advantages: [i] mechanism of the molecular switch it is inherently cooperative; and, [ii] the all-or-nothing action of the mechanism of the molecular switch assures that it behaves in a binary fashion; and, [iii] the switching mechanism is reversible; and, [iv] the position of a reciprocally folding and unfolding confonnational equilibrium, to be described hereinafter, can be readily adjusted by external factors.

In their simplest form, proteins are polypeptides, i. e. , linear polymers of amino acid monomers. However, the polymerization reaction which produces a polypeptide results in the loss of one molecule of water from each amino acid. Consequently, a polypeptide is more rigorously defined as a polymer of amino acid residues. Natural protein molecules may contain as many as 20 different types of amino acid residues, each of which contains a distinctive side chain.

An amino acid is an organic molecule containing an amino group ("-NH2") and a carboxylic acid group ("-COOH"). While there are many forms of amino acids, all of the important amino acids found in living organisms are alpha-amino acids. Alpha amino acids have their both their-COOH and-NH2 groups attached to the same carbon atom, which is called the alpha carbon atom.

Thus, all of the important amino acids found in living organisms consist of an alpha carbon atom to which there is attached: [i] A hydrogen atom [ii] An amino group (-NH2) [iii] A carboxyl group (-COOH).

[iv] One of 20 different"R"groups.

It is the structure of the R group that distinguishes each amino acid structurally and determines its biochemical properties. Moreover, the structure and biochemical properties of a protein is dictated by the precise sequence of the amino acids in the polypeptide chains of which it is comprised The particular linear sequence of amino acid residues in the polypeptide chain comprising a protein defines the primary structure of that protein. However individual polypeptides and groups of polypeptides undergo spontaneous structural alteration and association into a number of recurring intermediate patterns such as, for example, helices, including alpha helices, and sheets, including beta sheets. These recurring intermediate polypeptide patterns are referred to as a protein's secondary structure. The spontaneous structural alteration and association of polypeptide chains into a secondary structure is determined by the sequence of amino acids in the polypeptide chains and by the ambient biochemical environment.

The helices, sheets, and other patterns of a protein's secondary structure additionally undergo a process of thermodynamically-preferred compound folding to produce a three-dimensional or tertiary structure of the protein. The fully folded conformation of the protein is maintained by relatively weak interatomic forces such as, for example, hydrogen bonding, hydrophobic interactions and charge-charge interactions. Covalent bonds between sulphur atoms may also participate in protein folding into a tertiary conformation by forming intra-molecular disulfide bridges in a single polypeptide chain, as well as by forming intermolecular disulfide bridges between separate polypeptide chains of a protein. This ability of polypeptide chains to fold into a great variety of structures, combined with the large number of amino

acid sequences of a polypeptide chain that can be derived from the 20 common amino acids in proteins, confers on protein molecules their great range of biological activity.

The tertiary structure of a protein may contain a surface loop. A surface loop is a continuous length of polypeptide chain whose constituent amino acids are in neither an alpha helical conformation nor in a beta sheet conformation, and can contact at least five water molecules, as determined by the DSSP computer program of Wolfgang Kabsch and Chris Sander. The DSSP, a program which is well known in the art, defines secondary structure, geometrical features and solvent exposure of proteins, given atomic coordinates in Protein Data Bank format, which is also well known in the art. (W. Kabsch & C. Sander,"Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical figures", Biopolymers 22, 2577-2637. (1983); See also, Centre for Molecular and Biomolecular Informatics, University of Nijmegen, Toemooiveld 1, P. O. Box 9010,6500 GL Nijmegen, +3. 1 (0) 24-3653391, postmaster@cmbi. kun. nl ; http ://www/cmbi. kun. nl. gv/dssp/) Protein folding occurs on a global level that endows the entire protein molecule with a three dimensional structure and surface topology. Protein folding also occurs at a local level at multiple sites upon and within a protein. Locally, folding may involve one or more polypeptide subunits of the protein to endow different regions of the protein with different specific biological activities, or different specific molecular architectures, such as, for example, fashioning a location in a protein molecule into a receptor site for another molecule.

As used herein, the term domain means the molecular structure of an entire protein molecule or the molecular structure of a part, portion, or region of the

molecular structure of a protein molecule, including a part, portion, or region of the protein molecule's surface or the protein molecule's interior. A domain may refer only to a distinction in a protein molecule's structure, such as for example, an alpha helix or a beta sheet. A domain may or may not have an associated biological function, such as a regulatory, receptor, signaling, active, catalytic, or other biological function. A domain may further be associated with a free energy, i. e. , a thermodynamic state function that indicates the amount of energy that stabilizes the domain when the protein, or part thereof, with which the domain is associated is in a folded configuration. All of part of the free energy may be available for the domain to do biochemical work.

Because the folding of a protein molecule is both a global and local process, it can endow a protein molecule with both global and local structural and biological properties, such as, for example, an enzymatic activity, or a capacity and specificity for binding other proteins, such as antigens. Consequently, the biological functions of a protein depend on both its global folded tertiary structure, which is also called its native or folded conformation, as well as the folded structure of regions of the protein.

Conversely, a global or local unfolding of a protein deactivates its global or local biological activity. An unfolded, biologically inactive protein is said to be in a denatured or unfolded conformation.

Many proteins are comprised of domains that communicate with each other by means of conformational changes in the structure of the protein of which they are a part, in order to activate or deactivate a biological function. For example, in the case of a protein that is an enzyme, ligand binding or phosphorylation can serve as a switching mechanism to induce structural changes within the enzyme's regulatory domain, which then triggers activity in the enzyme's catalytic domain.

Another type of switching mechanism is illustrated in vivo by proteins that are unfolded in physiological conditions but fold upon binding to a cellular target. In this molecular switching mechanism, the folding and unfolding of a regulatory domain of a protein modulates the function of the protein via propagation of structural changes to its active domain.

In the case of the protein serving as an exemplary embodiment of the model for a mutually exclusive domain folding (hereinafter, "MEDF") switch, the protein is a exemplary fusion protein synthesized from: [i] an exemplary insert protein having an insert domain lying between an amino terminal and a carboxyl terminal, which insert domain is associated with a first quantity of free energy; and, [ii] an exemplary target protein having at least one surface loop that begins at an alpha carbon of a first surface loop amino acid and terminates at an alpha carbon of a second surface loop amino acid, which surface loop comprises a target domain associated with a second quantity of free energy.

The amino terminal of the exemplary insert protein is spatially separated from the carboxyl terminal of the exemplary insert protein by a linear distance known as the amino-carboxyl length (hereinafter, the"N-C terminal length") of the exemplary insert protein, that is measured when the exemplary insert protein is in its folded conformation. The first surface loop amino acid of the exemplary target protein is spatially separated from the second surface loop amino acid of the exemplary target protein by a linear distance known as the alpha-carbon-alpha-carbon length of the surface loop of the exemplary target protein (hereinafter, the"Ca-Ca length"), that is also measured when the exemplary target protein is in its folded conformation.

The molecular structure of the exemplary fusion protein is engineered so that, at, any time, the folding of the insert domain necessarily unfolds the target domain, and vice versa, thereby making the folded and unfolded states of the insert and target domains mutually exclusive. This mutual exclusion of concurrently folded or concurrently unfolded states is accomplished by the insertion of the exemplary insert protein into the surface loop of the exemplary target protein subject to a novel structural design criterion wherein the N-C terminal length of the exemplary insert protein is at least two-times greater than the Ca-Ca length of the surface loop of the exemplary target protein.

The present invention is an exemplary embodiment of a model for a mutually exclusive domain folding ("MEDF") molecular switch comprising an exemplary two- domain, bifunctional fusion protein, wherein the free energy released by the folding of a first domain of the exemplary fusion protein drives unfolding of a second domain of the exemplary fusion protein, and vice versa.

Subject to this novel structural design criterion, a dynamic state of thermodynamic and structural equilibrium is established in the exemplary fusion protein that disenables the insert domain of the exemplary insert protein and the target domain of the exemplary target protein from simultaneously co-existing in their native folded states.

Accordingly, any excess free energy present in one of the two domains that is not necessary to stabilize its folded configuration is spontaneously transferred, through the structure of the exemplary fusion protein, to the other of the two domains to unfold it from its folded configuration, and vice versa. In effect, the excess free energy stored in the folded conformation of one domain is used to drive the unfolding of the other domain; and, the molecular structure of the exemplary fusion protein is

engineered to create a dynamic state. of thermodynamic and correlative structural equilibrium that is determined by the relative thermodynamic and structural stabilities of the two domains.

Viewed another way, the molecular structure of the exemplary fusion protein is engineered to create a MEDF molecular switch by creating cooperatively folding- unfolding subunits comprising two protein domains, which two domains cannot simultaneously exist in their folded states. This scheme is depicted in FIGS. lA-F In FIG. 1A, exemplary insert protein 51, having an amino terminal 21 and a carboxyl terminal 22, exists in an unfolded conformation 20, thereby forming an exemplary unfolded insert domain, schematically illustrated as a hatched ribbon that is coincident with the extent of exemplary insert protein 51. In FIG. 1B, exemplary insert protein 51, having an amino terminal 21 and a carboxyl terminal 22, exists in a folded conformation 23, thereby forming an exemplary folded insert domain, schematically illustrated as a hatched double-crossed ribbon that is coincident with the extent of exemplary insert protein 51, and folds to form indentation 24.

In FIG. 1 C, there is shown schematically a folded conformation 26 of an exemplary target protein 4 having an exemplary folded target domain in the shape of a wedge 46. Exemplary target protein 41 also has a surface loop 27, schematically shown as a nearly full circle, arising from a first alpha carbon forming a first terminus 28 of a first arm 29 of wedge 46, and ending at a second alpha carbon forming a second terminus 30 of a second arm 31 of wedge 46. Also shown schematically in FIG. 1C is line 45, representing the COL-CCG length of the surface loop 27.

In FIG. ID, there is shown schematically an unfolded conformation 32 of exemplary target protein 41 in which exemplary folded target domain 46 has

unfolded into the shape of straight line 56. Unfolded conformation 32 of exemplary target protein 41 also has surface loop 27, now shown as a half-circle arising from first alpha carbon forming a first terminus 28 of the straight line 56 and ending at a second alpha carbon forming a second terminus 30 of the straight line 56.

In FIG. 1E, there is shown schematically an exemplary fusion protein 35 comprised of exemplary insert protein 51 inserted into surface loop 27 of exemplary target protein 41, which exemplary fusion protein 35 is capable of existing in two mutually exclusive conformations 35L and 35R, representing the mutually exclusive binary states of the model for the MEDF molecular switch embodied in exemplary fusion protein 35.

The image to the left of the antiparallel arrows 36 of FIG. lE shows exclusive state 35L of exemplary fusion protein 35, wherein exemplary insert protein 51, with its exemplary insert domain in unfolded (hatched ribbon) conformation 20, (as shown in FIG. 1A), has been inserted into surface loop 27 of exemplary target protein 41 with its exemplary target domain in its folded conformation 46, (as shown in FIG. 1C). The image to the right of the antiparallel arrows 36 of FIG. lE shows exclusive state 35R of exemplary fusion protein 35, wherein exemplary insert protein 51, with its exemplary insert domain in its folded (hatched double-crossed ribbon) conformation 23, (as shown in FIG. 1B), inserted into surface loop 27 of exemplary target protein 41 with its exemplary target domain in its unfolded (straight line) conformation 56 (as shown in FIG. 1D).

In FIG. 1F, exemplary fusion protein 35 is again shown schematically existing in two mutually exclusive conformations 35L and 35R, representing the mutually exclusive binary states of the model for the MEDF molecular switch embodied in exemplary fusion protein 35. However the dynamic equilibrium

existing between conformations 35L and 35R has been shifted to the right by the binding of exemplary ligand 40 to the indentation 24 of exemplary insert domain of protein 51 in folded conformation 23.

If the exemplary insert domain of exemplary insert protein 51 in folded conformation 23 (FIG. 1B and FIG. 1E Right) is more stable. than exemplary target domain having wedge 46 of exemplary target protein 41 in its folded conformation 26, (FIG. 1C and FIG. 1E Left), then exemplary insert domain of exemplary insert protein 51 in folded conformation 23 (FIG. 1B and FIG. 1E Right) will have an excess of free energy with which to forcibly stretch and unfold folded conformation 26 of exemplary target domain, having wedge 46, of exemplary target protein 41 (FIG. 1C and FIG. 1 E Left), thereby unfolding wedge 46 into line 56, and yielding exemplary fusion protein 35 in state 35R.

If exemplary target domain, having wedge 46, of exemplary target protein 41 in folded conformation 26 (FIG. 1C and FIG. 1E Left) is more stable than exemplary insert domain of exemplary insert protein 51 in folded conformation 23 (FIG. 1B and FIG. 1E Right), then exemplary target domain, having wedge 46, of exemplary target protein 41 in its folded conformation 26 (FIG. 1C and FIG. 1E Left) will have an excess of free energy with which to forcibly stretch and unfold exemplary insert domain of exemplary insert protein 51 in folded conformation 23 (FIG. 1B and FIG.

1E Right), thereby folding line 56 into wedge 46, and yielding exemplary fusion protein 35 in state 35L.

In this manner, exemplary embodiment of the model for a MEDF switch fully exploits the free energy stored in the folded conformations of the aforementioned domains, as well as the inherent cooperativity of reciprocal domain folding, to create a molecular switch of unprecedented efficiency. Consequently, the model for a

MEDF molecular switch is a novel and powerful approach to understanding the fundamental mechanisms of allosteric switching in molecular biology and for the developing diagnostic and therapeutic proteins with novel capabilities, possessing the following advantages: [i] mechanism of the molecular switch it is inherently cooperative; and, [ii] the all-or-nothing action of the mechanism of the molecular switch assures that it behaves in a binary fashion; and, [iii] the switching mechanism is reversible; and, [iv] the position of the folding/unfolding equilibrium can be readily adjusted by external factors.

For example, When present at l, uM concentration, a ligand that binds to a protein with a dissociation constant of 1 nM., will stabilize the native conformation of the protein by as much as RT o ln (l03), or 4.2 kcal mol-1 at 37 degrees C. This value is comparable to the total free energy change of folding for many proteins.

While the exemplary embodiment of the model for a MEDF switch entails the creation of a two-domain, bifunctional exemplary fusion protein to be described more fully hereinafter, the exemplary embodiment of the model for a MEDF switch disclosed herein is not limited to the insertion of an exemplary insert protein into a exemplary target protein having only one domain or only one biological function.

The exemplary embodiment of the model for a MEDF switch disclosed herein comprises cases wherein one or more exemplary insert proteins is inserted into one or more surface loops of exemplary target proteins having multiple domains and multiple biological functions, the effect of these insertions being to form a one or more cooperatively folding-unfolding subunits in the resultant exemplary fusion protein, each comprising two protein domains, which two domains cannot simultaneously exist in their folded states, thereby forming one or more cooperative,.

reversible, MEDF molecular switches in the same exemplary fusion protein, each of which is responsive to different controllable effector signals such as, for example, ligand binding, pH, temperature, chemical denaturants, or the presence of stabilizing or destabilizing mutations in either the barnase or ubiquitin domains.

To create a specific nonlimiting exemplary embodiment of the model for a MEDF switch, the inventors herein created a two-domain bifunctional fusion protein comprising the insertion of an exemplary insert protein, the human ubiquitin molecule having one regulatory domain, into a selected surface loop of an exemplary target protein, barnase, which has one catalytic (or cytotoxic) domain.

The exemplary insert protein, human ubiquitin, is a protein having one regulatory domain, and one biological function, that of serving as a signaling marker or flag.

The exemplary target protein, bamase is a ribonuclease produced exclusively by the bacterium Bacillus a77Zyloliquegaciens. Bamase has one catalytic domain that is functionally cytotoxic to all mammalian cell types.

FIG. 2A is a schematic illustration a molecule of human ubiquitin. The sheet- like arrows and ribbons in FIG. 2A represent beta strands and alpha helices, respectively. The noodle-like strands in the protein are loops and turns of the ubiqutin molecule. In FIG. 2A, the N-C terminal length from ubiquitin is about 38 A in its folded conformation, as indicated by the double-headed arrow bearing the bearing the legend"38 A.

FIG. 2B is a schematic illustration a barnase molecule. The sheet-like arrows and ribbons in FIG. 2A represent beta strands and alpha helices, respectively. The noodle-like strands in the protein are loops and turns of the barnase molecule. The barnase molecule has a surface loop in which the Ca-Ca length, measured from

alpha-carbon of the loop amino acid proline in the number 64 position ("Pro64") to the alpha-carbon of the loop amino acid threonin in the number 70 position ("Thr70") is about 10.4 A, with the barnase molecule in its folded conformation, as indicated by the double-headed arrow bearing the legend"10.4 A."Theasteriskin FIG. 2B represents the point at which the ubiquitin molecule, shown in FIG. 2A is inserted between amino acid residue 66 and amino acid residue 67 of barnase.

The exemplary insert protein ubiquitin and the exemplary target protein barnase satisfy the novel structural design criterion that the N-C terminal length of the exemplary insert protein be at least twice the Ca-Ca length of the exemplary target protein surface loop selected for insertion. In the exemplary fusion protein resulting from the insertion of the exemplary insert protein ubiquitin molecule into the surface loop of the exemplary target protein barnase, the foregoing structural design criterion is satisfied.

Consequently, the regulatory domain of ubiquitin and the catalytic domain of barnase cannot simultaneously co-exist in their folded states ; and, the regulatory domain of ubiquitin, may be used to regulate the cytotoxic activity of barnase.

Moreover, the ubiquitin and barnase domains participate in a cooperative and reversible conformational equilibrium, that may be influenced and controlled by a variety of controllable effector signals such as, for example, ligand binding, pH, temperature, chemical denaturants, or the presence of stabilizing or destabilizing mutations in either the barnase or ubiquitin domains.

The ubiquitin and barnase genes are created by annealing and ligating synthetic oligonucleotides (Integrated DNA Technologies) according to standard protocols.

The ubiquitin-barnase fusion gene is made by first adding an exemplary five amino acid linker (Gly-Thr-Gly-Gly-Ser) between the Lys66 and Ser67 codons of the barnase gene. The inserted DNA contains exemplary KpnI and BamHI restriction sites that are used to introduce the ubiquitin gene.

The exemplary five amino acids of the exemplary linker individually serve as short, flexible linkers at the points of attachment. The ubiquitin gene is inserted between the Thr and Gly codons of the linker.

All genes are fully sequenced to verify their integrity.

An interim ubiquitin-barnase fusion expression plasmid pETMT is created by using exemplary NdeI and XhoI enzymes to insert the ubiquitin-barnase fusion gene into a plasmid, such as, for example, a pET25b (+) plasmid (Novagen), or any other T7 promotor-containing plasmid that also confers resistance to an antibiotic other than ampicillin.

In order to make the plasmid stable in E. coli, the gene for barstar, the intracellular inhibitor of bamase that is coexpressed with barnase by Bacillus amyloliquegacie7Zs (together with its natural promoter from Bacillus amyloliquefaciens), is cleaved out of an exemplary pMT1002 plasmid (gift of Dr. Y.

Bai, National Institutes of Health), or any other T7 promotor-containing plasmid that also confers resistance to an antibiotic other than ampicillin, with ClaI and PstI enzymes. The barstar gene is then placed between Clal and PstI restriction sites on the pETMT plasmid (prior to this step, these sites are introduced using the QuikChange mutagenesis kit (Strategene)).

In order to obtain milligram quantities of the ubiquitin-barnase fusion protein, it is necessary to increase cellular levels of barstar and purify the inactive ubiquitin- barnase fusion-barstar complex. Accordingly, the barstar gene is cloned into an

exemplary pET41 plasmid (Novagen), thereby placing it under control of a T7 promoter and conferring upon the transformed cells resistance to kanamycin or any other antibiotic other than ampicillin.

E. coli BL21 (DE3) cells are transformed with both plasmids, grown in a temperature range between about 20 degrees C and 37 degrees C in exemplary Luria- Bertani medium containing ampicillin and kanamycin to OD600 = 1.0, and induced with 100 mg/L IPTG. Bacteria are harvested about 2 tol2 hours later by centrifugation.

Cells are lysed in about 10 mM sodium phosphate (pH 7.5) by repeated freeze- thaw cycles in the presence of a small amount of lysozyme at a concentration of about lysozyme is 10 mg/liter. Exemplary DNase I (Sigma) at a concentration of about 10 mg/liter is then added to reduce viscosity, and the solution is centrifuged to remove insolubles. 8 M urea is added to the supernatant to dissociate bound barstar, which is subsequently removed by passing the solution through DE52 resin (Whatman) or a substantially equivalent anion exchange chromatography resin. The solution is then loaded onto a HiTrap heparin column (Amersham-Pharmacia) or substantially equivalent cation exchange column, washed with 10 mM sodium phosphate (pH 7.5) and 6 M urea, and eluted with a 0-0.2 M NaCl gradient.

Western blot analysis using anti-ubiquitin antibodies is used to show that the major impurities are truncated ubiquitin-barnase fusion protein products in which the ubiquitin domain, which is unfolded in the ubiquitin-bamase fusion protein-barstar complex, is partially digested. These proteins, however, elute significantly later than the intact ubiquitin-barnase fusion protein in the NaCl gradient. The urea is removed by dialysis against double-distilled water, to yield barnase-ubiquitin fusion protein

that is approximately 95% pure as judged by sodium dodecyl sulfate polyacrylamide gel electrophoresis.

To confirm the switching mechanism of the model for the MEDF molecular switch as embodied in the exemplary ubiquitin-barnase fusion protein, and to characterized its structure, stability, and enzymatic function, experiments were perfonned by the inventors herein upon the purified ubiquitin-barnase fusion protein, obtained as described hereinabove All experiments were performed in 10 mM potassium phosphate (pH 7.5), 0.1 M NaCl. The circular dichroism ("CD") spectra are shown in FIG. 3 as. a function of temperature. In FIG 3., circles and squares indicate 5 °C and 50 °C, respectively; other scans were recorded at 5 °C increments between these two limits. Below 20 °C, the barnase-ubiquitin fusion protein exhibits molar ellipticities and spectral features nearly identical to those of barnase. A particularly diagnostic characteristic of barnase that is also observed for the barnase-ubiquitin fusion protein is a minimum at 231 run attributed to Trp94. As temperature is increased, however, the spectrum shifts to one that strongly resembles ubiquitin. The 231 nm minimum disappears, and the position and molar ellipticity of the new minimum are consistent with native ubiquitin FIG. 4 shows a graph of the conversion a ubiquitin-barnase fusion protein from the bamase conformation to the ubiquitin conformation, monitored by ellipticity at 230 nm (open triangles, solid line) and barnase enzymatic activity (closed triangles, dashed line). Lines are for illustrative purpose only. Activities were determined by recording initial velocities in triplicate (2 mM ubiquitin-barnase fusion protein, 50 mM guanylyl (3-5) uridine 3-monophosphate ("GpUp") ) and are normalized to the largest value. As can be seen from FIG. 4, the transition is fully reversible and has a midpoint of about 30 degrees C. Further, it appears to be two-state ; an isodichroic

point near 223 nm is apparent, and plotting the transition at different wavelengths yields identical midpoints.

The temperature-induced transition from barnase to ubiquitin is consistent with the higher thermal stability of ubiquitin. Ubiquitin and barnase unfold with midpoints of about 100 degrees C (pH 7) and 55 degrees C (pH 6.3) respectively.

The possibility that the ubiquitin and barnase domains are simultaneously folded is ruled out by the observation that the molar ellipticity of the ubiquitin-barnase fusion protein never exceeds that of the individual proteins at any wavelength or temperature..

To further test the mutual exclusivity of ubiquitin and barnase domain folding, the inventors herein monitored urea-induced denaturation by Circular dichroism and Trp fluorescence. ubiquitin and barnase contain zero and three Trp residues, respectively. Fluorescence is therefore expected to report primarily on structural changes within the barnase domain, whereas Circular dichroism reports on both domains. FIG. 5 is a graph showing the denaturation of the barnase domain (10 degrees C, circles) and the ubiquitin domain (40 degrees C, squares) induced by urea and guanidine hydrochloride ("GdnHCl"), respectively. Data were collected by Circular dichroism at 230 nm (open symbols, black lines) or by Trp fluorescence at 320 nm (closed symbols, grey line). Lines represent best fits to the linear extrapolation equation. At 10 degrees C, the Circular dichroism and fluorescence curves reveal a single cooperative unfolding transition. It appears to be two-state; thermodynamic parameters obtained by fitting both data sets to the linear extrapolation equation are identical within error (DG = 4.1 0.2 kcalxmol-i, m = 2.2 0. 1 lccalxmol-i xM-i, Cm = 2.3 0.05 M). The fact that only one transition is apparent by Circular dichroism confirms that only one domain is folded. The

agreement between Circular dichroism and fluorescence curves indicates that this domain is barnase.

If the situation is reversed by raising temperature above 30 degrees C, addition of denaturant at 40 degrees C is predicted to generate an unfolding transition by Circular dichroism but no transition by fluorescence. 6 M urea failed to produce a change in either spectrum. Addition of the stronger denaturant guanidine hydrochloride, however, yields the expected Circular dichroism unfolding transition with the following thermodynamic parameters: DG = 8.5 0.1 kcalxmol-i, in = 2. 5 0.1 lccalxmol-ix M-i, Cm= 3.4 0.05 M, as shown in FIG. 5.

In contrast to molar ellipticity, fluorescence emission at 320 nm does not change significantly as a function of guanidine hydrochloride concentration, suggesting that the barnase Trp residues are solvent exposed at all denaturant concentrations. This conclusion is supported by the finding that the wavelength of maximum emission remains constant at 356 nm, the value for unfolded barnase. At 10 degrees C, this wavelength shifts from 340 nm in the absence of denaturant to 356 nm in 6 M urea. These data prove that: [i] folding of the barnase domain induces unfolding of the Ubiquitin domain, and vice versa; and, [ii] temperature provides an efficient switch between the two folded conformations of the ubiquitin-barnase fusion protein To determine how tightly folding of the ubiquitin domain is coupled to unfolding of the barnase domain, the inventors herein measured the enzymatic activity of the ubiquitin-barnase fusion protein as a function of temperature, using the substrate guanylyl (3'-5') uridine 3'-monophosphate. As shown in FIG. 4, loss of The barnase-ubiquitin fusion protein activity in the ubiquitin-barnase fusion protein

mirrors the structural conversion from barnase to ubiquitin, although the apparent midpoint of the former transition occurs at a higher temperature.

A likely reason is that substrate binding in the enzyme assays preferentially stabilizes the barnase domain. The complete loss of barnase activity above 50 degrees C, together with the molar ellipticity values shown in FIG. 3, provides strong evidence that ubiquitin domain folding induces complete unfolding of the barnase domain.

The preceding experimental results suggested to the inventors herein that temperature can be used to regulate cytotoxicity of the ubiquitin-barnase fusion protein in vivo. The inventors herein tested this prediction by transforming E. coli with a plasmid containing the ubiquitin-barnase fusion protein gene under control of an isopropyl b-D-thiogalactopyranoside ("IPTG") -induced T7 promoter. Like barnase, the ubiquitin-barnase fusion protein is extremely lethal. The plasmid was found to be unstable in all strains of E. coli, including those that do not harbor the T7 RNA polymerase gene. Very few transformants were consistently recovered, and in each case the ubiquitin-barnase fusion protein gene was found to contain frameshift or nonsense mutations in the bamase coding region.

To overcome this problem, the inventors herein inserted the barstar gene and its natural promoter from Bacillus amyloliquifaciens into the foregoing plasmid, thereby creating a pETMT plasmid that is stable in E. coli.

As shown in Table 1 hereinbelow, wherein figures are the averages with standard deviations are obtained from five plates, far fewer colonies were obtained when plates were grown at 15 degrees C compared to 37 degrees C. Moreover, 36 mg/m L IPTG was sufficient to kill nearly all of the bacteria at 15 degrees C,

whereas 143 mg/mL IPTG was required to achieve a comparable result at 37 degrees C.

TABLE 1 Cytotoxicity of The barnase-ubiquitin fusion protein as a function of temperature and isopropyl P-D-thiogalactopyranoside concentration Number of colonies at indicated concentration of IPTG T (°C) 0 mg/mL 9 mg/mL 18 mg/mL 36 mg/mL 72 mg/mL 143 mg/mL 15 14 ~ 5 35 ~ 18 39 ~ 4 0.3 ~ 0.5 0 ~ 0 0 ~ 0 37 160 ~ 40 250 ~ 60 257 ~ 40 274 ~ 30 12 4 0. 60 1

These findings reflect the temperature-induced conformational shift from the bamase-form of the ubiquitin-barnase fusion protein to the ubiquitin form. Further, they demonstrate that the ubiquitin-barnase fusion protein/E. coli system is highly responsive to the relative stabilities of the two domains within the temperature range of bacterial growth.

"Natively-unfolded"proteins contain subunits that are unstructured or partially structured in physiological conditions. The free energy of folding is provided by ligand binding interactions, and the resulting conformational change is used to modulate function of distant domains. The ubiquitin-barnase fusion protein captures this property as well. When barstar is added to the ubiquitin to form of ubiquitin- barnase fusion protein at 40 degrees C, a conformational change is observed. FIG. 6

is a graph of a circular dichroism spectral recording at 40 degrees C, showing barstar- induced folding of the barnase domain and unfolding of the ubiquitin domain.. Dotted lines represent 10 mM The barnase-ubiquitin fusion protein in the absence and presence of 12.5 mM. barstar, respectively. Black lines were generated by subtracting the circular dichroism spectra of free barstar The Circular dichroism spectrum of the ubiquitin-barnase fusion protein- barstar complex, after subtracting the spectrum of free barstar, resembles that of the ubiquitin-barnase fusion protein at 15 degrees C. FIG. 7 is a graph of a circular dichroism spectral recording at 15 degrees C, showing barstar-induced folding of the barnase domain and unfolding of the ubiquitin domain.. Dotted lines represent 10 mM The barnase-ubiquitin fusion protein in the absence and presence of 12.5 mM barstar, respectively. Black lines were generated by subtracting the circular dichroism spectra of free barstar. The two are not identical, apparently because of minor structural differences between the free and bound states of barnase and/or barstar.

However, Circular dichroism spectra of the ubiquitin-barnase fusion protein-barstar complexes at 40 °C and 15 °C are virtually indistinguishable. This result signifies that ubiquitin domain.

Accordingly, the inventors herein have demonstrated that the folding free energy of one protein domain can be used to drive unfolding of another. Because folding is reversible and inherently cooperative, this mechanism constitutes an efficient and responsive model for a mutually exclusive domain folding molecular switch for, inter alia, coupling confonnational change to protein function. The cytotoxic activity of barnase in the present invention provides the basis for several novel applications. For example, stability-enhanced ubiquitin variants can be rapidly

identified from combinatorial libraries by their ability to allow E. coli to survive at temperatures less than 30 degrees C in the presence of 36 mg/mL IPTG.

A major advantage that this model affords over other directed evolution techniques, such as, for example, phage display, is that the entire selection takes place inside a living bacterium, and stabilizing mutations are sorted from destabilizing mutations in the most efficient and decisive mazer possible. This model is general and can be applied to other proteins, including those too large to be expressed on the phage surface. If necessary, known stabilizing or destabilizing mutations can be introduced into the barnase domain in order to make the target protein optimally responsive to the inserted protein. Additionally, the model can be modified to generate a class of cytotoxic molecules with novel sensor capabilities. For example, the model allows ribonuclease activity to be turned on and off by ligand binding to an engineered regulatory domain. Ligand binding domains from any one of a large number of proteins can perform this function as long as they meet the structural design novel described hereinabove. This invention forms the basis for developing cytotoxic proteins that are activated by a wide variety of cell-specific effector molecules, and can thus target cancerous or virally infected cells for destruction.