1 00:00:00 --> 00:00:05 So our next class of biomolecule that we're going to talk about are 2 00:00:05 --> 00:00:10 nucleic acids. And we can, for the most part, 3 00:00:10 --> 00:00:15 describe their properties by considering just covalent bonds and 4 00:00:15 --> 00:00:20 hydrogen bonds. Although, that's a bit of an 5 00:00:20 --> 00:00:25 oversimplification. But, anyway, these are, 6 00:00:25 --> 00:00:32 again, polymers. So this is DNA and RNA, 7 00:00:32 --> 00:00:40 terms you've undoubtedly heard. And these are made by splitting out 8 00:00:40 --> 00:00:48 water. And, in this case, the monomeric units are given the 9 00:00:48 --> 00:00:56 special term nucleotide. And a nucleotide consists of a 10 00:00:56 --> 00:01:05 sugar with something called a base on it. 11 00:01:05 --> 00:01:12 It's got a phosphate group at one end and a hydroxyl group, 12 00:01:12 --> 00:01:19 one of the sugar hydroxyls that we saw the other day at the other end. 13 00:01:19 --> 00:01:30 The B stands for base. 14 00:01:30 --> 00:01:34 And the way the bond is formed, as I said, is by splitting out water 15 00:01:34 --> 00:01:39 like that to form what's known as a phosphodiester bond. 16 00:01:39 --> 00:01:43 And we'll be talking a lot about those when we talk about DNA and RNA 17 00:01:43 --> 00:01:48 in more detail later on in the course. The sugars are pentoses 18 00:01:48 --> 00:01:53 where N equals 5. We were talking about these the 19 00:01:53 --> 00:01:57 other day. The base goes in this position. That's the 1 position 20 00:01:57 --> 00:02:03 of the carbon. This is the 5 position of the carbon. 21 00:02:03 --> 00:02:10 And this is where the phosphate is located. This sugar is called 22 00:02:10 --> 00:02:17 ribose. And then RNA, which is the polymer of nucleotides 23 00:02:17 --> 00:02:24 that have ribose as the sugar is ribonucleic acid or RNA, 24 00:02:24 --> 00:02:31 as you've known it. If this hydroxyl here is replaced by a 25 00:02:31 --> 00:02:38 hydrogen and the rest of it's the same -- 26 00:02:38 --> 00:02:47 -- this is deoxyribonucleoside. 27 00:02:47 --> 00:02:55 And if you polymerize that together then you get DNA or deoxyribonucleic 28 00:02:55 --> 00:03:01 acid. The bases come in two flavors. 29 00:03:01 --> 00:03:05 And this will be on your handout. Ones that either have two rings, 30 00:03:05 --> 00:03:10 adenine or guanine. And the general term of those is 31 00:03:10 --> 00:03:14 purine or they have one ring of pyrimidine. And in DNA one finds 32 00:03:14 --> 00:03:19 cytidine and thiamine abbreviated as C and T. Or in RNA, 33 00:03:19 --> 00:03:24 instead of finding thiamine you find uracil, which is the same except 34 00:03:24 --> 00:03:28 that it doesn't have the methyl group that's present at this 35 00:03:28 --> 00:03:33 position on thiamine. And the important thing about these 36 00:03:33 --> 00:03:38 particular nucleotide bases is that they can form hydrogen bonds in a 37 00:03:38 --> 00:03:43 very special way. It's diagramed on here that this is 38 00:03:43 --> 00:03:48 a guanine pairing with the cytidine, so a G pairing with a C. And you 39 00:03:48 --> 00:03:53 can form three hydrogen bonds. Or between an A, an adenine and a 40 00:03:53 --> 00:03:59 thiamine you can form two hydrogen bonds. 41 00:03:59 --> 00:04:03 Those are just the things we were diagramming on the board the other 42 00:04:03 --> 00:04:08 day. And those are the forces that hold the strands of DNA together so 43 00:04:08 --> 00:04:12 that DNA is the double helix, as you know. It's basically a 44 00:04:12 --> 00:04:17 backbone with sugars and phosphates. And then there'll be some sequence 45 00:04:17 --> 00:04:22 of bases down this. And then on the other strand you'll 46 00:04:22 --> 00:04:27 have the base that can form hydrogen bonds with this. 47 00:04:27 --> 00:04:30 So C, there would be three hydrogen bonds here. This would be a G on 48 00:04:30 --> 00:04:34 this side, again, three hydrogen bonds. 49 00:04:34 --> 00:04:38 If there's an A here that will be a T there, two hydrogen bonds. 50 00:04:38 --> 00:04:42 And so on down. And we'll talk about the implications of this later 51 00:04:42 --> 00:04:46 in the course when we talk about DNA replication, but for the moment I 52 00:04:46 --> 00:04:50 think your eye can see, you can probably see that the 53 00:04:50 --> 00:04:54 geometric arrangement of these is just exactly the same, 54 00:04:54 --> 00:04:58 whether it's a G-C or it's an A-T base pair. You can superimpose them, 55 00:04:58 --> 00:05:02 and they have just exactly the same molecular structure. 56 00:05:02 --> 00:05:05 And that's really crucial for a lot of things having to do with DNA. 57 00:05:05 --> 00:05:09 So, as you know, it's not just sort of a ladder with hydrogen bonds. 58 00:05:09 --> 00:05:13 It's twisted in 3-dimensional space. That's the double helix. 59 00:05:13 --> 00:05:16 And in that little movie I showed you the nitrogen atoms and the bases 60 00:05:16 --> 00:05:20 are blue so you can pretty much pick it out that there's a series of 61 00:05:20 --> 00:05:24 hydrogen bonds going right down the middle of a DNA molecule with the 62 00:05:24 --> 00:05:28 phosphoribose backbone on the outside. 63 00:05:28 --> 00:05:32 So every one of your cells, since you have about 3 billion base 64 00:05:32 --> 00:05:36 pairs you have two point something times that many hydrogen bonds 65 00:05:36 --> 00:05:40 holding your DNA together. The thing to remember about the 66 00:05:40 --> 00:05:45 strength of the hydrogen bond, it's about a twentieth a covalent 67 00:05:45 --> 00:05:49 bond, and so you're able to pull those things apart and then put them 68 00:05:49 --> 00:05:53 back together at physiological temperature while leaving all the 69 00:05:53 --> 00:05:57 covalent bonds that make up each strand of the DNA leaving 70 00:05:57 --> 00:06:02 those intact. It's also possible, 71 00:06:02 --> 00:06:06 since RNAs are usually single-stranded, 72 00:06:06 --> 00:06:10 that if you have a little sequence here that has the complimentary 73 00:06:10 --> 00:06:14 sequence over there then these can pair like this forming a hairpin or 74 00:06:14 --> 00:06:18 some little structure like that. And, again, we'll talk about 75 00:06:18 --> 00:06:22 transfer RNAs which play a really key role in protein synthesis. 76 00:06:22 --> 00:06:26 They're the little translators that go back and forth between the 77 00:06:26 --> 00:06:30 nucleic acid code, the genetic code and the protein 78 00:06:30 --> 00:06:35 code, which is written in amino acids. 79 00:06:35 --> 00:06:39 And this just shows making an RNA copy for a tRNA gene from the DNA, 80 00:06:39 --> 00:06:43 but then these are the relationships between the complimentary sequences 81 00:06:43 --> 00:06:47 right in that strand. So this thing is able to fold up 82 00:06:47 --> 00:06:51 into a sort of cloverleaf structure that some of you have certainly 83 00:06:51 --> 00:06:55 probably seen at some point. It's a little bit twisted here 84 00:06:55 --> 00:06:59 because you can see how the complimentary sequences 85 00:06:59 --> 00:07:02 have found each other. And even though this is just a 86 00:07:02 --> 00:07:06 single strand of RNA, by forming hydrogen bonds to 87 00:07:06 --> 00:07:09 complimentary sequences within itself it can take up a structure. 88 00:07:09 --> 00:07:12 And I'll show you. It actually goes on, there are some other forces 89 00:07:12 --> 00:07:16 that come in. And this will fold up into a 3-dimensional structure that 90 00:07:16 --> 00:07:19 goes even beyond what I've shown you, but we won't need to talk about that 91 00:07:19 --> 00:07:23 for just a little bit. OK. So then the next -- 92 00:07:23 --> 00:07:32 -- class of molecules that we're 93 00:07:32 --> 00:07:36 going to spend a lot of this course on are proteins. And these 94 00:07:36 --> 00:07:46 are polymers again. 95 00:07:46 --> 00:07:52 -- made by splitting out water. So that's been true of 96 00:07:52 --> 00:07:58 polysaccharides. It's true of nucleic acids. 97 00:07:58 --> 00:08:04 It's true of proteins. In this case the monomers are structures 98 00:08:04 --> 00:08:10 known as amino acids. And they have an amino group. 99 00:08:10 --> 00:08:17 And then it's joined to a carbon known as the alpha carbon. 100 00:08:17 --> 00:08:24 And then there's a carboxyl group. So this is why they're called amino 101 00:08:24 --> 00:08:32 acids, because their carboxyl group is an acid. 102 00:08:32 --> 00:08:39 And the way they form -- 103 00:08:39 --> 00:08:45 We'll give these different side chains here. I'll tell you about 104 00:08:45 --> 00:08:52 these side chains in just a minute. The way these form a bond is by 105 00:08:52 --> 00:08:59 splitting out water here. And then this will give this very 106 00:08:59 --> 00:09:12 important bond in nature -- 107 00:09:12 --> 00:09:16 -- which is known as the peptide bond. And there's a chemical 108 00:09:16 --> 00:09:21 property of this that's important. Someone was bemoaning the fact that 109 00:09:21 --> 00:09:25 I had to go over a bunch of chemistry and they hadn't liked 5. 110 00:09:25 --> 00:09:29 11. My apologies. But we won't be spending all course 111 00:09:29 --> 00:09:32 doing chemistry. But if you want to understand how 112 00:09:32 --> 00:09:36 these things work you do need to understand some of the chemical 113 00:09:36 --> 00:09:39 principles to understand them. And this is a case where it's 114 00:09:39 --> 00:09:42 really important because, although it's written this way with 115 00:09:42 --> 00:09:45 the double bond here and a single bond there, this double bond 116 00:09:45 --> 00:09:48 actually sort of spends part of its time over here. 117 00:09:48 --> 00:09:52 So this is actually sort of a partial double bond. 118 00:09:52 --> 00:09:55 And that has an important consequence because if you're a 119 00:09:55 --> 00:09:58 single bond, if you remember a single bond can bend and stretch but 120 00:09:58 --> 00:10:02 it can also rotate. But if you're a double bond you 121 00:10:02 --> 00:10:06 cannot rotate. So the peptide bond, 122 00:10:06 --> 00:10:10 and you make a lot of these when you're polymerizing amino acids 123 00:10:10 --> 00:10:14 together to make proteins, those bonds have a very special 124 00:10:14 --> 00:10:18 character that they cannot rotate. Now, let me say, I'll come back and 125 00:10:18 --> 00:10:22 show you why that's important in just a moment. 126 00:10:22 --> 00:10:26 But let me just say a word about the side chains. 127 00:10:26 --> 00:10:29 There are 20 different amino acids. And they have side chains that have 128 00:10:29 --> 00:10:33 very different chemical properties. And when we start thinking about how 129 00:10:33 --> 00:10:37 a chain of amino acids take up the properties that make it into an 130 00:10:37 --> 00:10:41 enzyme or part of a motor or a structural protein or into your 131 00:10:41 --> 00:10:44 finger nails or your hair or skin, they have to have very special 132 00:10:44 --> 00:10:48 properties. And it's the sequence of these different amino acids with 133 00:10:48 --> 00:10:52 their different chemical properties that are eventually going to let 134 00:10:52 --> 00:10:56 each protein form up to one particular 3-dimensional structure 135 00:10:56 --> 00:11:00 that will give it its characteristics. 136 00:11:00 --> 00:11:05 So the different types of amino acids, and again you won't have to 137 00:11:05 --> 00:11:10 memorize these, but here they are up here. 138 00:11:10 --> 00:11:15 But let me just point out the important classes, 139 00:11:15 --> 00:11:20 because the thing you really want to do with this one is to remember the 140 00:11:20 --> 00:11:25 types of amino acids we find. There are negatively charged amino 141 00:11:25 --> 00:11:28 acids. An example of this would be 142 00:11:28 --> 00:11:28 aspartate. Under physiological conditions, although this is an acid, 143 00:11:28 --> 00:11:28 it will dissociate so it will have a negative charge. 144 00:11:28 --> 00:11:28 And that's abbreviated as A-S-P. Glutamine is another one that has a 145 00:11:28 --> 00:11:28 negative charge. There are also amino acids that 146 00:11:28 --> 00:11:29 have positive charges on the side chain. 147 00:11:29 --> 00:11:39 An example of this would be lysine which has four methylene groups. 148 00:11:39 --> 00:11:50 And then it has an amino group. But, again, under physiological 149 00:11:50 --> 00:12:00 conditions, around pH 7, that will be protonated so it will 150 00:12:00 --> 00:12:11 have a plus charge. And that is lysine or L-Y-S. 151 00:12:11 --> 00:12:22 And arginine and histidine are two other amino acids that have a 152 00:12:22 --> 00:12:33 positive charge, or can have a positive charge. 153 00:12:33 --> 00:12:38 Then there's a set of amino acids that have a polar character. 154 00:12:38 --> 00:12:43 They don't have a full charge. And, as you might guess, they have 155 00:12:43 --> 00:12:48 one of the bonds that we've talked about that are polar. 156 00:12:48 --> 00:12:56 This is serine. 157 00:12:56 --> 00:12:59 Serine. There's another one that has a hydroxyl that's 158 00:12:59 --> 00:13:02 known as threonine. And then there is a glutamine and 159 00:13:02 --> 00:13:06 asparagine, both of which have an N-H bond. So just through what I've 160 00:13:06 --> 00:13:09 told you here, we haven't even been through the set, 161 00:13:09 --> 00:13:13 you can see how you can begin to decorate an amino acid chain. 162 00:13:13 --> 00:13:16 So there's a plus charge and a minus charge, a polar charge. 163 00:13:16 --> 00:13:20 There's a tremendous amount of diversity because at every single 164 00:13:20 --> 00:13:23 thing you have a choice of 20 things you can put in. 165 00:13:23 --> 00:13:27 So they not only have size and shape characteristics, 166 00:13:27 --> 00:13:31 but they have particular charges and other properties. 167 00:13:31 --> 00:13:39 Then, as always, there are a bunch of special, 168 00:13:39 --> 00:13:47 oh, excuse me. Actually, before we do that, we have hydrophobic. 169 00:13:47 --> 00:13:52 Or you could think of these as 170 00:13:52 --> 00:13:56 greasy or water-hating. These are the ones that are sort of 171 00:13:56 --> 00:14:00 when I was talking about trying to dissolve butter into water. 172 00:14:00 --> 00:14:05 These are things that don't like to interact with water or cannot 173 00:14:05 --> 00:14:10 interact with water, and so they cannot form hydrogen 174 00:14:10 --> 00:14:15 bonds so you cannot get them to go under water easily. 175 00:14:15 --> 00:14:20 And they come from very simple ones that have just the methyl group 176 00:14:20 --> 00:14:25 which is alanine or A-L-A or one like this which would be CH2-CH with 177 00:14:25 --> 00:14:30 a couple of methyl groups. This is even more water-hating, 178 00:14:30 --> 00:14:34 that would be lucien, L-E-U. Or here's one that you probably 179 00:14:34 --> 00:14:36 could guess that really doesn't interact with water. 180 00:14:36 --> 00:14:39 This is phenylalanine or P-H-E. And you can see what this side 181 00:14:39 --> 00:14:42 chain is. It's a methylene group. And what's dangling off it but a 182 00:14:42 --> 00:14:44 benzene ring. And I think most of you remember from probably beginning 183 00:14:44 --> 00:14:47 chemistry that benzene is something that you cannot dissolve 184 00:14:47 --> 00:14:54 sugar in or something. It's an organic solvent. 185 00:14:54 --> 00:15:04 It will only dissolve things that have a very hydrophobic character. 186 00:15:04 --> 00:15:17 Then there are some special cases. 187 00:15:17 --> 00:15:21 Glycine is one, because in this case the side chain 188 00:15:21 --> 00:15:25 is simply a hydrogen atom. And, as a consequence to that, 189 00:15:25 --> 00:15:29 this is a very flexible amino acid. So if you want to build -- 190 00:15:29 --> 00:15:35 If nature wants to build a loop into a protein, it's going to undergo a 191 00:15:35 --> 00:15:41 tight turn. You often find glycines there because there's not a big side 192 00:15:41 --> 00:15:47 chain to get in the way if you're going to be bending the chain in 193 00:15:47 --> 00:15:53 3-dimensional space. Another one is cysteine, 194 00:15:53 --> 00:15:59 which looks like serine over there, but it has a thiol group instead of 195 00:15:59 --> 00:16:04 a hydroxyl group. And that's important because that 196 00:16:04 --> 00:16:09 allows for the formation of another special type of bond that if you 197 00:16:09 --> 00:16:15 have one chain of protein that has a cysteine on it and another 198 00:16:15 --> 00:16:20 polypeptide chain that has a cysteine on it and they're close 199 00:16:20 --> 00:16:25 together in space, what can happen is you can form a 200 00:16:25 --> 00:16:31 covalent bond between these under oxidative conditions. 201 00:16:31 --> 00:16:36 This is known as a disulfide bond. That's covalent. 202 00:16:36 --> 00:16:40 So those two chains, if that bond occurs, are now sort of 203 00:16:40 --> 00:16:44 semi-permanently locked together. They're locked together in a very, 204 00:16:44 --> 00:16:48 very strong way. So this is a feature of, this is the only 205 00:16:48 --> 00:16:53 intrastrand covalent bond that you'd characteristically find in proteins. 206 00:16:53 --> 00:16:57 All the rest of them we're going to show you, when they fold up in 207 00:16:57 --> 00:17:01 3-dimensional space, depend on other kinds of 208 00:17:01 --> 00:17:06 interactions. And finally there's one last case 209 00:17:06 --> 00:17:12 which is proline. And this one is a little different 210 00:17:12 --> 00:17:17 because in the amino acid the side chain bends around like this and 211 00:17:17 --> 00:17:23 joins here. So it's actually forming a little circle here between 212 00:17:23 --> 00:17:29 the nitrogen, the amino group and the carboxyl group. 213 00:17:29 --> 00:17:33 And the consequence of this is this bond cannot rotate. 214 00:17:33 --> 00:17:37 The bond that would normally be able to rotate is not able to do 215 00:17:37 --> 00:17:41 that. And so this is sort of a protein you find that when there are 216 00:17:41 --> 00:17:45 some of these regular structures, I'm going to show you in a minute, 217 00:17:45 --> 00:17:49 like helices and things, this protein won't, 218 00:17:49 --> 00:17:53 this amino acid particularly won't fit into those structures. 219 00:17:53 --> 00:17:57 So you tend to, if nature wants to interrupt a particular regular 220 00:17:57 --> 00:18:01 structure that's coming, it will often find a proline right 221 00:18:01 --> 00:18:07 at that particular point. OK. So what we've talked about up 222 00:18:07 --> 00:18:14 until now is sort of just the very, very basic piece of protein 223 00:18:14 --> 00:18:27 structure. 224 00:18:27 --> 00:18:34 It's what called the primary structure which is nothing more than 225 00:18:34 --> 00:18:41 the sequence of amino acids. However, here's a little piece of 226 00:18:41 --> 00:18:48 protein. This is polyalanine. And one thing you can sort of see 227 00:18:48 --> 00:18:56 is if I was trying to figure out how to fold this up into a 3-dimensional 228 00:18:56 --> 00:19:02 confirmation. And let's say this had 300 amino 229 00:19:02 --> 00:19:06 acids or something, there are essentially an infinite 230 00:19:06 --> 00:19:10 number of confirmations. And so one of the real holy grails 231 00:19:10 --> 00:19:14 still in biology is trying to understand if you see the linear 232 00:19:14 --> 00:19:18 sequence of an amino acid, which we can now deduce, excuse me, 233 00:19:18 --> 00:19:22 of a protein, of an amino. If you see the linear sequence of amino 234 00:19:22 --> 00:19:26 acids in a protein, and we can deduce those from 235 00:19:26 --> 00:19:30 analyzing genomes and so on, how do we go from a thing that says 236 00:19:30 --> 00:19:34 a tryptophan, a cysteine, a serine, a serine, a threonine, 237 00:19:34 --> 00:19:38 whatever down the chain to finding its 3-dimensional structure and 238 00:19:38 --> 00:19:43 ultimately its role? And you can sort of hopefully get a 239 00:19:43 --> 00:19:48 sense from this of why it's important. So there are levels this 240 00:19:48 --> 00:19:52 goes. The next level is what's known as secondary structure. 241 00:19:52 --> 00:19:57 These are regions of local secondary structure and they're 242 00:19:57 --> 00:20:01 determined by hydrogen bonds. And I'll show you how these go in 243 00:20:01 --> 00:20:05 just a second. And then you can think about 244 00:20:05 --> 00:20:09 proteins in the tertiary structure. So what we've done and sort of 245 00:20:09 --> 00:20:13 taken a chain and then found out how a little region might take up a 246 00:20:13 --> 00:20:17 particular, for example, here's a portion that's in a helix. 247 00:20:17 --> 00:20:21 This is fairly rigid right now because of the way it's held 248 00:20:21 --> 00:20:25 together. And we'll then find maybe another region like a beta sheet I'm 249 00:20:25 --> 00:20:29 going to show you in a minute. Ultimately we have to figure out how 250 00:20:29 --> 00:20:33 all these units fold up into a 3-dimensional structure. 251 00:20:33 --> 00:20:38 And what we get there is called the tertiary structure. 252 00:20:38 --> 00:20:42 And this has some other forces we're going to talk about besides 253 00:20:42 --> 00:20:47 covalent bonds and hydrogen bonds that determine that. 254 00:20:47 --> 00:20:51 And then, as I've tried to tell you, you can see that proteins play a lot 255 00:20:51 --> 00:20:56 of roles in nature and they're not all single proteins running around 256 00:20:56 --> 00:21:01 being an enzyme or something like that. 257 00:21:01 --> 00:21:05 Many of them are parts of machines so they're made to fit together in 258 00:21:05 --> 00:21:10 absolutely beautiful ways. Some of them have, at this point, 259 00:21:10 --> 00:21:14 fifty-hundred parts that all go together fitting shapes and 260 00:21:14 --> 00:21:19 interacting with these shapes on the principles that we'll be talking 261 00:21:19 --> 00:21:24 about here, the different forces that make things happen in nature. 262 00:21:24 --> 00:21:28 And so quaternary means the structure when there's more than one 263 00:21:28 --> 00:21:36 polypeptide chain. 264 00:21:36 --> 00:21:41 So getting a handle on protein structure was kind of a very 265 00:21:41 --> 00:21:46 important intractable problem for a long time because it was just too 266 00:21:46 --> 00:21:51 hard a nut to crack, but in the 1930s and 1940s x-ray 267 00:21:51 --> 00:21:57 crystallography started to come into usage where basically you'd bounce 268 00:21:57 --> 00:22:01 x-rays off of a crystal. And then they would refract and 269 00:22:01 --> 00:22:05 you'd see characteristic reflections. And you could work backwards to 270 00:22:05 --> 00:22:09 figure out what the structure of the crystal was. This had been applied 271 00:22:09 --> 00:22:13 to minerals and a lot of structure, but it hadn't been applied to 272 00:22:13 --> 00:22:17 proteins. When people started to look they found there were certain 273 00:22:17 --> 00:22:21 proteins that gave characteristic reflections. Carotin, 274 00:22:21 --> 00:22:25 for example. Your hair gives a characteristic reflection around 5. 275 00:22:25 --> 00:22:29 angstroms. So that suggested that there was a repeating unit somewhere 276 00:22:29 --> 00:22:33 in carotin that had this. And, again, with artificial peptides 277 00:22:33 --> 00:22:37 sometimes they were able to see these reflections. 278 00:22:37 --> 00:22:42 And so that was where things stood for a while. And then one of these 279 00:22:42 --> 00:22:46 secondary structures, a very, very important one known as 280 00:22:46 --> 00:22:51 the alpha helix was deduced by Linus Pauling. Some of you have heard of 281 00:22:51 --> 00:22:55 him. He was a famous chemistry at Caltech. He got the Nobel prize. 282 00:22:55 --> 00:23:00 He also got famous later in his career because he championed the use 283 00:23:00 --> 00:23:04 of vitamin C to cure every ill known to mankind, including 284 00:23:04 --> 00:23:09 the common cold. Although there's some merit to what 285 00:23:09 --> 00:23:14 Linus stated, he probably overstated some of those later findings, 286 00:23:14 --> 00:23:19 but his contributions to the underlying chemistry and 287 00:23:19 --> 00:23:24 biochemistry of proteins was amazing. And he was the one that figured out 288 00:23:24 --> 00:23:29 the structure that explained the 5. angstrom repeat. 289 00:23:29 --> 00:23:33 And it was kind of an interesting story. He was in Oxford, 290 00:23:33 --> 00:23:38 England. And he got sick. I think it was some time in the 291 00:23:38 --> 00:23:42 winter. And he got bored reading detective books after a while so he 292 00:23:42 --> 00:23:47 thought he'd try and figure out the structure of proteins that gave rise 293 00:23:47 --> 00:23:51 to this characteristic repeat. So he made a simplifying assumption. 294 00:23:51 --> 00:23:56 He decided he'd forget all the side chains and just focus on this 295 00:23:56 --> 00:24:01 peptide backbone just with the peptide bond. 296 00:24:01 --> 00:24:06 And he was a chemist. And he knew, what I just told you, 297 00:24:06 --> 00:24:11 that this had a partial double bond character so it couldn't rotate. 298 00:24:11 --> 00:24:16 And he reasoned that this was held together by, since these things 299 00:24:16 --> 00:24:21 could form hydrogen bonds that this was probably forming a hydrogen bond 300 00:24:21 --> 00:24:26 with a carboxyl group of some other amino acid and this was probably 301 00:24:26 --> 00:24:31 forming a hydrogen bond with an amino group of a different 302 00:24:31 --> 00:24:40 amino acid. And so what he did was he made a 303 00:24:40 --> 00:24:53 sort of chain like this and he started to pleat it at the alpha 304 00:24:53 --> 00:25:06 carbon, which is the one that has the side chain on it, 305 00:25:06 --> 00:25:19 and was finding to trying to structure that would let him do this. 306 00:25:19 --> 00:25:32 And basically what he found was that if he made a helix that looks 307 00:25:32 --> 00:25:45 something like this, right-handed helix, and he could get 308 00:25:45 --> 00:25:58 a repeat structure that allowed him to form a hydrogen bond. 309 00:25:58 --> 00:25:53 And the repeating unit was 5. angstroms and 3.7, excuse me, 310 00:25:53 --> 00:25:49 amino acids per turn. And it's a right-handed helix. 311 00:25:49 --> 00:25:44 It's the same sort of thing if you're trying to turn in a screw. 312 00:25:44 --> 00:25:40 It's got that kind of structure. And this shows you a little movie of 313 00:25:40 --> 00:25:35 an alpha helix. You can see this is just showing 314 00:25:35 --> 00:25:31 the backbone. So this is the part you can look right down 315 00:25:31 --> 00:25:31 the end of it. See how you can look right though? 316 00:25:31 --> 00:25:36 And you can see how the hydrogen bonds are formed by turning this 317 00:25:36 --> 00:25:40 thing into this regular structure. And the neat thing about this then 318 00:25:40 --> 00:25:45 is if you put on the side chains, and you can put them on in any order, 319 00:25:45 --> 00:25:50 you can build a tremendous amount of diversity even within that helical 320 00:25:50 --> 00:25:55 structure. I think I can stop this. I just want to show you one thing, 321 00:25:55 --> 00:26:00 if I can manage to this when it comes around again. Stop it there. 322 00:26:00 --> 00:26:03 One of the things you can see, now we're looking down the helix. 323 00:26:03 --> 00:26:06 And although you won't recognize the structures of all the amino 324 00:26:06 --> 00:26:10 acids right away, you may be able to see in this 325 00:26:10 --> 00:26:13 particular one. Here are a couple of aromatic rings 326 00:26:13 --> 00:26:17 off on this side. So this side of the helix wouldn't 327 00:26:17 --> 00:26:20 like to see water, and over here are a bunch of charged 328 00:26:20 --> 00:26:24 and polar amino acids. So you could see how you could 329 00:26:24 --> 00:26:27 build into a helix like that, a surface, one part that wouldn't 330 00:26:27 --> 00:26:31 like to interact with water and another part that would. 331 00:26:31 --> 00:26:35 So that was an extremely important contribution. And there are alpha 332 00:26:35 --> 00:26:40 helices in almost all proteins. They'll be in little chunks coming 333 00:26:40 --> 00:26:45 down an amino acids chain. But they'll take up that structure. 334 00:26:45 --> 00:26:50 And, as you can see, it's driven by these hydrogen bonds that we've been 335 00:26:50 --> 00:26:54 talking about. There turns out then to be a second 336 00:26:54 --> 00:26:59 type of secondary structure that's important. It's called 337 00:26:59 --> 00:27:05 a beta sheet. And in this case you can either line 338 00:27:05 --> 00:27:11 up two polypeptide chains running in the same orientation, 339 00:27:11 --> 00:27:17 amino to carboxyl, amino to carboxyl, or you can run them in opposite 340 00:27:17 --> 00:27:23 orientations, amino to carboxyl, amino to carboxyl in the other way. 341 00:27:23 --> 00:27:29 The latter one is called an anti-parallel beta sheet. 342 00:27:29 --> 00:27:34 And if you line things up this way you'll see you can find hydrogen 343 00:27:34 --> 00:27:39 bonds between the chains like that. So this allows two things to form 344 00:27:39 --> 00:27:44 in this way and gives a sort of sheet-like structure. 345 00:27:44 --> 00:27:49 Whereas, that alpha helix has this tight coil like this. 346 00:27:49 --> 00:27:54 So over here I think we have a movie of a beta sheet. 347 00:27:54 --> 00:27:59 And you'll see again you can build up more than one. 348 00:27:59 --> 00:28:03 Because if you look up here you can see how you are all set up to form 349 00:28:03 --> 00:28:08 more hydrogen bonds out in that kind of way. And, as I said, 350 00:28:08 --> 00:28:13 you can do this same trick putting the polypeptide chains so they have 351 00:28:13 --> 00:28:17 the same polarity. And so you can approximate, 352 00:28:17 --> 00:28:22 look at the structure of most proteins then by depicting them 353 00:28:22 --> 00:28:27 either as alpha helices, which you'll see in these diagrams. 354 00:28:27 --> 00:28:32 You've already seen a few in the examples given. 355 00:28:32 --> 00:28:36 They'll look like this. Or a beta sheet which are indicated 356 00:28:36 --> 00:28:40 as these flat arrows. So here's a little piece of a 357 00:28:40 --> 00:28:44 protein made up of alpha helix, these beta sheets. What this is, 358 00:28:44 --> 00:28:49 actually, is a piece of the BRCA-1 gene. That's the familial 359 00:28:49 --> 00:28:53 susceptibility to breast cancer. The gene that causes that is called 360 00:28:53 --> 00:28:57 BRCA-1. And it has a special interaction domain called 361 00:28:57 --> 00:29:01 the BRC T domain. This is the structure. 362 00:29:01 --> 00:29:05 And the only point I'm trying to make, it's of a protein that's 363 00:29:05 --> 00:29:08 involved in preventing you from getting breast cancer. 364 00:29:08 --> 00:29:12 If you get a mutation in it, or particularly in this region, 365 00:29:12 --> 00:29:15 for example, you can end up with an increased susceptibility to breast 366 00:29:15 --> 00:29:18 cancer. But what is it? It's an alpha helices beta sheet. 367 00:29:18 --> 00:29:22 There's green fluorescent protein. You've seen that a few times. 368 00:29:22 --> 00:29:25 Maybe now you'll recognize it's mostly made of beta sheets. 369 00:29:25 --> 00:29:29 There's a little bit of alpha helix down there, a little 370 00:29:29 --> 00:29:32 bit right there. And that has the property that we've 371 00:29:32 --> 00:29:36 talked about of fluorescing. This is a protein I'll tell you 372 00:29:36 --> 00:29:40 later on that recognizes mismatches in DNA, and you get a susceptibility 373 00:29:40 --> 00:29:44 to cancer if it breaks. The only thing you notice here are 374 00:29:44 --> 00:29:47 a lot of alpha helices in it. And hopefully already your eye can 375 00:29:47 --> 00:29:51 begin to pick these out. This is an enzyme. What it does is 376 00:29:51 --> 00:29:55 it's got a catalytic ability to cleave other polypeptide chains. 377 00:29:55 --> 00:29:59 The functions of these don't matter. But you can see once again alpha 378 00:29:59 --> 00:30:03 helices beta sheets. Here's another one. 379 00:30:03 --> 00:30:07 It looks just about the same, alpha helices, beta sheets, except 380 00:30:07 --> 00:30:12 in this case this is the human gene known as, the protein encoded by 381 00:30:12 --> 00:30:16 human genes called RAS. That's an oncogene. That's a gene 382 00:30:16 --> 00:30:21 that if it mutates in a particular way will cause the cell that has 383 00:30:21 --> 00:30:26 that to move a step down the pathway to cancer. So what I've done is put 384 00:30:26 --> 00:30:30 up a whole lot of structures that have some alpha helices, 385 00:30:30 --> 00:30:35 some beta sheets. But you can get the idea that you 386 00:30:35 --> 00:30:40 can get very, very different biological activities from just 387 00:30:40 --> 00:30:45 depending on how you arrange those. OK. So there are a couple of other 388 00:30:45 --> 00:30:50 then forces that I need to tell you about if we're going to go all the 389 00:30:50 --> 00:30:55 way to understanding the 3-dimensional structure of proteins. 390 00:30:55 --> 00:31:00 What we can get to from that is alpha helices beta sheets. 391 00:31:00 --> 00:31:04 But you saw there were loops, there were other interactions that I 392 00:31:04 --> 00:31:09 haven't accounted for in showing you those 3-dimensional structures. 393 00:31:09 --> 00:31:14 So one of them are ionic bonds. This is the third class of force. 394 00:31:14 --> 00:31:19 This is an extreme case of electron sharing where one atom gets all of 395 00:31:19 --> 00:31:24 the electrons. So aspartate, which I had up on the 396 00:31:24 --> 00:31:28 board, aspartic acid looks like that, but under physiological conditions 397 00:31:28 --> 00:31:33 the oxygen will get all the electrons and you'll have 398 00:31:33 --> 00:31:39 a hydrogen on it. And a consequence of that then is 399 00:31:39 --> 00:31:45 that if you have a polypeptide chain that over here has an aspartate and 400 00:31:45 --> 00:31:52 over here has a lysine, which is the four methylene groups, 401 00:31:52 --> 00:31:59 and the positively charged thing here, you can get an ionic bond 402 00:31:59 --> 00:32:05 between those two amino acids that can be very far apart on 403 00:32:05 --> 00:32:12 the polypeptide chain. There may be a lot of amino acids in 404 00:32:12 --> 00:32:18 between, but what they then do is bring these two points together and 405 00:32:18 --> 00:32:24 hold them like that. The next class of force is kind of 406 00:32:24 --> 00:32:30 tricky. You may have heard of it in chemistry. It's referred to as van 407 00:32:30 --> 00:32:34 der Waals interaction. And the basis of this, 408 00:32:34 --> 00:32:38 without going into it too much, is even a nonpolar bond -- 409 00:32:38 --> 00:32:48 -- can have a transient polarity. 410 00:32:48 --> 00:33:00 And this then induces -- 411 00:33:00 --> 00:33:05 -- a transient polarity in a nearby bond. And it has to be a really 412 00:33:05 --> 00:33:11 nearby bond. So about 0. to 0.4 nanometers. Remember, 413 00:33:11 --> 00:33:17 covalent bonds are roughly half that distance or something. 414 00:33:17 --> 00:33:23 So it's got to be a very, very close interaction. It's weak. 415 00:33:23 --> 00:33:29 It's only one-third to one-quarter of a hydrogen bond, 416 00:33:29 --> 00:33:35 which you may recall is about one-twentieth of a covalent bond. 417 00:33:35 --> 00:33:38 But there can be many, many of them if the surfaces fit 418 00:33:38 --> 00:33:42 together really, really tightly. So if you have a 419 00:33:42 --> 00:33:45 protein fold, so there's a surface here, and then it folds up in such a 420 00:33:45 --> 00:33:49 way that there's a surface here, then you can get a lot of van der 421 00:33:49 --> 00:33:52 Waals interactions down here. Now, I've never had a really good 422 00:33:52 --> 00:33:56 way of explaining this. But today, part of these activities 423 00:33:56 --> 00:34:00 of this Hughes Professorship, I've set up some seminars on 424 00:34:00 --> 00:34:04 teaching. And I've invited a guy from Berkley 425 00:34:04 --> 00:34:09 named Robert Full who is talking in 68.180 at 4:00 PM. 426 00:34:09 --> 00:34:14 And I borrowed some things from him this morning. And we're just going 427 00:34:14 --> 00:34:19 to take a quick tour because I want to show you this. 428 00:34:19 --> 00:34:24 He works on, well, he does a lot of things. 429 00:34:24 --> 00:34:30 He works on biomotion and how animals work. 430 00:34:30 --> 00:34:37 But one of the things he works on, let's see if we can get this guy to 431 00:34:37 --> 00:34:44 go here. Oops. How do I figure out how to get it 432 00:34:44 --> 00:34:51 to play here? Hang on a second. I just discovered that the 433 00:34:51 --> 00:34:59 PowerPoint is not really terribly effective. 434 00:34:59 --> 00:35:04 So this isn't working as nicely as I would like. OK. 435 00:35:04 --> 00:35:09 Let's try this. Just a minute. Where are we? Here we go. OK. 436 00:35:09 --> 00:35:15 Let's see if I can get this to go. So he studies a bunch of things, 437 00:35:15 --> 00:35:20 but he did an undergrad project library studying geckos. 438 00:35:20 --> 00:35:26 And here this is a transparent surface. And he's studying how the 439 00:35:26 --> 00:35:30 geckos climb up and down the thing. And they were making measurements. 440 00:35:30 --> 00:35:34 And they found they couldn't account for why this was such an 441 00:35:34 --> 00:35:37 efficient organism. It used much less energy than most 442 00:35:37 --> 00:35:41 things, so they started looking into how it adhered to the surface. 443 00:35:41 --> 00:35:44 It can go up a vertical wall, as you can see here. 444 00:35:44 --> 00:35:47 And so they were able to look underneath and they could see, 445 00:35:47 --> 00:35:51 see how it sort of peels off the surface? And this was a robot that 446 00:35:51 --> 00:35:54 they eventually built that's not using the same molecular bases but 447 00:35:54 --> 00:35:58 uses this peeling thing. And they can get a robot that 448 00:35:58 --> 00:36:02 climbs up a wall. But that's not what we're going to 449 00:36:02 --> 00:36:07 talk about here. We're going to instead, 450 00:36:07 --> 00:36:12 I hope, go back to here. And you can see that all of the geckos have 451 00:36:12 --> 00:36:17 these sort of bizarre toes, and so they started looking to see 452 00:36:17 --> 00:36:22 what the underlying principle of this was. And they saw it has these 453 00:36:22 --> 00:36:28 setae. And they got looking in greater detail and blew it up. 454 00:36:28 --> 00:36:33 And then they found that there were, as they started looking there were 455 00:36:33 --> 00:36:38 these little hairs. And that's a 900 fold magnification. 456 00:36:38 --> 00:36:43 And once they got looking in more detail they found the ends were 457 00:36:43 --> 00:36:48 split so that they were, the very ends are about 200 458 00:36:48 --> 00:36:53 nanometers roughly at the end of this. And so a gecko has about a 459 00:36:53 --> 00:36:59 billion of these on its feet. And what it turns out it does -- 460 00:36:59 --> 00:37:02 And just to see, here's a human hair. 461 00:37:02 --> 00:37:06 You see how it splits down? Now, this is made of keratin, 462 00:37:06 --> 00:37:10 the molecule I just mentioned that was used, alpha helices, 463 00:37:10 --> 00:37:13 but it's very, very fine. And what it can do, it can make van 464 00:37:13 --> 00:37:17 der Waals interactions. This is an animal that sticks to 465 00:37:17 --> 00:37:21 the wall by van der Waals interactions. And the peeling away 466 00:37:21 --> 00:37:25 allows it to break those bonds. But, as you can see, they're 467 00:37:25 --> 00:37:29 enormously important. He's got here a micrograph. 468 00:37:29 --> 00:37:33 They're measuring the force, and the force is just huge. This is 469 00:37:33 --> 00:37:37 the end of the thing, the frayed end sticking to a surface. 470 00:37:37 --> 00:37:42 And for those of you who didn't think biology any relevance to you, 471 00:37:42 --> 00:37:46 Bob was telling me about this. They followed up, he's an engineer as 472 00:37:46 --> 00:37:51 well and builds interdisciplinary teams, and they've measured this 473 00:37:51 --> 00:37:55 stuff. But this is turning into what appears to look like it's going 474 00:37:55 --> 00:38:00 to be a $30 to $50 billion industry as all sorts of things are -- 475 00:38:00 --> 00:38:04 They're beginning to realize it can hold car parts together, 476 00:38:04 --> 00:38:08 it can go in space shuttles, Post-it notes. And here's a little 477 00:38:08 --> 00:38:12 Band-Aid they made. They own the patent on this 478 00:38:12 --> 00:38:16 self-cleaning dry adhesive. It doesn't have to be made out of 479 00:38:16 --> 00:38:20 gecko stuff. It could be made out of all sorts of things. 480 00:38:20 --> 00:38:25 But, anyway, here's an example of where not only are van der Waals 481 00:38:25 --> 00:38:29 forces very important, but where somebody who started a 482 00:38:29 --> 00:38:33 very simply aspect of biology worrying about the efficiency of how 483 00:38:33 --> 00:38:37 geckos ran and pushed it all the way down to the molecular level 484 00:38:37 --> 00:38:41 understood a principal that's going to make somebody a very 485 00:38:41 --> 00:38:45 large amount of money. OK. The last, 486 00:38:45 --> 00:38:48 and if anybody wants to come, he's an amazing speaker. Perhaps 487 00:38:48 --> 00:38:52 one of the most exciting speakers I've ever heard. 488 00:38:52 --> 00:38:55 68-180, 4:00 PM if you want to go. He'll have more of that sort of 489 00:38:55 --> 00:39:00 stuff to show you then. OK. So the last force here, 490 00:39:00 --> 00:39:07 it's not really a force, but what we'll call hydrophobic 491 00:39:07 --> 00:39:14 effects. And what I mean by this is that the principle of this is that 492 00:39:14 --> 00:39:22 amino acids that don't like to interact with water, so -- 493 00:39:22 --> 00:39:33 So hydrophobic amino acids. 494 00:39:33 --> 00:39:37 These are ones like lucien and phenylalanine. 495 00:39:37 --> 00:39:41 Well, I showed you the water the other day and how it was forming 496 00:39:41 --> 00:39:45 hydrogen bonds between the molecules. So if you're going to stick another 497 00:39:45 --> 00:39:48 molecule in there, you're going to break a bunch of 498 00:39:48 --> 00:39:52 bonds. And if you're not charged or polar you cannot make new bonds with 499 00:39:52 --> 00:39:56 the water. And so what happens, if you put these together, just like 500 00:39:56 --> 00:40:00 if you put oil together it will all bundle up and it will minimize its 501 00:40:00 --> 00:40:04 interactions with water. And that's what proteins do. 502 00:40:04 --> 00:40:08 Here's the structure of a protein all folded up in 3-dimensional space. 503 00:40:08 --> 00:40:13 And you can see at the core of the protein how there are these many 504 00:40:13 --> 00:40:17 hydrophobic amino acids that are interacting. And let me just, 505 00:40:17 --> 00:40:22 I'm going to close by showing you one more little movie. 506 00:40:22 --> 00:40:26 And the new version of PowerPoint doesn't do this well so I'm just 507 00:40:26 --> 00:40:31 going to get out of this for a second here. 508 00:40:31 --> 00:40:39 This is a really cool movie I saw. 509 00:40:39 --> 00:40:47 I want to show you a DNA repair protein sticking to a piece of helix. 510 00:40:47 --> 00:40:55 Can you hit the lights somebody there? So this is a lesion on a 511 00:40:55 --> 00:41:03 piece of, see the double helix here? 512 00:41:03 --> 00:41:06 And what I especially liked about this is this is sort of a Star Wars 513 00:41:06 --> 00:41:10 movie. You're going to fly down the major groove of a double helix. 514 00:41:10 --> 00:41:14 And you can see where this particular protein folded up in 515 00:41:14 --> 00:41:18 3-dimensional space is reaching down into that helix. 516 00:41:18 --> 00:41:22 So this is sort of putting together the two things that I've been 517 00:41:22 --> 00:41:26 telling you about. This blue is a DNA repair protein. 518 00:41:26 --> 00:41:30 Oopsy daisy. A DNA repair protein that's able to 519 00:41:30 --> 00:41:34 find a lesion in the DNA. And here's the double helix that's 520 00:41:34 --> 00:41:39 the two chains held together by hydrogen bonds. 521 00:41:39 --> 00:41:43 And then, as you can see, there's a groove on each side. 522 00:41:43 --> 00:41:46 And the protein is searching down into that groove --