Lecture 12: Continuation: General Theory for Inhomogeneous ODEs

{'English - US': '/courses/mathematics/18-03-differential-equations-spring-2010/video-lectures/lecture-12-continuation-general-theory-for-inhomogeneous-odes/18-03_L12.srt'}

Flash and JavaScript are required for this feature.

Download the video from iTunes U or the Internet Archive.

Topics covered: Continuation: General Theory for Inhomogeneous ODE's. Stability Criteria for the Constant-coefficient ODEs

Instructor/speaker: Prof. Arthur Mattuck

 

We are going to start today in a serious way on the inhomogenous equation, second-order linear differential, I'll simply write it out instead of writing out all the words which go with it. So, such an equation looks like, the second-order equation is going to look like y double prime plus p of x, t, x plus q of x times y. Now, up to now the right-hand side has been zero. So, now we are going to make it not be zero. So, this is going to be f(x). In the most frequent applications, x is time. x is usually time, often, but not always.

So, maybe just for today, I will use X in talking about the general theory. And, from now on, I'll probably make X equal time because that's what is most of the time in the applications. So, this is the part we've been studying up until now. It has a lot of names. It's input, signal, commas between those, a driving term, or sometimes it's called the forcing term. You'll see all of these in the literature, and it pretty much depends upon what course you're sitting at, what the professor habitually calls it. I will try to use all these terms now and then, probably most often I will lapse into input as the most generic term, suggesting nothing in particular, and therefore, equally acceptable or unacceptable to everybody.

The response, the solution, then, the solution as you know is then called the response. The response, sometimes it's called the output. I think I'll stick pretty much with response. So, I'm using pretty much the same terminology we use for studying first-order equations. Now, as you will see, the reason we had to study the homogeneous case first was because you cannot solve this without knowing the homogeneous solutions. So, that's the inhomogeneous case. But the homogeneous one, the corresponding homogeneous thing, y'' + p(x) y' + q(x) y = 0 is an essential part of the solution to this equation.

That's called, therefore, it has names. Now, unfortunately, it doesn't have a single name. I don't know what to call it, but I think I'll probably call it the associated homogeneous equation, or ODE, the associated homogeneous equation, the one associated to the guy on the left. It's also called the reduced equation by some people. There is some other term for it, which escapes me totally, but what the heck. Now, its solution has a name. So, its solution, of course, doesn't depend on anything in particular, the general solution, because the right-hand side is always zero.

So, its solution, we know can be written as y equals in the form c1y1 + c2y2, where y1 and y2 are any two independent solutions of that, and then c1's and c2's are arbitrary constants. Now, what you are looking at this equation, you're going to need this also. And therefore, it has a name. It has various names. Sometimes there is a subscript, c, there. Sometimes there's a subscript, h. Sometimes there's no subscript at all, which is the most confusing of all. But, anyway, what's the name given to it? Well, there is no name. Many books call it the solution to the associated homogeneous equation. That's maximally long. Your book calls it the complementary solution.

Many people call it that, and many will look at you with a blank, who know differential equations very well, and will not have the faintest idea what you're talking about. If you call it (y)h, then you are thinking of it as the solution; the h is for homogeneous to indicate it's the solution. So, it's the solution to the, I'm not going to write that. You put it in her books if you like writing. Write solution to the associated homogeneous equation, y(h). But, it's all the same thing.

Now, or the solution to the reduced equation, I see I have in my notes. Okay, good, the solution to the reduced equation, too. Okay, now, the examples, there are, of course, two classical examples, of which you know one. But, use them as the model for what solutions of these things should look like and how they should behave. So, the model you know already is the one, I won't make the leading coefficient one because it usually isn't, is the one, mx'' + bx' + kx = f(t). That's the spring-mass system, the spring-mass-dashpot system.

Mass, the damping constant and the spring constant, except up to now, it's always been zero here. What does this f(t) represent? Well, if you think of the way in which I derived the equation, the mx, that was the Newton's law. That's the acceleration. So, it's the acceleration, the mass times the acceleration. By Newton's law, this is equal to the imposed force on the little mass truck.

Okay, you got that truck, there. I'm not going to draw the truck for the nth time. You'll have to imagine it. So, here's our truck. Okay, forces are acting on it. Remember, the forces were -kx. That came from the spring. There was a force, -bx'. That came from the dashpot, the damping force. So, this other guy is f(t). What's this? This is the external force, which is acting out. In other words, instead of the little truck going back and forth and doing its own thing all by itself, here's someone with an electromagnet, and the mass it's carrying is a big pile of iron ore. You're turning it on and off, and pulling that thing from afar where nobody can see it. So, this is the external force. Now, think, that is the model you must have in your mind of how these equations are treated.

In other words, when f(t) = 0, the system is passive. There is no external force on it when this is zero. The system is sitting, and just doing what it wants to do, all by itself. You wanted up by giving it an initial push, and putting its initial position somewhere. But after that, you lay your hands off. The system then just passively responds to its initial conditions and does what it wants.

The other model is that you don't let it respond the way it wants to. You force it from the outside by pushing it with an external force. Now, those are clearly two entirely different problems: what it does by itself, or what it does when it's acted on from outside. And, when I explained to you how the thing is to be solved, you have to keep in mind those two models. So, this is the forced system.

I'll just use the word, forced system, that's where f(t) is not zero, versus the passive system where there is no external applied force. The passive system, the forced system, now, you have to both, even if you wanted to solve the forced system, the way the system would behave if nothing would be done to it from the outside is nonetheless going to be an important part of the solution.

And, I won't be able to give you that solution without knowing this also. Now, I'd like to give you the other model very rapidly because it's in your book. It's in the problems I have to give you. You know, it's part of everybody's culture, whether they like it or not. So, that's example number one. Example number two, which follows the differential equation just as perfectly as the spring-mass-dashpot system is the simple electric circuit.

The inductance, you don't know yet what an inductance is, officially, but you will, a resistance, sorry, that's okay, put the capacitance up there, resistance, and then maybe a thing. So, this is a resistance. I think you know these symbols. By now, you certainly know the system for capacitance. What I mean when I say C is the capacitance, you may not know yet what L is. That's called the inductance. So, this is something called a coil because it looks like one. L is what's called its inductance. And, the differential equation, there are two differential equations which can be used in this. They are essentially the same. One is simply the derivative of the other. Both differential equations come from Kirchhoff's voltage law, that the sum of the voltage drops as you move around the circuit --

-- has to be zero because otherwise, I don't have to, that's because of somebody's law, Kirchhoff, with two h's. The sum of the voltage drops to zero, and now you know the voltage drop across this, and you know the voltage drop across that because you learned in 8.02. You will, one day, learn the voltage drop across this. But, I already know it. It's Li. So, i is the current. I'll write this thing in its primitive form first. So, i is the current that's flowing in the circuit. q is the charge on the capacitance. So, the voltage drop across the coil is Li. The voltage shop across the, Li', the voltage drop across the resistance is, well, you know that.

And, the voltage shop across the capacitance is q/C. And so, that's equal to, well, it's equal to zero, except if there's a battery here or something generating a voltage drop, so, let's call that E is a generic word. E could be a battery. It could be a source of alternating current, something like that. But, there's a voltage drop across it, and I'm giving E the name of the voltage drop.

So, and then there's the question of the signs, which I know I'll never understand. But, let's assume you've chosen the sign convention so that this comes out nicely on the right-hand side. So, this might be varying sinusoidally, in which case you'd have source of alternating current. Or, it might be constant, in which that would be a battery, a little dry cell giving you direct current of a constant voltage, stuff like that.

So, you could make this minus if you want, but everything will have the wrong signs, so don't do it. Now, this doesn't look like what it's supposed to look like because it's got q and i. So, the final thing you have to know is that q' = i. The rate at which that charge leaves the condenser and hurries around the circuit to find its little soul mate on the other side is the current that's flowing in the circuit. That's why current flows, except nothing really happens.

Electrons just push on each other, and they stay where they are. I don't understand this at all. So, if I differentiate this, you can do two things. Either you could integrate i, and expressed the thing entirely in terms of q, or you can differentiate it, and express everything in terms of i. Your book does nicely both, does not take sides. So, let's differentiate it, and then it will look like Li'' + Ri' + i/C equals, and now, watch out, you have now not the electromotive force, but its derivative. So, if you were so unfortunate as to put a little dry cell there, now you've got nothing, and you've got the homogeneous case. That's okay. Where are the erasers? One eraser? I don't believe this.

So, there's the equation. There are our two equations. Why don't we put them up in colored chalk. There's the spring equation. And, here's the equation that governs the current, for how the current flows in that circuit. And now, you can see, again, what does it mean? If this is zero, for example, if I have a dry cell there, or if I have nothing at all in the circuit, then this represents the passive circuit.

It's just sitting there. It wouldn't do anything at all, except that you've put a charge on the capacitor, and waited, and of course, when you put a charge on there, it's got a discharge, and discharges through the circuit, and swings back and forth a little bit if it's under-damped until finally towards the end the current dies away to zero. But, what usually happens is that you drive this passive circuit by putting an effective E in it, and then you want to know how the current behaves. So, those are the two problems, the passive circuit without an applied electromotive force, or plugging it into the wall, and wanting it to do things. That's the normal state of affairs.

People don't want passive circuits, they want circuits which do things because, okay, that's why they want to solve inhomogeneous equations instead of homogeneous equations. But as I said, you have to do the homogeneous case first. Okay, you are now officially responsible for this, and I don't care that you haven't had it in physics yet. You will before the next exam. So, I don't even feel guilty.

But, you're going to start using it on the problem set right away. So, it's never too soon to start learning it. Okay, now, the main theorem, I now want to go, so that was just examples to give you some physical feeling for the sorts of differential equations we'll be talking about. I now want to tell you briefly about the key theorem about solving the homogeneous equation. So, the main theorem about solving the homogeneous equation is, the inhomogeneous equation.

So, I'm going to write the inhomogeneous equation out. I'm going to make the left-hand side a linear operator, and am going to write the equation as Ly = f(x). That's the inhomogeneous equation. So, L is the linear operator, second order because I'm only talking about second-order equations. L is a linear operator, and then this is the differential equation. So, here's our differential equation. It's inhomogeneous because it's go the f(x) on the right hand side. And, what the theorem says is that the solution has the following form, yp + yc.

So, the hypothesis is we've got the linear equation, and the conclusion is that that's what its solution looks like. Now, you already know what y sub c looks like. In other words, if I write this out in more detail, it would be i.e., department of fuller explanation, -- -- the general solution looks like y equals yp, and then this thing is going to look like an arbitrary constant times y1 plus an arbitrary constant times y2, where these are solutions of the homogeneous equation. So, Yc looks like this part, and the yp, what's yp? p stands for particular, the most confusing word in this subject.

But, you've got at least four weeks to learn what it means. Okay, yp is a particular solution to Ly = f(x). Now, I'm not going to explain what particular means. First, I'll chat as if you knew what it meant, and then we'll see if you have picked it up. In other words, the procedure for solving this equation is composed of two steps. First, to find this part. In other words, to find the complementary solution, in other words, to do what we've been doing for the last week, solve not the equation you are given, but the reduced equation. So, the first step is to find this. The second step is to find yp.

Now, what's yp? yp is a particular solution to the whole equation. Yeah, but which one? Well, if it's any one, then it's not a particular solution, yeah. I say, unfortunately the word particular here is not being used in exactly the same sense in which most people use it in ordinary English. It's a perfectly valid way to use it. It's just confusing, and no one has ever come up with a better word. So, particular means any one solution.

Any one will do. Okay, even these have slightly different meanings. Any questions about this? I refuse to answer them. [LAUGHTER] Now, well, examples of course will make it all clear. But I'd like, first, to prove the theorem, to show you how simple it is. It's extremely simple if you just use the fact that L is a linear operator. We've got two things to prove. What have we got to prove? Well, I have to prove two statements, first of all, that all the yp + c1y1 + c2y2 are solutions.

How are we going to prove that? Well, how do you know if something is a solution? Well, you plug it into the equation, and you see if it satisfies the equation. Good, let's do it, proof. L, I'm going to plug it into the equation. That means I calculate L(yp + c1y1 + c2y2). Now, what's the answer? Because this is a linear operator, and notice, the argument doesn't use the fact that the equation is second order. It immediately generalizes to a linear equation of any order, whatever-- 47. Okay, this is L of yp plus L(yp + c1y1 + c2y2. Well, what's that? What's L of the complementary solution?

What does it mean to be the complementary solution? It means when you apply the operator L to it, you get zero because this satisfies the homogeneous equation. So, this is zero. What's L(yp)? Well, it was a particular solution to the equation. Therefore, when I plugged it into the equation, I must have gotten out on the right-hand side, f(x). So, this is since yp is a solution to the whole equation.

So, what's the conclusion? That, if I take any one of these guys, no matter what c1 and c2 are, apply the linear operator, L to it, the answer comes out to be f(x). Therefore, this proves that this shows that these are all solutions because that's what it means. Therefore, they satisfy L(y) = f(x). They satisfy the whole inhomogeneous differential equation, and that's it. Well, that's only half the story. The other half of the story is to show that there are no other solutions. Okay, so we got our little u(x) coming up again, and he thinks he's a solution. Okay, so, to prove there are no other solutions, it almost sounds biblical, thou shalt have no other solutions before me, okay.

There are no other solutions accept these guys for different values of c1 and c2. Okay, so, u(x) is a solution. I have to show that u of x is one of these guys. How am I going to do that? Easy. If it's a solution that, L(u), okay, I'm going to drop the x, okay, just to make the, like I dropped the x over there. If it's a solution to the whole inhomogeneous equation, then this must come out to be f(x). Now, what's L(yp)? That's f of x too, by secret little particular solution I've got in my pocket. Okay, I pull it out, ah-ha, L of yp, that's f of x, too. Now, I'm going to not add them. I'm going to subtract them. What is L(u - yp)? Well, it's zero.

It's zero because this is a linear operator. This would be L(u) - L(yp). I guess the answer is zero on the right-hand side. And therefore, what is the conclusion? If that's zero, it must be a solution to the homogeneous equation. Therefore, u - yp is equal to, there must be c1 and c2. I won't give them the generic names. I'll give them a name, a particular one.

I'll put a tilde to indicate it's a particular one. c1 plus c2 y2 tilde, so, in other words, for some choice of these constants, and I'll call those particular choices c1 tilde and c2 tilde, it must be that these are equal. Well, what does that say? It says that u is equal to yp plus c1 tilde, blah, blah, blah, blah, plus c2 tilde, blah, blah, blah, blah, and therefore chose that u wasn't a new solution.

It was one of these. So, u isn't new. So, I should write it down. Otherwise some of you will have missed the punch line. Okay, therefore, u is equal to yp plus c1 tilde y1 plus c2 tilde y2. And, it shows. This guy who thought he was new was not new at all. It was just one of the other solutions. Okay, well, now, since the coefficient's a constant, apparently we've done half the work. We know what the complementary solution is because you know how to do those in terms of exponentials and complex exponentials, signs and cosines, and so on.

So, what's left to do? All we have to do is find to solve equations, which are inhomogeneous. All we have to do is find a particular solution, find one solution. It doesn't matter which one, any one. Just find one, okay? Now, we're going to spend the next two weeks trying to do this. I'll give you various methods. I'll give you a general method involving Fourier series because it's a good excuse for learning what Fourier series are. But, the answer is that in general, for a few standard functions, it's known how to do this. You will learn those methods for finding those using operators. For all the others, it's done by a series, or a method involving approximation.

Or, the worse comes to worst, you throw it on a computer and just take a graph and the numerical output of answers as the particular solution. Okay, now before, we are going to start that work, not today. We'll start it next Monday, and it will last, as I say the next two weeks. And, we will be up to spring break. But, before we do that, I'd like to relate this to what we did for first order equations because there is something to be learned from that.

Think back to the linear first-order equation, and I'm going to, since from now on for the rest of the period, I'm going to be considering the case for constant coefficients. In other words, this case of springs or circuits or simple systems which behave like those and have constant coefficients. So, for the linear, first-order equation, there, too, I'm going to think of constant coefficients. We talked quite a bit about this equation. What did I call the right-hand side? I think we usually called it q(t), right? This is in ancient history. The definition of ancient history was before the first exam. Okay, now how does that fit into this theorem that I've given you? Remember what the solution looked like.

The solution looked like, remember, you took the integrating factor was e^(kt), and then after you integrated both sides, multiplied through, and then the final answer looked like this, y equaled, it was e^(-kt) times either an indefinite integral, or a definite integral depending on your preference, q(t), so, x is metamorphosed into t. I gather you've got that, e to the kt plus, what was the other term? A constant times e^(-kt).

How does this fit into the paradigm I've given you over there for solving the second order equation? Which term is which? Well, this has the arbitrary constant in it. So, this must be the complementary solution. Is it? Is this the solution to the associated homogeneous equation? What's the associated homogeneous equation? Put zero here. Okay, if you put zero there, what's the solution? Now, this you ought to know. y' = -ky. What's the solution? e^(-kt). You are supposed to come into this course knowing that, except there's an arbitrary constant in front. So, right, this is exactly the solution to the associated homogeneous equation, where there is zero here. Then, what's this thing?

This is a particular solution. This is my yp. But that's not a particular solution because this indefinite integral, you know, has an arbitrary constant in it. In fact, it's just that arbitrary constant. So, it's totally confusing. But, this symbol, you know when you actually solve the equation this way, all you did was you found one function here. You didn't throw in the arbitrary constant right away. All you needed to do was find one function.

And, even if you really are bothered by the fact that this is so indefinite, and therefore, make it a particular solution by making this zero, make it a definite integral, zero, here, t there, and then change those t's to dummy t's, t1's or t tildes, or something like that. So, this fits into that thing. In other words, I could have done it at that time, but I didn't the point because this can be solved directly, whereas, of course, the general second order equation in homogeneous cannot be solved directly, and therefore you have to be willing to talk about what its solutions look like in advance.

Now, remember I said, we talked, I said there was two different cases, although both of them had the identical looking solution. Their meaning in the physical world was so different that they really should be considered as solving the same equation. And, one of these was the case. Of the two, perhaps the more important was the case when k was positive, and of course the other is when k is negative. When k is positive, that had the effect of separating that solution into this part, which was a transient, and the other part, which was a steady state. The steady state solution, that was the yp part of it in that terminology. And, the transient part, it was trangent because it went to zero.

If k is positive, the exponential dies regardless of what c is. So, the transient, that's the yc part. It goes to zero as Ttgoes to infinity. The transient depends on, uses, the initial condition, whatever it is, because that's what determines the value of c. On the other hand, this initial condition makes no difference as t goes towards infinity. All that's left is this steady state solution. And, all solutions tend to the steady state solution. So, if k is positive, one gets this analysis of the solutions into the sum of one basic solution, and the others, which just die away, have no influence on this, less and less influence as time goes to infinity.

For k less than zero, this analysis does not work because this term, if k is less than zero, this term goes to infinity or negative infinity, and typically tends to dominate that. So, it's the start that the important one. It depends on the initial conditions, and the analysis is meaningless. So, the above is meaningless. And now, what I'd like to do is try to see what the analog of that is for second order equations, and higher order equations. If you understand second-order, that's good enough. Higher order goes exactly the same way. So, the question is, for second-order, let's make it with constant coefficients plus, I could call it b and k, oh, no, b k, or p.

The trouble is, that wouldn't take care of the electrical circuits. So, I just want to use neutral letters, which suggest nothing. And, you can make them turn it into a circuit, so springs, or yet other examples undreamt of. But these are constants. And I'm going to think of it as time. I think I'll switch back to time, let x be the time. So, B y equals f(t). So, there is our equation. A and B are constants. And, the question is, the question I'm asking, can think of either of these two models or others, the question I'm asking is, under what circumstances can I make that same type of analysis into steady-state and transient?

Well, what does the solution look like? The solution looks like yp + c1y1 + c2y2. Therefore, to make that look like this, the c1 and c2 contain the initial conditions. This part does not. Therefore, if I want to say that the solutions look like a steady state solution plus something that dies away, which becomes less and less important as time goes on, what I'm really asking is, under what circumstances is this part guaranteed to go to zero?

So, the question is, when, in other words, under what conditions on the equation A and B, in effect, is what we are asking. When does c1y1 + c2y2 -> 0 as t --> infinity, regardless of what c1 and c2 are for all c1 c2. Now, here there was no difficulty. We had the thing very explicitly, and you could see k is positive: this goes to zero. And if k is negative, it doesn't go to zero. It goes to infinity. Here, I want to make the same kind of analysis, except it's just going to take, it's a little more trouble. But the answer, when it finally comes out is very beautiful. So, when are all these guys going to go to zero? First of all, you might as well just have the definition. So, all the good things that this is going to imply, if this is so, in other words, if they all go to zero, everything in the complementary solution, then the ODE is called stable.

Some people call it asymptotically stable. I don't know what to call it. I can make the analysis, and then I use the identical terminology, c1 y1 plus c2 y2. This is called the transient because it goes to zero. This is called the particular solution now that we labored so hard to get for the next two weeks. It's the important part. It's the steady-state part. It's what lasts out to infinity after the other stuff has disappeared. So, this is the steady-state solution, steady-state solution, okay?

And, the differential equation is called stable. Now, it's of the highest interest to know when a differential equation is stable, linear differential equation is stable in this sense because you have a control. You know what its solutions look like. You have some feeling for how it's behaving in the long term. If this is not so, each equation is a law unto itself if you don't know. So, let's do the work. For the rest of the period, what I'd like to do is to find out what the conditions are, which make this true. Those were the equations which we will have a right to call stable. So, when does this happen, and where is it going to happen? I don't know. I guess, here.

Now, I think the first step is fairly easy, and it will give you a good review of what we've been doing up until now. So, I'm simply going to make a case-by-case analysis. Don't worry, it won't take very long. What are the cases we've been studying? Well, what do the characteristic roots look like? The roots of the characteristic equation, in other words, remember, there are cases. The first case is they are real and distinct, r1 not equal to r2, real and distinct. What are the other cases? Well, r1 = r2. And then, there's the case where there are complex. So, I will write it r equals a plus or minus b i.

What do the solutions look like? So, my ham-handed approach to this problem is going to be, in each case, I'll look at the solutions, and first get the condition on the roots. So, in other words, I'm not going to worry right away about the a and the b. I'm going, instead, to worry about expressing this condition of stability in terms of the characteristic roots. In fact, that's the only way in which many people know the conditions.

You're going to be smarter. Okay, what do the solutions look like? Well, the general solution looks like e^(r1 t) + c2 e^(r2 t). Okay, so, what's the stability condition? In other words, if equation happened to have its characteristic roots, real and distinct, under what circumstances would it be stable? Would it, in other words, all its solutions go to zero? So, I'm talking about the homogeneous equation, the reduced equation, the associated homogeneous equation.

Why? Because that's all that's involved in this. In other words, when I write that, I am no longer interested in the whole equation. All I'm interested in is the reduced equation, the equation where you turn the f(t) on the right-hand side into zero. So, what's the stability condition? Well, let's write it out. Under what circumstances will all these guys go to zero? If r1 and r2 should be negative, can they be zero? No, because then it will be a constant and it will go to zero. How about this one? Well, in this one, it's (c1 + c2 t) e^(r1 t).

Of course, both of these are the same. I'll just arbitrarily pick one of them. What happens to this as things go to zero? Well, this part is rising, at least if c2 is positive. This part is either helping or it's hindering. But, I hope you know what these functions look like, and you know which of them go to zero. They go to zero if r1 is negative. It might rise in the beginning, but after a while they lose the energy. Of course, if r1 is equal to zero, what do these guys do? Linear, go to infinity. Well, we are doing okay. How about here? Well, here, it's a little more complicated. The solutions look like e^(at) c1 cos(bt) + c2 sin(bt).

Now, this part is a pure oscillation. You know that. It might have a big amplitude, but whatever it does, it does the same thing all the time. So, whether this goes to zero depends entirely upon what that exponential is doing. And, that exponential goes to zero if a is negative. So here, the condition is negative. And now, the only thing left to do is to say it nicely. I've got three cases, and I want to say them all in one breath. So, the stability condition is, the ODE is stable. So, this is, or f(t). It doesn't matter. But, psychologically, you can put this as zero there, is stable if what?

In case one, this is true. In case two, that's true. In case three, that's true. But that's ugly. Make it beautiful. The beautiful way of saying it is if all the characteristic roots have negative real parts. If the characteristic roots, the r's or the a plus or minus b i, have negative real part. That's the form in which the electrical engineers will nod their head, tell you, yeah, that's right, negative real part, sorry. Isn't it right? Is that right here? Yeah. What's the real part of these guys?

They themselves, because they are real. What's the real part of this? Yeah. The only case in which I really had to use real part is when I talk about the complex case because a is just the real part of a complex number. It's not the whole thing.