Calculus

Lesson 65

Calculus of Variations

Back to Dr. Nandor's Calculus Notes Page

 Back to Dr. Nandor's Calculus Page

Back to Dr. Nandor's Math Pages

Back to Wellington

 

 

Thanks to Dr. Helliwell of Harvey Mudd College - this lesson essentially comes from the class notes from his Theoretical Mechanics class at Mudd in 1992-1993.

 

                        Let's say that we know that there is some function that depends

                                    on the path we take.  It doesn't matter what the function

                                    is:  it could be friction, or work, or earned wages - whatever.

                                    , where  is the path taken.  So if the function

                                    were wages, perhaps the wages depend on which path is taken, and

                                    how fast each section of the path is tread.

 

                                    If we wanted to add up all of the wages, we would have to do

                                    an integral:

 

                                               

 

                                     is called a "Functional."  It is NOT necessarily a function!

 

 

                        Now, let us suppose that we want to evaluate .  In theory, we can,

                                    at the very least, numerically integrate it if we know what  is.

 

 

                        The question is:  if we know what outcome we want, can we find a ?

                                    More specifically, what if we want to minimize or maximize ?  Can

                                    we find what  must be?  Does such a function even exist?

 

                       

 

 
 

                                                

 

 

 

 

 

            Well, let us, for the moment, assume that it does exist, and we'll call it .  Let's

            also say that any other function , where  is

            the difference between  and  at any given point.  It will be

            easier, for our purposes, actually substitute  into  instead

            of , since we have a way to vary  as we shall see.  Note that if

            , that  so we should be able to make the substitution

            without any ill effects, as long as  at the end.

 

 

 

                       

 

 
   

                                                       

        

 

 

 

            We want some way to plug  into our functional and to manipulate it to reduce

                        it to  when we are minimizing the functional.  Thus we will want to

                        see how .  So, let's say

           

                       

 

                        which is a power series.   is called the "smallness parameter."  As we make

                        , we certainly make ; however, note that we also make

                        the higher order functions unimportant, since they are multiplied by

                        higher powers of .    Doing the above is similar to the statement we always

                        make about a function looking a like a line if we zoom in really close to it.  In

                        the same way, as we zoom in on  as , it looks like a

                        multiple of the same function .  Therefore, it doesn't matter which

                         we use, it is just some multiple of the function  as long as

                        we zoom in close enough.  So we have reduced the problem from one with

                        an infinite number of s to one where we always have the same

                        small difference function, , and we only vary the size of a constant, .

 

 

 

            Now, let's concentrate on .  We know some things about it. 

                         since  at the endpoints.  Plugging

                        everything into our functional, then,

 

                                     

                                    where we know

                                                                                    

            SO.  What is the method we will use to find the extremum of our functional?

 

                        The same thing we always do!  Take the derivative with respect to something

                        that must go to zero:

                                               

 

            This is especially convenient since  is not in either integration limit so we can

                        bring the derivative inside the integral.

 

                                               

 

            So what we have so far is:

                       

 

 

We can integrate the second integral by parts.

 

                       

 

 

 

 

 

                                            This term=0 since  at the endpoints

                                   

 

 

                                    So, the derivative of our functional is:

 

                                   

 

                                    We are looking for an extremum, so combining the integrals and

                                                setting the integral = 0, we get:

 

                                   

 

                        The above equation must be true for ANY arbitrary  (provided that

                                    it goes to zero at the endpoints), so the only way that the integral

                                    will ALWAYS be zero, is if what is in the parentheses is always zero.

                                    Note that if we take the limit now as , we arrive at

                                    our extremum path instead of one that is merely close to our

                                    extremum path.

                                                                       

 

 

                        This is called Euler's Equation (EI).  For the integral to be an extremum,

                                    the function inside the integral MUST obey this equation.

 

                         Think back to what we were doing to recap:  We are trying to find

                                    the function that extremizes the integral.  By saying that we were

                                    using a function close to, but not quite, the minimizing function,

                                    we were able to derive an equation that the minimizing function

                                    must obey.  We haven't done any examples yet, but at least we know

                                    now what one of the conditions placed on the function must be.                   

                        Also remember that all we've done is find a condition for the integral to

                                    be an extremum Ð we haven't specified whether that extremum

                                    is a min, max, or stationary.  However, usually we can use common

                                    sense.

 

                        Example:  if , write out Euler's Equation.

 

                                   

 

                        Of course we don't know how to solve this equation, but it's a start.

 

 

 

 

Let's do one we can solve:  Find the shortest distance between two points.

                       

                                   

                                    Using EI:

 

                                    where     

 

                                    So doing a bit of algebra:

 

                                   

 

                                    The shortest distance between two points is a line!!!!!!

 

 

 

                        But we won't always be able to solve EI!

 

                        Let's find another formulation of Euler's Equation that might make it

                        easier for us to solve.

 

                        Let's consider   and  .  Why?  Because I'm telling you

                        that we can get an easier form of Euler's Equation if we do!  So there!

 

                       

 

                                               

 

                                                But we know from EI that , so

 

 

                                                                   *

 

 

 

                       

                                                The other part of what we're looking at:

 

                                                                      **

 

 

 

 

                                    Subtracting the two results (** - *), we get

 

                                                                       This is (EII)

 

 

 

                        The reason EII is useful is that often we will not have a function that

                                    explicitly a function of x, which means .  When this is the case,

 

 

                                               

 

 

                        So, recap. 

                                    Use EII when  is not explicate a function of .

                                                Then we can move straight to

 

 

                                    Use EI when  is not explicate a function of .

                                                Then we can move straight to

 

 

 

 

                                    The reason this is called variational calculus is that small variations

                        of the path do not make a large change in our functional.  Think of it like

                        at the top of a parabola, small variations in x do not greatly change the

                        slope.  However, when we are NOT near the vertex, a small change has

                        a large effect on the slope.

 

 

                        Note that when we look at our previous line example,  

                                    is an explicit function of neither  nor , so we could use either

                                    EI or EII (although we would still use EI since although we know

                                    , we don't know  so we wouldn't be able

                                    to solve EII.

 

 

 

 

                        Now, let's see the problem that caused Newton to essentially invent the

                        calculus of variations:  the brachistochrone.

 

 

                        When only gravity is considered, what is the fastest path between two points?

                                    It's not a line.  Note that the sooner we speed up, the faster we'll be.

                                    The answer isn't straight down first since then we waste time by taking

                                    a path that's too long.  So.  We know what it should look like,

                                    at least approximately:                   A

                                                                                                                               B

 

 

 

 

                        Let's minimize time:                    

 

 

 

                                    We know  is the differential path length.

                                    We can use the Work-Energy Equation (transferring potential to

                                    kinetic energy, which we have done innumerable times) to find that

                                     at any given point.

 

 

 

 

                                   

 

 

 

 

                                    Plugging this into EII (since  has no explicit -

                                                dependence,

 

 

 

 

 

                                   

 

 

 

We will call  to simplify things.  Also, we only choose the +ve answer since

            y is +ve down!

 

           

                                                           

 

 

 

I'll tell you the substitution to make:           

 

                        We can at least see why this might work!

 

                                                           

 

                        We can solve for  by saying that are starting point is .  This gives

                                    us

 

 

                        Since we have already said what  is, we now have a parametric solution

                        to the problem:

 

                                   

 

 

                                    These equations describe what is called a "cycloid."  It is the shape that

                        is traced by any point on  a circle as it rolls without slipping on a plane.  If we

                        are given a final , we can solve for the constant, .

 

                                                                           

   A

                                                                 &