|
|
(One intermediate revision by one other user not shown) |
Line 1: |
Line 1: |
| {{Technical|date=April 2013}}
| | You probably have ever had a panic assault, you don't look ahead to one other one. The "fight-or flight" response in your body is involuntarily turned on and your nervous system alerts your adrenal glands to start pumping out stress hormones to get you able to either run away or flip and struggle.<br><br>To begin with the process of naturally beating anxiety attacks, take into consideration one particular state of affairs that's resulting in your blood pressure stage to skyrocket sending you into an actual panic attack. This may be finished as prevention to a full blown attack, since weight loss you may cease it before it really takes place. Imagine all of the stuff you can lose if you permit the troubles take over, if you happen to ever feel prefer it's getting uncontrolled. One of the vital keys to self-control is knowing that you're the one who has control over your physique.<br><br>If you happen to're making an attempt to decide whether or to not deal with mediterranean diet your anxiousness with medicine, it is important to weigh the pros and cons along side your physician. It's also essential to be taught in regards to the frequent uncomfortable side effects of the anxiety treatment you might be contemplating. Unwanted side effects of anxiety medication vary from gentle nuisances equivalent to dry mouth to extra severe issues similar to acute nausea or pronounced weight acquire. For any nervousness treatment, you will have to stability the uncomfortable side effects towards the benefits. Inquiries to ask your self and a mental well being skilled<br><br>Throughout your hot flash, it's possible you'll breathe too rapidly and evenly and really feel as if the wind is being pressed out of your chest. Of course anxiousness is the foundation reason behind the recent flash, however being in a room with a high temperature can worsen the situation. Spicy foods set off ache receptors and set off chemical chain-reactions which might be very much like sizzling flashes. The fundamental act of eating a spicy meal might be related sufficient to the experience of a hot flush that your physique is stimulated into producing an precise anxiousness sizzling flush. The recognizable hot flash feeling may additionally make you anxious, which then leads to a real simple diet burn fat hot flash from anxiety. Those are only a few examples of medications and the dangerous effects they have in your body. Worry of assembly new people<br><br>It's brought on by a sudden, vital drop in blood pressure. When blood pressure drops this fashion, there is a probability the mind won't get enough blood, as a result of it is on the highest point of the body. That's no good - your brain needs a gentle supply of the oxygen the blood carries. So when your body can't get sufficient blood to your mind due to low blood stress, it protects you by bringing your brain down to the blood. The faint ensures a blood provide to the mind by bringing your head down to the ground. |
| | |
| In [[linear algebra]], for a [[matrix (mathematics)|matrix]] ''A'', there may not always exist a full set of linearly independent eigenvectors that form a complete basis – a matrix may not be [[diagonalizable matrix|diagonalizable]]. This happens when the [[algebraic multiplicity]] of at least one [[eigenvalue]] λ is greater than its [[geometric multiplicity]] (the [[nullity]] of the matrix <math>(A-\lambda I)</math>, or the [[dimension (vector space)|dimension]] of its [[Kernel (linear algebra)|nullspace]]). In such cases, a '''generalized eigenvector''' of ''A'' is a nonzero [[Euclidean space|vector]] '''v''', which is associated with λ having [[algebraic multiplicity]] ''k'' ≥1, satisfying
| |
| | |
| : <math>(A-\lambda I)^k\mathbf{v} = \mathbf{0}.</math>
| |
| The set spanned by all generalized eigenvectors for a given λ, form the '''generalized eigenspace''' for λ.
| |
| | |
| Ordinary [[eigenvector]]s and [[eigenspace]]s are obtained for ''k''=1.
| |
| | |
| ==For defective matrices==
| |
| | |
| Generalized eigenvectors are needed to form a complete [[basis (linear algebra)|basis]] of a [[defective matrix]], which is a matrix in which there are fewer [[linearly independent]] eigenvectors than eigenvalues (counting multiplicity). Over an algebraically closed field, the generalized eigenvectors ''do'' allow choosing a complete basis, as follows from the [[Jordan form]] of a matrix.
| |
| | |
| In particular, suppose that an eigenvalue ''λ'' of a matrix ''A'' has an algebraic multiplicity ''m'' but fewer corresponding eigenvectors. We form a sequence of ''m'' eigenvectors and generalized eigenvectors <math>x_1, x_2, \ldots, x_m</math> that are linearly independent and satisfy
| |
| | |
| :<math>(A - \lambda I) x_k = \alpha_{k,1}x_1+\cdots+\alpha_{k,k-1}x_{k-1} </math>
| |
| | |
| for some coefficients <math>\alpha_{k,1},\ldots,\alpha_{k,k-1}</math>, for <math>k=1,\ldots,m</math>. It follows that
| |
| | |
| :<math>(A - \lambda I)^k x_k = 0. \!</math>
| |
| | |
| The vectors <math>x_1, x_2, \ldots, x_m</math> can always be chosen, but are not uniquely determined by the above relations. If the geometric multiplicity (dimension of the eigenspace) of ''λ'' is ''p'', one can choose the first ''p'' vectors to be eigenvectors, but the remaining ''m'' − ''p'' vectors are only generalized eigenvectors. | |
| | |
| ==Examples==
| |
| | |
| ===Example 1===
| |
| Suppose
| |
| :<math> A = \begin{bmatrix} 1 & 1\\ 0 & 1 \end{bmatrix}. </math>
| |
| Then there is one eigenvalue λ=1 with an algebraic multiplicity m of 2.
| |
| | |
| There are several ways to see that there will be one generalized eigenvector necessary. Easiest is to notice that this matrix is in [[Jordan normal form]], but is not diagonal, meaning that this is not a diagonalizable matrix. Since there is one superdiagonal entry, there will be one generalized eigenvector (or you could note that the vector space is of dimension 2, so there can be only one generalized eigenvector). Alternatively, you could compute the dimension of the [[Kernel (linear algebra)|nullspace]] of <math> A-I </math> to be ''p''=1, and thus there are ''m''-''p''=1 generalized eigenvectors.
| |
| | |
| Computing the ordinary eigenvector <math> v_1=\begin{bmatrix}1 \\0 \end{bmatrix}</math> is left to the reader (see the [[eigenvector]] page for examples). Using this eigenvector, we compute the generalized eigenvector <math> v_2 </math> by solving
| |
| | |
| :<math> (A-\lambda I)v_2 = v_1. </math>
| |
| Writing out the values:
| |
| :<math> \left(\begin{bmatrix} 1 & 1\\ 0 & 1 \end{bmatrix}- \begin{bmatrix} 1 & 0\\ 0 & 1 \end{bmatrix}\right)\begin{bmatrix}v_{21} \\v_{22} \end{bmatrix} = \begin{bmatrix}1 \\0 \end{bmatrix}.</math>
| |
| This simplifies to
| |
| :<math> \begin{matrix} v_{21}+v_{22}-v_{21} = 1 \\ v_{22}- v_{22} = 0. \end{matrix}</math>
| |
| This simplifies to
| |
| | |
| :<math> v_{22}= 1. </math>
| |
| | |
| And <math>v_{21}</math> has no restrictions and thus can be any scalar. So the generalized eigenvector is <math> v_2=\begin{bmatrix}* \\1 \end{bmatrix}</math>, where the * indicates that any value is fine. Usually picking 0 is easiest.
| |
| | |
| ===Example 2===
| |
| | |
| The matrix
| |
| | |
| :<math>A = \begin{bmatrix}
| |
| 1 & 0 & 0 & 0 & 0 \\
| |
| 3 & 1 & 0 & 0 & 0 \\
| |
| 6 & 3 & 2 & 0 & 0 \\
| |
| 10 & 6 & 3 & 2 & 0 \\
| |
| 15 & 10 & 6 & 3 & 2
| |
| \end{bmatrix}</math>
| |
| | |
| has ''eigenvalues'' of 1 and 2 with ''algebraic multiplicities'' of 2 and 3, but ''geometric multiplicities'' of 1 and 1.
| |
| | |
| The ''generalized eigenspaces'' of <math>A</math> are calculated below.
| |
| | |
| :<math>(A-1 I) \begin{bmatrix}
| |
| 0 \\ 1 \\ -3 \\ 3 \\ -1
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| 0 & 0 & 0 & 0 & 0 \\
| |
| 3 & 0 & 0 & 0 & 0 \\
| |
| 6 & 3 & 1 & 0 & 0 \\
| |
| 10 & 6 & 3 & 1 & 0 \\
| |
| 15 & 10 & 6 & 3 & 1
| |
| \end{bmatrix}\begin{bmatrix}
| |
| 0 \\ 1 \\ -3 \\ 3 \\ -1
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 0 \\ 0
| |
| \end{bmatrix}</math>
| |
| | |
| :<math>(A - 1 I) \begin{bmatrix}
| |
| 1 \\ -15 \\ 30 \\ -1 \\ -45
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| 0 & 0 & 0 & 0 & 0 \\
| |
| 3 & 0 & 0 & 0 & 0 \\
| |
| 6 & 3 & 1 & 0 & 0 \\
| |
| 10 & 6 & 3 & 1 & 0 \\
| |
| 15 & 10 & 6 & 3 & 1
| |
| \end{bmatrix} \begin{bmatrix}
| |
| 1 \\ -15 \\ 30 \\ -1 \\ -45
| |
| \end{bmatrix} = 3\begin{bmatrix}
| |
| 0 \\ 1 \\ -3 \\ 3 \\ -1
| |
| \end{bmatrix}
| |
| </math>
| |
| | |
| :<math>(A - 2 I) \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 0 \\ 1
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| -1 & 0 & 0 & 0 & 0 \\
| |
| 3 & -1 & 0 & 0 & 0 \\
| |
| 6 & 3 & 0 & 0 & 0 \\
| |
| 10 & 6 & 3 & 0 & 0 \\
| |
| 15 & 10 & 6 & 3 & 0
| |
| \end{bmatrix} \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 0 \\ 1
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 0 \\ 0
| |
| \end{bmatrix}
| |
| </math>
| |
| | |
| :<math>(A - 2 I) \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 1 \\ 0
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| -1 & 0 & 0 & 0 & 0 \\
| |
| 3 & -1 & 0 & 0 & 0 \\
| |
| 6 & 3 & 0 & 0 & 0 \\
| |
| 10 & 6 & 3 & 0 & 0 \\
| |
| 15 & 10 & 6 & 3 & 0
| |
| \end{bmatrix} \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 1 \\ 0
| |
| \end{bmatrix} = 3 \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 0 \\ 1
| |
| \end{bmatrix}
| |
| </math>
| |
| | |
| :<math>(A - 2 I) \begin{bmatrix}
| |
| 0 \\ 0 \\ 1 \\ -2 \\ 0
| |
| \end{bmatrix} = \begin{bmatrix}
| |
| -1 & 0 & 0 & 0 & 0 \\
| |
| 3 & -1 & 0 & 0 & 0 \\
| |
| 6 & 3 & 0 & 0 & 0 \\
| |
| 10 & 6 & 3 & 0 & 0 \\
| |
| 15 & 10 & 6 & 3 & 0
| |
| \end{bmatrix} \begin{bmatrix}
| |
| 0 \\ 0 \\ 1 \\ -2 \\ 0
| |
| \end{bmatrix} = 3 \begin{bmatrix}
| |
| 0 \\ 0 \\ 0 \\ 1 \\ 0
| |
| \end{bmatrix}
| |
| </math>
| |
| | |
| This results in a basis for each of the ''generalized eigenspaces'' of <math>A</math>.
| |
| Together they span the space of all 5 dimensional column vectors.
| |
| | |
| :<math>
| |
| \left\{
| |
| \begin{bmatrix} 0 \\ 1 \\ -3 \\ 3 \\ -1 \end{bmatrix}
| |
| \begin{bmatrix} 1 \\ -15 \\ 30 \\ -1 \\ -45 \end{bmatrix}
| |
| \right\},
| |
| \left\{
| |
| \begin{bmatrix} 0 \\ 0 \\ 0 \\ 0 \\ 1 \end{bmatrix}
| |
| \begin{bmatrix} 0 \\ 0 \\ 0 \\ 1 \\ 0 \end{bmatrix}
| |
| \begin{bmatrix} 0 \\ 0 \\ 1 \\ -2 \\ 0 \end{bmatrix}
| |
| \right\}
| |
| </math>
| |
| | |
| The ''Jordan Canonical Form'' is obtained.
| |
| | |
| :<math>
| |
| T = \begin{bmatrix}
| |
| 0 & 0 & 0 &1& 0 \\
| |
| 3 & 0 & 0 &-15& 0 \\
| |
| -9 & 0 & 0 &30& 1 \\
| |
| 9 & 0 & 3 &-1& -2 \\
| |
| -3 & 9 & 0 &-45& 0
| |
| \end{bmatrix} \quad J = \begin{bmatrix}
| |
| 1 & 1 & 0 & 0 & 0 \\
| |
| 0 & 1 & 0 & 0 & 0 \\
| |
| 0 & 0 & 2 & 1 & 0 \\
| |
| 0 & 0 & 0 & 2 & 1 \\
| |
| 0 & 0 & 0 & 0 & 2
| |
| \end{bmatrix}
| |
| </math>
| |
| | |
| where
| |
| | |
| :<math>AT = TJ</math>
| |
| | |
| ==Other meanings of the term==
| |
| | |
| * The usage of [[generalized eigenfunction]] differs from this; it is part of the theory of [[rigged Hilbert space]]s, so that for a [[linear operator]] on a [[function space]] this may be something different.
| |
| | |
| * One can also use the term ''generalized eigenvector'' for an eigenvector of the ''[[generalized eigenvalue problem]]''
| |
| | |
| : <math> Av = \lambda B v.</math>
| |
| | |
| == The Nullity of (A − λ I)<sup>k</sup> ==
| |
| | |
| === Introduction ===
| |
| | |
| In this section it is shown, when <math>\lambda</math> is an ''eigenvalue'' of a matrix <math>A</math> with ''algebraic multiplicity'' <math>k</math>, then the ''null space'' of <math>(A - \lambda I)^k</math> has dimension <math>k</math>.
| |
| | |
| === Existence of Eigenvalues ===
| |
| | |
| Consider a n × n matrix '''A'''. The ''determinant'' of '''A''' has the fundamental properties of being ''n linear'' and ''alternating''. Additionally {{nowrap|det('''I''') {{=}} 1}}, for '''I''' the n × n identity matrix. From the determinant's
| |
| definition it can be seen that for a ''triangular'' matrix
| |
| {{nowrap|'''T''' {{=}} (t<sub>ij</sub>)}} that {{nowrap|det('''T''') {{=}} ∏(t<sub>ii</sub>)}}. In other words, the determinant is the product of the diagonal entries.
| |
| | |
| There are three ''elementary row operations'', ''scalar multiplication'', ''interchange'' of two rows, and the ''addition'' of a ''scalar multiple'' of one row to another. Multiplication of a row of '''A''' by α results in a new matrix whose determinant is
| |
| α det('''A'''). Interchange of two rows changes the ''sign'' of the determinant, and the addition of a scalar multiple of one row to another does not affect the determinant. The following simple theorem holds, but requires a little proof.
| |
| | |
| '''Theorem:''' The equation {{nowrap|'''A''' '''x''' {{=}} '''0'''}} has a solution {{nowrap|'''x''' ≠ '''0'''}}, if and only if {{nowrap|det('''A''') {{=}} 0}}.
| |
| | |
| ''Proof:'' Given the equation {{nowrap|'''A''' '''x''' {{=}} '''0'''}} attempt to solve it using the ''elementary row operations'' of ''addition'' of a ''scalar multiple'' of one row to another and row ''interchanges'' only, until an equivalent equation {{nowrap|'''U''' '''x''' {{=}} '''0'''}} has been reached, with '''U''' an upper triangular matrix. Since {{nowrap|det('''U''') {{=}} ±det('''A''')}} and {{nowrap|det('''U''') {{=}} ∏(u<sub>ii</sub>)}} we have that {{nowrap|det('''A''') {{=}} 0}} if and only if at least one {{nowrap|u<sub>ii</sub> {{=}} 0}}. The back substitution
| |
| procedure as performed after ''Gaussian Elimination'' will allow placing at least one non zero element in '''x''' when there is a {{nowrap|u<sub>ii</sub> {{=}} 0}}. When all {{nowrap|u<sub>ii</sub> ≠ 0}} back substitution will require {{nowrap|'''x''' {{=}} '''0'''}}. ''QED''
| |
| | |
| '''Theorem:''' The equation {{nowrap|'''A''' '''x''' {{=}} λ '''x'''}} has a solution {{nowrap|'''x''' ≠ '''0'''}}, if and only if {{nowrap|det( λ '''I''' − '''A''') {{=}} 0}}.
| |
| | |
| ''Proof:'' The equation {{nowrap|'''A''' '''x''' {{=}} λ '''x'''}} is equivalent to {{nowrap|( λ '''I''' − '''A''') '''x''' {{=}} '''0'''}}. ''QED.''
| |
| | |
| === Constructive proof of Schur's triangular form ===
| |
| | |
| The proof of the main result of this section will rely on the ''similarity transformation'' as stated and proven next.
| |
| | |
| '''Theorem''': (''Schur Transformation to Triangular Form Theorem'') For any n × n matrix '''A''', there exists a ''triangular'' matrix '''T''' and a ''unitary'' matrix '''Q''', such that {{nowrap|'''A''' '''Q''' {{=}} '''Q''' '''T'''}}. (The transformations are not unique, but are related.)
| |
| | |
| ''Proof:'' Let λ<sub>1</sub> be an ''eigenvalue'' of the {{nowrap|n × n}} matrix '''A''' and '''x''' be an associated ''eigenvector'', so that '''A''' '''x''' = λ<sub>1</sub>'''x'''. Normalize the ''length'' of '''x''' so that {{abs|'''x'''}} = 1.
| |
| | |
| For
| |
| | |
| :<math>x=\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}</math>,
| |
| | |
| construct a unitary matrix
| |
| | |
| :<math>Q=\begin{bmatrix}
| |
| x_1 & q_{1\,2} & q_{1\,3} & \cdots & q_{1\,n} \\
| |
| x_2 & q_{2\,2} & q_{2\,3} & \cdots & q_{2\,n} \\
| |
| \vdots & \vdots & \vdots & & \vdots \\
| |
| x_n & q_{n\,2} & q_{n\,3} & \cdots & q_{n\,n}
| |
| \end{bmatrix}</math>
| |
| | |
| '''Q''' should have '''x''' as its first column and have its columns an ''orthonormal basis'' for '''C<sup>n</sup>'''. Now, {{nowrap|'''A''' '''Q''' {{=}} '''Q''' '''U<sub>1</sub>'''}}, with
| |
| '''U<sub>1</sub>''' of the form:
| |
| | |
| [[File:u1 shur.gif|link=|alt=]]
| |
| | |
| Let the ''induction hypothesis'' be that the theorem holds for all {{nowrap|(n-1) × (n-1)}} matrices. From the construction, so far, it holds for {{nowrap|n {{=}} 2}}. Choose a unitary '''Q<sub>0</sub>''', so that {{nowrap|'''U<sub>0</sub>''' '''Q<sub>0</sub>''' {{=}} '''Q<sub>0</sub>''' '''U<sub>2</sub>'''}}, with '''U<sub>2</sub>''' of the ''upper triangular'' form. Define '''Q<sub>1</sub>''' by:
| |
| | |
| [[File:q1 shur.gif|link=|alt=]]
| |
| | |
| Now:
| |
| | |
| [[File:u1 q1 shur.gif|link=|alt=]]
| |
| | |
| [[File:q0 u2 shur.gif|link=|alt=]]
| |
| | |
| Summarizing,
| |
| | |
| :<math>U_1 Q_1 = Q_1 U_3</math>
| |
| | |
| with:
| |
| | |
| :<math>U_3=\begin{bmatrix}
| |
| \lambda_1 & z_{1\,2} & z_{1\,3} & \cdots & z_{1\,n} \\
| |
| 0 & \lambda_2 & z_{2\,3} & \cdots & z_{2\,n} \\
| |
| 0 & 0 & \lambda_3 & \cdots & z_{3\,n} \\
| |
| \vdots & \vdots & \vdots & & \vdots \\
| |
| 0 & 0 & 0 & \cdots & \lambda_n
| |
| \end{bmatrix}</math>
| |
| | |
| Now, {{nowrap|'''A''' '''Q''' {{=}} '''Q''' '''U<sub>1</sub>'''}} and {{nowrap|'''U<sub>1</sub>''' '''Q<sub>1</sub>''' {{=}} '''Q<sub>1</sub>''' '''U<sub>3</sub>'''}}, where '''Q''' and '''Q<sub>1</sub>''' are ''unitary'' and
| |
| '''U<sub>3</sub>''' is ''upper triangular''. Thus {{nowrap|'''A''' '''Q''' '''Q<sub>1</sub>''' {{=}} '''Q''' '''Q<sub>1</sub>''' '''U<sub>3</sub>'''}}. Since the product of two unitary matrices is unitary, the proof is done. ''QED''.
| |
| | |
| === Nullity Theorem's Proof ===
| |
| | |
| Starting from {{nowrap|'''A Q {{=}} Q U'''}}, we can solve for '''A''' to obtain {{nowrap|'''A''' {{=}} '''Q U Q'''<sup>T</sup>}}, since
| |
| {{nowrap|'''Q Q'''<sup>T</sup> {{=}} '''I'''}}. Now, after subtracting x'''I''' from both sides, we find
| |
| : {{nowrap|x '''I''' − '''A''' {{=}} '''Q''' (x '''I''' − '''U''') '''Q'''<sup>T</sup>}}
| |
| and hence
| |
| : {{nowrap|det(x '''I''' − '''A''') {{=}} det(x '''I''' − '''U''')}}.
| |
| So, the characteristic polynomial of '''A''' is the same as that for '''U''' and is given by
| |
| : {{nowrap|p(x) {{=}} (x − λ<sub>1</sub>)(x − λ<sub>2</sub>)...(x − λ<sub>n</sub>)}},
| |
| where the λ<sub>i</sub>s are the eigenvalues of '''A''' and '''U'''.
| |
| | |
| Observe, the construction used in the proof above, allows choosing any order for the eigenvalues of '''A''' that will end up as the diagonal elements of the upper triangular matrix '''U''' obtained. The ''algebraic multiplicity'' of an eigenvalue is the count of the number of times it occurs on the diagonal.
| |
| | |
| Now. it can be supposed for a given eigenvalue '''λ''', of algebraic multiplicity
| |
| '''k''', that '''U''' has been contrived so that '''λ''' occurs as the first
| |
| '''k''' diagonal elements.
| |
| | |
| [[File:u shur.gif|link=|alt=]]
| |
| | |
| Place {{nowrap|'''U''' − λ'''I'''}} in ''block form'' as below.
| |
| | |
| [[File:u i shur.gif|link=|alt=]]
| |
| | |
| The lower left block has only elements of ''zero''. The {{nowrap|β<sub>i</sub> {{=}} λ<sub>i</sub> − λ ≠ 0}}
| |
| for {{nowrap|i {{=}} k+1, ..., n}}. It is easy to verify the following.
| |
| | |
| [[File:u i 2 shur.gif|link=|alt=]]
| |
| | |
| [[File:u i k shur.gif|link=|alt=]]
| |
| | |
| Where '''B''' is the {{nowrap|k × k}} subtriangular matrix, with all elements on or below the diagonal equal to 0,
| |
| and '''T''' is the {{nowrap|(n-k) × (n-k)}} upper triangular matrix, taken from the blocks of {{nowrap|('''U''' − λ'''I''')}}, as shown below.
| |
| | |
| [[File:b shur.gif|link=|alt=]]
| |
| | |
| Now, almost trivially,
| |
| | |
| [[File:b k shur.gif|link=|alt=]]
| |
| | |
| That is '''B<sup>k</sup>''' has only elements of 0 and '''T<sup>k</sup>''' is triangular with all non zero diagonal elements.
| |
| Observe that if a column vector v = [v<sub>1</sub>, v<sub>2</sub>, ..., v<sub>k</sub>]<sup>T</sup>,
| |
| is multiplied by '''B''', then after the first multiplication the last, kth, component is zero. After the second multiplication the second to last, (k-1)th component is zero, also, and so on.
| |
| | |
| The conclusion that {{nowrap|('''U''' − λ '''I''')<sup>k</sup>}} has ''[[Rank (linear algebra)|rank]]'' (n-k)
| |
| and ''[[Kernel (linear algebra)|nullity]]'' k follows.
| |
| | |
| It is only left to observe,
| |
| since {{nowrap|('''A''' − λ'''I''')<sup>k</sup> {{=}} '''Q''' ('''U''' − λ '''I''')<sup>k</sup> '''Q'''<sup>T</sup>}},
| |
| that {{nowrap|('''A''' − λ'''I''')<sup>k</sup>}} has ''rank'' (n-k) and ''nullity'' k, as well.
| |
| A ''unitary'', or any other similarity transformation by a non-singular matrix preserves rank.
| |
| | |
| The main result is now proven.
| |
| | |
| '''Theorem:'''<br>
| |
| If λ is an ''eigenvalue'' of a matrix '''A''' with ''algebraic multiplicity'' k, then the ''null space'' of {{nowrap|('''A''' − λ'''I''')<sup>k</sup>}} has dimension k.
| |
| | |
| An important observation is that raising the power of {{nowrap|('''A''' − λ'''I''')}} above k will not affect the ''[[Rank (linear algebra)|rank]]'' and ''[[Kernel (linear algebra)|nullity]]'' any further.
| |
| | |
| == Motivation of the Procedure ==
| |
| | |
| === Introduction ===
| |
| | |
| In the section ''Existence of Eigenvalues'' it was shown that when a
| |
| {{nowrap|n × n}} matrix '''A''', has an
| |
| ''eigenvalue'' λ, of ''algebraic multiplicity'' k, then the ''null space'' of {{nowrap|('''A''' − λ'''I''')<sup>k</sup>}}, has dimension k.
| |
|
| |
| The ''Generalized Eigenspace'' of '''A''', λ will be defined to be the ''null space'' of {{nowrap|('''A''' − λ'''I''')<sup>k</sup>}}.
| |
| Many authors prefer to call this the ''[[Kernel (linear algebra)|kernel]]'' of {{nowrap|('''A''' − λ'''I''')<sup>k</sup>}}.
| |
| | |
| Notice that if a {{nowrap|n × n}} matrix has ''eigenvalues'' {{nowrap|λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>}}
| |
| with ''algebraic multiplicities'' {{nowrap|k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>}},
| |
| then {{nowrap|k<sub>1</sub> + k<sub>2</sub> + ... + k<sub>r</sub> {{=}} n}}.
| |
| | |
| It will turn out that any two ''generalized eigenspaces'' of '''A''', associated with different ''eigenvalues'', will have a trivial intersection of '''{0}'''. From this it follows that the ''generalized eigenspaces'' of '''A''' combined span '''C<sup>n</sup>''', the set of all n dimensional column vectors of complex numbers.
| |
| | |
| The motivation for using a recursive procedure starting with the ''eigenvectors'' of '''A''' and solving for a basis of the ''generalized eigenspace'' of '''A''', λ using the matrix {{nowrap|('''A''' − λ '''I''')}}, will be expounded on.
| |
| | |
| === Notation ===
| |
| | |
| Some notation is introduced to help abbreviate statements.
| |
| | |
| * '''C<sup>n</sup>''' is the vector space of all n dimensional ''column'' vectors of ''complex numbers''.
| |
| * The ''Null Space'' of '''A''', {{nowrap|N('''A''') {{=}} {'''x''': '''A''' '''x''' {{=}} '''0'''}}}.
| |
| * '''V''' ⊆ '''W''' denotes '''V''' is a ''subset'' of '''W'''.
| |
| * '''V''' ⊂ '''W''' denotes '''V''' is a ''proper subset'' of '''W'''.
| |
| * The ''Range'' of '''A''' over '''V''', is {{nowrap|'''A'''('''V''') {{=}} {'''y''': '''y''' {{=}} '''A''' '''x'''}}, for some {{nowrap|'''x''' ∈ '''V'''}.}}
| |
| * '''W''' \ '''V''' denotes the set {'''x''': '''x''' ∈ '''W''' and '''x''' is not in '''V'''}.
| |
| * The ''Range'' of '''A''' is '''A'''('''C<sup>n</sup>''') and will be denoted by R('''A''').
| |
| * dim('''V''') denotes the ''dimension'' of '''V'''.
| |
| * '''{0}''' is the ''trivial subspace'' of '''C<sup>n</sup>'''.
| |
| | |
| === Preliminary Observations ===
| |
| | |
| Throughout this discussion it is assumed that '''A''' is a
| |
| {{nowrap|n × n}} matrix of complex numbers.
| |
| | |
| Since {{nowrap|'''A'''<sup>m</sup> '''x''' {{=}} '''A''' ('''A'''<sup>m-1</sup> '''x''')}}, the inclusions
| |
| | |
| :{{nowrap|N('''A''') ⊆ N('''A'''<sup>2</sup>) ⊆ ... ⊆ N('''A'''<sup>m-1</sup>) ⊆ N('''A'''<sup>m</sup>)}},
| |
| | |
| are obvious. Since {{nowrap|'''A'''<sup>m</sup> '''x''' {{=}} '''A'''<sup>m-1</sup>('''A''' '''x''')}}, the inclusions
| |
| | |
| :{{nowrap|R('''A''') ⊇ R('''A'''<sup>2</sup>) ⊇ ... ⊇ R('''A'''<sup>m-1</sup>) ⊇ R('''A'''<sup>m</sup>)}},
| |
| | |
| are clear as well.
| |
| | |
| '''Theorem:'''
| |
| | |
| When the more trivial case {{nowrap|N('''A'''<sup>2</sup>) {{=}} N('''A''')}}, does not hold,
| |
| there exists {{nowrap|k ≥ 2}}, such that the inclusions,
| |
| | |
| : {{nowrap|N('''A''') ⊂ N('''A'''<sup>2</sup>) ⊂ ... ⊂ N('''A'''<sup>k-1</sup>) ⊂ N('''A'''<sup>k</sup>) {{=}} N('''A'''<sup>k+1</sup>) {{=}} ...}},
| |
| | |
| and
| |
| | |
| :{{nowrap|R('''A''') ⊃ R('''A'''<sup>2</sup>) ⊃ ... ⊃ R('''A'''<sup>k-1</sup>) ⊃ R('''A'''<sup>k</sup> {{=}} R('''A'''<sup>k+1</sup>) {{=}} ...}},
| |
| | |
| are proper.
| |
| | |
| ''Proof:'' {{nowrap|0 ≤ dim(R('''A'''<sup>m+1</sup>)) ≤ dim(R('''A'''<sup>m</sup>))}}
| |
| so eventually dim(R('''A'''<sup>m+1</sup>)) = dim(R('''A'''<sup>m</sup>)),
| |
| for some m. From the inclusion {{nowrap|R('''A'''<sup>m+1</sup>) ⊆ R('''A'''<sup>m</sup>)}}
| |
| it is seen that a basis for R('''A'''<sup>m+1</sup>) is a basis for R('''A'''<sup>m</sup>) as well. That is, {{nowrap|R('''A'''<sup>m+1</sup>) {{=}} R('''A'''<sup>m</sup>)}}.
| |
| Since {{nowrap|R('''A'''<sup>m+1</sup>) {{=}} '''A'''(R('''A'''<sup>m</sup>))}}, when
| |
| {{nowrap|R('''A'''<sup>m+1</sup>) {{=}} R('''A'''<sup>m</sup>)}}, it will be
| |
| {{nowrap|R('''A'''<sup>m+2</sup>) {{=}} '''A'''(R('''A'''<sup>m+1</sup>)) {{=}} '''A'''(R('''A'''<sup>m</sup>)) {{=}} R('''A'''<sup>m+1</sup>)}}. By the ''rank nullity theorem'', it will also be the case that {{nowrap|dim(N('''A'''<sup>m+2</sup>)) {{=}} dim(N('''A'''<sup>m+1</sup>)) {{=}} dim(N('''A'''<sup>m</sup>))}}, for the same m. From the inclusions
| |
| {{nowrap|N('''A'''<sup>m+2</sup>) ⊆ N('''A'''<sup>m+1</sup>) ⊆ N('''A'''<sup>m</sup>)}}, it is clear that a basis for N('''A'''<sup>m+2</sup>) is also a basis for N('''A'''<sup>m+1</sup>) and N('''A'''<sup>m</sup>). So {{nowrap|N('''A'''<sup>m+2</sup>) {{=}} N('''A'''<sup>m+1</sup>) {{=}} N('''A'''<sup>m</sup>)}}. Now, k is the first m for which this happens. ''QED''
| |
| | |
| Since certain expressions will occur many times in the following, some more notation will be introduced.
| |
| | |
| * {{nowrap|'''A<sub>λ,k</sub>''' {{=}} ('''A''' − λ'''I''')<sup>k</sup>}}
| |
| * {{nowrap|'''N<sub>λ,k</sub>''' {{=}} N(('''A''' − λ'''I''')<sup>k</sup>) {{=}} N('''A<sub>λ,k</sub>''')}}
| |
| * {{nowrap|'''R<sub>λ,k</sub>''' {{=}} R(('''A''' − λ'''I''')<sup>k</sup>) {{=}} R('''A<sub>λ,k</sub>''')}}
| |
| | |
| From the inclusions {{nowrap|'''N<sub>λ,1</sub>''' ⊂}}
| |
| {{nowrap|'''N<sub>λ,2</sub>''' ⊂ ... ⊂}}
| |
| {{nowrap|'''N<sub>λ,k-1</sub>''' ⊂ '''N<sub>λ,k</sub>'''}}
| |
| {{nowrap|{{=}} '''N<sub>λ,k+1</sub>''' {{=}} ...}},
| |
| {{nowrap|'''N<sub>λ,k</sub>''' \ '''{0}''' {{=}} ∪ ('''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>)'''}}, for {{nowrap|m {{=}} 1, ..., k}} and {{nowrap|'''N<sub>λ,0</sub>''' {{=}} '''{0}'''}}, follows.
| |
| | |
| When λ is an eigenvalue of '''A''', in the statement above, k will not exceed the algebraic multiplicity of λ, and can be less. In fact when k would only be 1 is when there is a full set of linearly independent eigenvectors. Let's consider when {{nowrap|k ≥ 2}}.
| |
| | |
| Now, {{nowrap|'''x''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}, if and only {{nowrap|if '''A<sub>λ,m</sub>''' '''x''' {{=}} '''0'''}}, and {{nowrap|'''A<sub>λ,m-1</sub>''' '''x''' ≠ '''0'''}}.
| |
| | |
| Make the observation that {{nowrap|'''A<sub>λ,m</sub>''' '''x''' {{=}} '''0''',}}
| |
| and {{nowrap|'''A<sub>λ,m-1</sub>''' '''x''' ≠ '''0'''}}, if and only {{nowrap|if '''A<sub>λ,m-1</sub>''' '''A<sub>λ,1</sub>''' '''x''' {{=}} '''0'''}},
| |
| and {{nowrap|'''A<sub>λ,m-2</sub>''' '''A<sub>λ,1</sub>''' '''x''' ≠ '''0'''}}.
| |
| | |
| So, {{nowrap|'''x''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}, if and only {{nowrap|if '''A<sub>λ,1</sub>''' '''x''' ∈ '''N<sub>λ,m-1</sub>''' \ '''N<sub>λ,m-2</sub>'''}}.
| |
| | |
| === Recursive Procedure ===
| |
| | |
| Consider a matrix '''A''', with an ''eigenvalue'' λ of ''algebraic multiplicity'' {{nowrap|k ≥ 2}}, such that there are not k ''linearly independent eigenvectors'' associated with λ.
| |
| | |
| It is desired to extend the ''eigenvectors'' to a ''basis'' for {{nowrap|'''N<sub>λ,k</sub>'''}}. That is a ''basis'' for the ''generalized eigenvectors'' associated with λ.
| |
| | |
| There exists some {{nowrap|2 ≤ r ≤ k}}, such that
| |
| | |
| :{{nowrap|'''N<sub>λ,1</sub>''' ⊂ '''N<sub>λ,2</sub>''' ⊂ ...}} {{nowrap|⊂ '''N<sub>λ,r-1</sub>''' ⊂ '''N<sub>λ,r</sub>''' {{=}} '''N<sub>λ,r+1</sub>''' {{=}} ...,}} {{nowrap|'''N<sub>λ,r</sub>''' \ '''{0}''' {{=}} ∪ ('''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>''')}}, for {{nowrap|m {{=}} 1, ..., r}} and {{nowrap|'''N<sub>λ, 0</sub>''' {{=}} '''{0}'''}}.
| |
| | |
| The ''eigenvectors'' are {{nowrap|'''N<sub>λ,1</sub>''' \ '''{0}'''}}, so let {{nowrap|'''x<sub>1</sub>''', ..., '''x<sub>r<sub>1</sub></sub>''' }} be a basis for {{nowrap|'''N<sub>λ,1</sub>''' \ '''{0}'''}}.
| |
| | |
| Note that each {{nowrap|'''N<sub>λ,m</sub>'''}} is a ''subspace'' and so a ''basis'' for {{nowrap|'''N<sub>λ,m-1</sub>'''}} can be extended to a ''basis'' for '''N<sub>λ,m</sub>'''.
| |
| | |
| Because of this we can expect to find some r<sub>2</sub> = {{nowrap|dim('''N<sub>λ,2</sub>''') − dim('''N<sub>λ,1</sub>''')}} ''linearly independent'' vectors {{nowrap|'''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+r<sub>2</sub></sub>'''}} such that {{nowrap|'''x<sub>1</sub>''', ..., '''x<sub>r<sub>1</sub></sub> '''}}, '''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+r<sub>2</sub></sub>''' is a ''basis'' for '''N<sub>λ,2</sub>'''
| |
| | |
| Now, {{nowrap|'''x''' ∈ '''N<sub>λ,2</sub>''' \ '''N<sub>λ,1</sub>'''}}, if and only if {{nowrap|'''A<sub>λ,1</sub>''' '''x''' ∈ '''N<sub>λ,1</sub> \ '''{0}'''}}.
| |
| | |
| Thus we can expect that for each '''x''' ∈
| |
| {{nowrap| {'''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+r<sub>2</sub></sub>'''} }},
| |
| '''A<sub>λ,1</sub>''' '''x''' = α<sub>1</sub> '''x<sub>1</sub>''' + ... + α<sub>r<sub>1</sub></sub> '''x<sub>r<sub>1</sub></sub>''', for some {{nowrap|α<sub>1</sub>, ..., α<sub>r<sub>1</sub></sub>}}, depending on '''x'''.
| |
| | |
| Suppose we have reached the stage in the construction so that m-1 sets,
| |
| | |
| :{'''x<sub>1</sub>''', ..., '''x<sub>r<sub>1</sub></sub>'''}, {'''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+r<sub>2</sub></sub>'''}, ..., {'''x<sub>r<sub>1</sub>+ ... + r<sub>m-2</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub></sub>'''}
| |
| | |
| such that
| |
| | |
| :'''x<sub>1</sub>''', ..., '''x<sub>r<sub>1</sub></sub>''' , '''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+r<sub>2</sub></sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m-2</sub> + 1</sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub></sub>'''
| |
| | |
| is a ''basis'' for '''N<sub>λ,m-1</sub>''', have been found.
| |
| | |
| We can expect to find some
| |
| | |
| :r<sub>m</sub> = dim('''N<sub>λ,m</sub>''') − dim('''N<sub>λ,m-1</sub>''')
| |
| | |
| ''linearly independent'' vectors
| |
| | |
| :'''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m</sub></sub>'''
| |
| | |
| such that
| |
| | |
| :'''x<sub>1</sub>''', ..., '''x<sub>r<sub>1</sub></sub>''' , '''x<sub>r<sub>1</sub>+1</sub>''', ..., '''x<sub>r<sub>1</sub>+ r<sub>2</sub></sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub> + 1</sub>''', ..., '''x<sub>r<sub>1</sub>+ ... + r<sub>m</sub></sub>'''
| |
| | |
| is a ''basis'' for '''N<sub>λ, m</sub>'''
| |
| | |
| Again, {{nowrap|'''x''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}, if and only if {{nowrap|'''A<sub>λ,1</sub>''' '''x''' ∈ '''N<sub>λ,m-1</sub>''' \ '''N<sub>λ,m-2</sub>'''}}.
| |
| | |
| Thus we can expect that for each '''x''' ∈ {'''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub> + 1</sub>''', ..., '''x<sub>r<sub>1</sub>+ .... + r<sub>m</sub></sub>'''}, {{nowrap|'''A<sub>λ,1</sub>''' '''x''' {{=}}}}
| |
| α<sub>1</sub> '''x<sub>1</sub>''' + ... + α<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub></sub> '''x<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub></sub>''', for some {{nowrap|α<sub>1</sub>, ..., α<sub>r<sub>1</sub>+ ... + r<sub>m-1</sub></sub>}},
| |
| depending on '''x'''.
| |
| | |
| Some of the {{nowrap|{α<sub>r<sub>1</sub>+ ... + r<sub>m-2</sub> + 1</sub>, ..., α<sub>r<sub>1</sub>+ .... + r<sub>m-1</sub></sub>}}},
| |
| will be non zero, since {{nowrap|'''A<sub>λ,1</sub>''' '''x'''}} must lie in {{nowrap|'''N<sub>λ,m-1</sub>''' \ '''N<sub>λ,m-2</sub>'''}}.
| |
| | |
| The procedure is continued until {{nowrap|m {{=}} r}}.
| |
| | |
| The α<sub>i</sub> are not truly arbitrary and must be chosen, accordingly, so that sums
| |
| α<sub>1</sub> '''x<sub>1</sub>''' + α<sub>2</sub> '''x<sub>2</sub>''' + ... are in the range of '''A<sub>λ,1</sub>'''.
| |
| | |
| === Generalized Eigenspace Decomposition ===
| |
| | |
| As was stated in the Introduction, if a {{nowrap|n × n}} matrix has
| |
| ''eigenvalues'' {{nowrap|λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>}}
| |
| with ''algebraic multiplicities'' {{nowrap|k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>}},
| |
| then {{nowrap|k<sub>1</sub> + k<sub>2</sub> + ... + k<sub>r</sub> {{=}} n}}.
| |
| | |
| When '''V<sub>1</sub>''' and '''V<sub>2</sub>''' are two ''subspaces'', satisfying
| |
| {{nowrap|'''V<sub>1</sub>''' ∩ '''V<sub>2</sub>''' {{=}} '''{0}'''}},
| |
| their ''direct sum'', {{resize|'''⊕'''}} is defined and notated by
| |
| * {{nowrap|'''V<sub>1</sub>''' {{resize|'''⊕'''}} '''V<sub>2</sub>''' {{=}} {v<sub>1</sub> + v<sub>2</sub> : v<sub>1</sub> ∈ '''V<sub>1</sub>''' and v<sub>2</sub> ∈ '''V<sub>2</sub>'''} }}.
| |
| | |
| {{nowrap|'''V<sub>1</sub>''' {{resize|'''⊕'''}} '''V<sub>2</sub>''' }}
| |
| is also a ''subspace'' and
| |
| {{nowrap|dim('''V<sub>1</sub>''' {{resize|'''⊕'''}} '''V<sub>2</sub>''')}}
| |
| = {{nowrap|dim('''V<sub>1</sub>''') + dim('''V<sub>2</sub>''')}}.
| |
| | |
| Since {{nowrap|dim('''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''') {{=}} k<sub>i</sub>}},
| |
| for {{nowrap|i {{=}} 1, 2, ..., r}}, after it is shown that
| |
| {{nowrap|'''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' ∩}}
| |
| {{nowrap|'''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>''' {{=}} '''{0}'''}},
| |
| for {{nowrap|i ≠ j}}, we have the main result.
| |
| | |
| '''Theorem: ''Generalized Eigenspace Decomposition Theorem'''''
| |
| | |
| {{nowrap|'''C<sup>n</sup>''' {{=}}}}
| |
| {{nowrap|'''N<sub>λ<sub>1</sub>,k<sub>1</sub></sub>''' {{resize|'''⊕'''}}}}
| |
| {{nowrap|'''N<sub>λ<sub>2</sub>,k<sub>2</sub></sub>''' {{resize|'''⊕'''}}}}
| |
| {{nowrap|... {{resize|'''⊕'''}} '''N<sub>λ<sub>r</sub>,k<sub>r</sub></sub>'''}}.
| |
| | |
| This follows easily after we prove the theorem below.
| |
| | |
| '''Theorem:'''<br>
| |
| Let λ be an ''eigenvalue'' of '''A''' and β ≠ λ.
| |
| Then {{nowrap|'''A<sub>β,r</sub>'''('''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>''')}} =
| |
| {{nowrap|'''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}},
| |
| for any positive integers m and r.
| |
| | |
| '''Proof:'''<br>
| |
| If {{nowrap|'''x''' ∈ '''N<sub>λ,1</sub>''' \ '''{0}'''}},
| |
| {{nowrap|'''A<sub>λ,1</sub>''' '''x''' {{=}} ('''A''' − λ '''I''')'''x'''}} = '''0''',
| |
| then
| |
| {{nowrap|'''A''' '''x''' {{=}} λ '''x'''}} and
| |
| {{nowrap|'''A<sub>β,1</sub>''' '''x''' {{=}}}}
| |
| {{nowrap|('''A''' − β'''I''')'''x''' {{=}} (λ − β)'''x'''}}.
| |
| | |
| So {{nowrap|'''A<sub>β,1</sub>''' '''x''' ∈}}
| |
| {{nowrap|'''N<sub>λ,1</sub>''' \ '''{0}'''}} and
| |
| {{nowrap|'''A<sub>β,1</sub>''' (λ − β)<sup>−1</sup>'''x''' {{=}} '''x'''}}.
| |
| | |
| It holds
| |
| {{nowrap|'''A<sub>β,1</sub>''' ('''N<sub>λ,1</sub>''' \ '''{0}''') {{=}}}}
| |
| {{nowrap|'''N<sub>λ,1</sub>''' \ '''{0}'''}}.
| |
| | |
| Now, {{nowrap|'''x''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}},
| |
| if and only if {{nowrap|'''A<sub>λ,m</sub>''' '''x'''}} =
| |
| {{nowrap|('''A''' − λ'''I''')'''A<sub>λ,m-1</sub>''' '''x''' {{=}} '''0'''}},
| |
| and {{nowrap|'''A<sub>λ,m-1</sub>''' '''x''' ≠ '''0'''}}.
| |
| | |
| In the case,
| |
| {{nowrap|'''x''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}},
| |
| {{nowrap|'''A<sub>λ,m-1</sub>''' '''x''' ∈ '''N<sub>λ,1</sub>''' \ '''0'''}},
| |
| and
| |
| {{nowrap|'''A<sub>β,1</sub>''' '''A<sub>λ,m-1</sub>''' '''x''' {{=}}}}
| |
| {{nowrap|(λ − β) '''A<sub>λ,m-1</sub>''' '''x''' ≠ '''0'''}}.
| |
| The ''operators'' '''A<sub>β,1</sub>''' and
| |
| '''A<sub>λ,m-1</sub>''' commute.
| |
| Thus
| |
| {{nowrap|'''A<sub>λ,m</sub>''' ('''A<sub>β,1</sub>''' '''x''') {{=}} '''0'''}} and
| |
| {{nowrap|'''A<sub>λ,m-1</sub>''' ('''A<sub>β,1</sub>''' '''x''') ≠ '''0'''}},
| |
| which means
| |
| {{nowrap|'''A<sub>β,1</sub>''' '''x'''}}
| |
| {{nowrap|∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}.
| |
| | |
| Now, let our ''induction hypothesis'' be,
| |
| {{nowrap|'''A<sub>β,1</sub>'''('''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>)}} = {{nowrap|'''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}.
| |
| | |
| The relation {{nowrap| '''A<sub>β,1</sub>''' '''x''' {{=}}}}
| |
| {{nowrap|(λ − β) '''x''' + '''A<sub>λ,1</sub>''' '''x'''}} holds.
| |
| | |
| For {{nowrap| '''y''' ∈ '''N<sub>λ,m+1</sub>''' \ '''N<sub>λ, m</sub>'''}},
| |
| let {{nowrap| '''x''' {{=}} (λ − β)<sup>-1</sup> '''y''' + '''z'''}}.
| |
| | |
| Then
| |
| {{nowrap|'''A<sub>β,1</sub>''' '''x'''}}
| |
| {{nowrap|{{=}} '''y''' + (λ − β)<sup>-1</sup>'''A<sub>λ,1</sub>''' '''y''' + (λ − β) '''z''' + '''A<sub>λ,1</sub>''' '''z'''}}
| |
| {{nowrap|{{=}} '''y''' + (λ − β)<sup>-1</sup>'''A<sub>λ,1</sub>''' '''y''' + '''A<sub>β,1</sub>''' '''z'''}}.
| |
| | |
| Now, {{nowrap|'''A<sub>λ,1</sub>''' '''y''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}}
| |
| and, by the induction hypothesis, there exists
| |
| {{nowrap|'''z''' ∈ '''N<sub>λ,m</sub>''' \ '''N<sub>λ,m-1</sub>'''}} that solves
| |
| {{nowrap|'''A<sub>β,1</sub>''' '''z''' {{=}} −(λ − β)<sup>-1</sup>'''A<sub>λ,1</sub>''' '''y'''}}.
| |
| | |
| It follows {{nowrap|'''x''' ∈ '''N<sub>λ,m+1</sub>''' \ '''N<sub>λ,m</sub>'''}}
| |
| and solves {{nowrap|'''A<sub>β,1</sub>''' '''x''' {{=}} '''y'''}}.
| |
| | |
| So {{nowrap|'''A<sub>β,1</sub>'''('''N<sub>λ,m+1</sub>''' \ '''N<sub>λ,m</sub>''') {{=}}}}
| |
| {{nowrap|'''N<sub>λ,m+1</sub>''' \ '''N<sub>λ,m</sub>'''}}.
| |
| | |
| Repeatedly applying {{nowrap|'''A<sub>β,r</sub>''' {{=}} '''A<sub>β,1</sub>''' '''A<sub>β,r-1</sub>'''}} finishes the proof.
| |
| | |
| ¶
| |
| | |
| In fact, from the theorem just proved, for {{nowrap|i ≠ j}},
| |
| {{nowrap|'''A<sub>λ<sub>i</sub>,k<sub>i</sub></sub>'''('''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>''')}}{{nowrap|{{=}} '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>'''}}.
| |
| | |
| Now, suppose that
| |
| {{nowrap|'''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' ∩ '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>''' ≠ '''{0}'''}},
| |
| for some {{nowrap|i ≠ j}}.
| |
| | |
| Choose {{nowrap|'''x''' ∈ '''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' ∩
| |
| '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>''' ≠ '''0'''}}.
| |
| | |
| Since {{nowrap| '''x''' ∈ '''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>'''}}, it follows {{nowrap|'''A<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' '''x''' {{=}} '''0'''}}.
| |
| | |
| Since {{nowrap| '''x''' ∈ '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>'''}},
| |
| it follows
| |
| {{nowrap|'''A<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' '''x''' ≠ '''0'''}},
| |
| because '''A<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' preserves
| |
| dimension on '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>'''.
| |
| | |
| So it must be {{nowrap|'''N<sub>λ<sub>i</sub>,k<sub>i</sub></sub>''' ∩ '''N<sub>λ<sub>j</sub>,k<sub>j</sub></sub>''' {{=}} '''{0}'''}}, for {{nowrap|i ≠ j}}.
| |
| | |
| This concludes the proof of the ''Generalized Eigenspace Decomposition Theorem''.
| |
| | |
| === Powers of a Matrix ===
| |
| <!-- <meta content="powers of a matrix"> -->
| |
| | |
| ==== Using generalized eigenvectors ====
| |
| | |
| Assume '''A''' is a {{nowrap|n × n}} matrix with
| |
| ''eigenvalues'' {{nowrap|'''λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>'''}}
| |
| <br>of ''algebraic multiplicities'' {{nowrap|'''k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>'''}}.
| |
| | |
| For notational convenience
| |
| '''A<sub>λ, 0</sub> = I'''.
| |
| | |
| Note that
| |
| '''A<sub>β, 1</sub> = '''
| |
| '''(λ − β)I'''
| |
| + '''A<sub>λ, 1</sub> '''.
| |
| and apply the ''binomial theorem''.
| |
| | |
| :<math>A_{\beta,s}=((\lambda-\beta)I+A_{\lambda,1})^s=\sum_{m=0}^s\binom{s}{m}(\lambda-\beta)^{s-m}A_{\lambda,m}</math>
| |
| | |
| When '''λ''' is an ''eigenvalue'' of ''algebraic multiplicity'' '''k''',
| |
| and {{nowrap| '''x ∈ N<sub>λ, k</sub>''',}}<br>
| |
| then {{nowrap| '''A<sub>λ, m</sub> x {{=}} 0''', }}
| |
| for {{nowrap| '''m <span}} style="font-size:100%;">≥</span> k''', so in this case:
| |
| | |
| :<math>A_{\beta,s}x=\sum_{m=0}^{\min(s,k-1)}\binom{s}{m}(\lambda-\beta)^{s-m}A_{\lambda,m}x</math>
| |
| | |
| Since
| |
| '''C<sup>n</sup> =
| |
| N<sub>λ<sub>1</sub>, k<sub>1</sub></sub> {{resize|⊕}}
| |
| N<sub>λ<sub>2</sub>, k<sub>2</sub></sub> {{resize|⊕}}
| |
| ... {{resize|⊕}}
| |
| N<sub>λ<sub>r</sub>, k<sub>r</sub></sub>''',
| |
| <br>
| |
| any '''x''' in {{nowrap| '''C<sup>n</sup>''' }} can be expressed as
| |
| '''x = x<sub>1</sub> + x<sub>2</sub> + ... + x<sub>r</sub> ''',
| |
| <br>with each {{nowrap| '''x<sub>i</sub> ∈ N<sub>λ<sub>i</sub>, k<sub>i</sub></sub>'''}}.
| |
| Hence:
| |
| | |
| :<math>A_{\beta,s}x=\sum_{i=1}^r\sum_{m=0}^{\min(s,k_i-1)}\binom{s}{m}(\lambda_i-\beta)^{s-m}A_{\lambda_i,m}x_i</math>
| |
| | |
| The ''columns'' of {{nowrap| '''A<sub>β, s</sub>''' }} are obtained by letting
| |
| '''x''' vary across the ''standard basis'' vectors.
| |
| | |
| The case {{nowrap| '''A<sub>0, s</sub>''' }} is the power
| |
| '''A<sup>s</sup>''' of '''A'''.
| |
| | |
| ==== The minimal polynomial of a matrix ====
| |
| | |
| Assume '''A''' is a {{nowrap|n × n}} matrix with
| |
| ''eigenvalues'' {{nowrap|'''λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>'''}}
| |
| <br>of ''algebraic multiplicities'' {{nowrap|'''k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>'''}}.
| |
| | |
| For each {{nowrap| '''i''' }} define '''α(λ<sub>i</sub>)''',
| |
| the ''null index'' of {{nowrap| '''λ<sub>i</sub>''', }} to be the<br>smallest
| |
| positive integer {{nowrap| '''α''' }} such that
| |
| '''N<sub>λ<sub>i</sub>, α</sub>
| |
| = N<sub>λ<sub>i</sub>, k<sub>i</sub></sub>'''.
| |
| | |
| It is often the case that {{nowrap| '''α(λ<sub>i</sub>) < k<sub>i</sub></sub>'''}}.
| |
| | |
| Then
| |
| '''p(x) = ∏ (x − λ<sub>i</sub>)<sup>α(λ<sub>i</sub>)</sup>'''
| |
| is the ''minimal polynomial'' for '''A'''.
| |
| | |
| To see this note
| |
| '''p(A) = ∏ A <sub>λ<sub>i</sub>,α(λ<sub>i</sub>)</sub>'''
| |
| and the factors can be commuted in any order.
| |
| | |
| So
| |
| '''p(A) (N<sub>λ<sub>j</sub>, k<sub>j</sub></sub> ) = {0}''',
| |
| because
| |
| '''A <sub>λ<sub>j</sub>,α(λ<sub>j</sub>)</sub>'''
| |
| ''' (N<sub>λ<sub>j</sub>, k<sub>j</sub></sub> ) = {0}'''.
| |
| Being that
| |
| | |
| '''C<sup>n</sup> =
| |
| N<sub>λ<sub>1</sub>, k<sub>1</sub></sub> {{resize|⊕}}
| |
| N<sub>λ<sub>2</sub>, k<sub>2</sub></sub> {{resize|⊕}}
| |
| ... {{resize|⊕}}
| |
| N<sub>λ<sub>r</sub>, k<sub>r</sub></sub>''',
| |
| it is clear {{nowrap| '''p(A) {{=}} 0'''}}.
| |
| | |
| Now '''p(x)''' can not be of less degree because
| |
| '''A <sub>β, 1</sub>'''
| |
| '''(N<sub>λ<sub>j</sub>, k<sub>j</sub></sub> ) = '''
| |
| '''N<sub>λ<sub>j</sub>, k<sub>j</sub></sub> ''',
| |
| | |
| when {{nowrap| '''β ≠ λ<sub>j</sub>'''}},
| |
| and so
| |
| '''A <sub>λ<sub>j</sub>,α(λ<sub>j</sub>)</sub>'''
| |
| must be a factor of {{nowrap| '''p(A)''', }} for each '''j'''.
| |
| | |
| ==== Using confluent Vandermonde matrices ====
| |
| | |
| An alternative strategy is to use the ''characteristic polynomial'' of matrix '''A'''.
| |
| | |
| Let
| |
| '''p(x) = a<sub>0</sub> + a<sub>1</sub> x + a<sub>2</sub> x<sup>2</sup> + ... +'''
| |
| '''a<sub>n-1</sub> x<sup>n-1</sup> + x<sup>n</sup>'''
| |
| | |
| be the ''characteristic polynomial'' of '''A'''.
| |
| | |
| The ''minimal polynomial'' of '''A''' can be substituted for '''p(x)''' in this
| |
| discussion, if it is known,<br>and different, to reduce the degree '''n''' and the multiplicities
| |
| of the eigenvalues.
| |
| | |
| Then {{nowrap| '''p(A) {{=}} 0'''}} and {{nowrap|'''A<sup>n</sup> {{=}} '''}}
| |
| '''−(a<sub>0</sub> I + a<sub>1</sub> A + a<sub>2</sub> A<sup>2</sup> + ... +'''
| |
| '''a<sub>n-1</sub> A<sup>n-1</sup>)'''.
| |
| | |
| So {{nowrap| '''A<sup>n+m</sup> {{=}} '''}}
| |
| '''b<sub>m, 0</sub> I + b<sub>m, 1</sub> A + b<sub>m, 2</sub> A<sup>2</sup> + ... +'''
| |
| '''b<sub>m, n-1</sub> A<sup>n-1</sup>''',
| |
| | |
| where the
| |
| '''b<sub>m, 0</sub>, b<sub>m, 1</sub>, b<sub>m, 2</sub>,'''
| |
| '''..., b<sub>m, n-1</sub>, ''' satisfy the recurrence relation
| |
| | |
| <br>'''b<sub>m, 0</sub> = −a<sub>0</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''b<sub>m, 1</sub> = b<sub>m-1, 0</sub> − a<sub>1</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''b<sub>m, 2</sub> = b<sub>m-1, 1</sub> − a<sub>2</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''...,'''
| |
| <br>'''b<sub>m, n-1</sub> = b<sub>m-1, n-2</sub> − a<sub>n-1</sub> b<sub>m-1, n-1</sub>'''
| |
| | |
| with
| |
| '''b<sub>0, 0</sub> = b<sub>0, 1</sub> = b<sub>0, 2</sub>'''
| |
| '''= ... = b<sub>0, n-2</sub> = 0, '''and''' b<sub>0, n-1</sub> = 1'''.
| |
| | |
| This alone will reduce the number of multiplications needed to calculate a higher<br>
| |
| power of '''A''' by a factor of '''n<sup>2</sup>''', as compared to simply
| |
| multiplying '''A<sup>n+m</sup>''' by '''A'''.
| |
| | |
| In fact the
| |
| '''b<sub>m, 0</sub>, b<sub>m, 1</sub>, b<sub>m, 2</sub>,'''
| |
| '''..., b<sub>m, n-1</sub>, ''' can be calculated by a formula.
| |
| | |
| Consider first when '''A''' has ''distinct eigenvalues''
| |
| '''λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>n</sub>'''.
| |
| <br>Since {{nowrap|'''p(λ<sub>i</sub>) {{=}} 0''', }} for each {{nowrap| '''i''', }}
| |
| the {{nowrap| '''λ<sub>i</sub>''' }} satisfy the recurrence relation also. So:
| |
| | |
| :<math>\begin{bmatrix}
| |
| 1 & \lambda_1 & \lambda_1^2 & \cdots & \lambda_1^{n-1} \\
| |
| 1 & \lambda_2 & \lambda_2^2 & \cdots & \lambda_2^{n-1} \\
| |
| \vdots & \vdots & \vdots & & \vdots \\
| |
| 1 & \lambda_n & \lambda_n^2 & \cdots & \lambda_n^{n-1}
| |
| \end{bmatrix}\begin{bmatrix}
| |
| b_{m,0} \\ b_{m,1} \\ \vdots \\ b_{m,n-1}
| |
| \end{bmatrix}=\begin{bmatrix}
| |
| \lambda_1^{n+m} \\ \lambda_2^{n+m} \\ \vdots \\ \lambda_n^{n+m}
| |
| \end{bmatrix}</math>
| |
| | |
| The matrix {{nowrap| '''V''' }} in the equation is the well studied ''Vandermonde's'',
| |
| <br>for which formulas for its determinant and inverse are known.
| |
| | |
| :<math>\det(V(\lambda_1,\lambda_2,\ldots,\lambda_n))=\prod_{1\le i<j\le n}(\lambda_j-\lambda_i)</math>
| |
| | |
| In the case that {{nowrap|''' λ<sub>2</sub> {{=}} λ<sub>1</sub> '''}},
| |
| consider instead when
| |
| ''' λ<sub>1</sub> ''' is near {{nowrap|''' λ<sub>2</sub> '''}},
| |
| and<br>subtract row {{nowrap| '''1''' }} from row '''2''', which does not
| |
| affect the determinant.
| |
| | |
| :<math>\begin{bmatrix}
| |
| 1 & \lambda_1 & \lambda_1^2 & \cdots & \lambda_1^{n-1} \\
| |
| 0 & \lambda_2-\lambda_1 & \lambda_2^2-\lambda_1^2 & \cdots & \lambda_2^{n-1}-\lambda_1^{n-1} \\
| |
| \vdots & \vdots & \vdots & & \vdots \\
| |
| 1 & \lambda_n & \lambda_n^2 & \cdots & \lambda_n^{n-1}
| |
| \end{bmatrix}=
| |
| \begin{bmatrix}
| |
| \lambda_1^{n+m} \\
| |
| \lambda_2^{n+m}-\lambda_1^{n+m} \\
| |
| \vdots \\
| |
| \lambda_n^{n+m}
| |
| \end{bmatrix}</math>
| |
| | |
| After dividing the second row by
| |
| '''(λ<sub>2</sub> − λ<sub>1</sub>)'''
| |
| the determinant will be affected by<br>the removal of this factor and still be non-zero.
| |
| | |
| :<math>\begin{vmatrix}
| |
| 0 &
| |
| \frac{\lambda_2-\lambda_1}{(\lambda_2-\lambda_1)} &
| |
| \frac{\lambda_2^2-\lambda_1^2}{(\lambda_2-\lambda_1)} &
| |
| \cdots &
| |
| \frac{\lambda_2^{n-1}-\lambda_1^{n-1} }{(\lambda_2-\lambda_1)}
| |
| \end{vmatrix}
| |
| \quad \begin{vmatrix}\frac{\lambda_2^{n+m}-\lambda_1^{n+m} }{(\lambda_2-\lambda_1)}\end{vmatrix}</math>
| |
| | |
| Taking the limit as {{nowrap| '''λ<sub>1</sub>→ λ<sub>2</sub>'''}},
| |
| the new system has the second row ''differentiated''.
| |
| | |
| :<math>\begin{bmatrix}
| |
| 1 & \lambda_2 & \lambda_2^2 & \cdots & \lambda_2^{n-1} \\
| |
| 0 & 1 & 2\lambda_2 & \cdots & (n-1)\lambda_2^{n-2} \\
| |
| 1 & \lambda_3 & \lambda_3^2 & \cdots & \lambda_3^{n-1} \\
| |
| \vdots & \vdots & \vdots & & \vdots \\
| |
| 1 & \lambda_n & \lambda_n^2 & \cdots & \lambda_n^{n-1}
| |
| \end{bmatrix}=
| |
| \begin{bmatrix}
| |
| \lambda_2^{n+m} \\
| |
| (n+m)\lambda_2^{n+m-1} \\
| |
| \lambda_3^{n+m} \\
| |
| \vdots \\
| |
| \lambda_n^{n+m}
| |
| \end{bmatrix}</math>
| |
| | |
| The new system has determinant:
| |
| | |
| :<math>\det(V(\lambda_2,\ldots,\lambda_n))=\prod_{3\le j\le n}(\lambda_j-\lambda_2)^2\prod_{3\le i<j\le n}(\lambda_j-\lambda_i)</math>
| |
| | |
| In the case that {{nowrap|''' λ<sub>3</sub> {{=}} λ<sub>2</sub> '''}},
| |
| also, consider like before when
| |
| ''' λ<sub>2</sub> ''' is near ''' λ<sub>3</sub>''',
| |
| and<br>subtract row {{nowrap| '''1''' }} from row '''3''', which does not
| |
| affect the determinant. Next divide<br>row three by
| |
| '''(λ<sub>3</sub> − λ<sub>2</sub>)'''
| |
| and then subtract row {{nowrap| '''2''' }} from the new row '''3'''
| |
| and<br>follow by dividing the resulting row '''3''' by
| |
| '''(λ<sub>3</sub> − λ<sub>2</sub>)'''
| |
| again. This will affect the<br>determinant by removing a factor of
| |
| '''(λ<sub>3</sub> − λ<sub>2</sub>)<sup>2</sup>'''.
| |
| | |
| Each element of row '''3''' is now of the form
| |
| | |
| :<math>((f(\lambda_3)-f(\lambda_2))/(\lambda_3-\lambda_2)-f'(\lambda_2))/(\lambda_3-\lambda_2)</math>
| |
| and
| |
| :<math>((f(\lambda_3)-f(\lambda_2))/(\lambda_3-\lambda_2)-f'(\lambda_2))/(\lambda_3-\lambda_2)\rightarrow\tfrac{1}{2}f''(\lambda_3)\text{ as }\lambda_2\rightarrow\lambda_3</math>
| |
| | |
| The effect is to differentiate twice and multiply by one half.
| |
| | |
| :<math>\begin{bmatrix}
| |
| 1 & \lambda_3 & \lambda_3^2 & \lambda_3^3 & \cdots & \lambda_3^{n-1} \\
| |
| 0 & 1 & 2\lambda_3 & 3\lambda_3^2 & \cdots & (n-1)\lambda_3^{n-2} \\
| |
| 0 & 0 & 1 & 3\lambda_3 & \cdots & \tfrac{1}{2}(n-1)(n-2)\lambda_3^{n-3} \\
| |
| 1 & \lambda_4 & \lambda_4^2 & \lambda_4^3 & \dots & \lambda_4^{n-1} \\
| |
| \vdots & \vdots & \vdots & \vdots & & \vdots \\
| |
| 1 & \lambda_n & \lambda_n^2 & \lambda_n^3 & \cdots & \lambda_n^{n-1}
| |
| \end{bmatrix}
| |
| \begin{bmatrix}
| |
| \lambda_3^{n+m} \\
| |
| (n+m)\lambda_3^{n+m-1} \\
| |
| \tfrac{1}{2}(n+m)(n+m-1)\lambda_3^{n+m-2} \\
| |
| \lambda_4^{n+m} \\
| |
| \vdots \\
| |
| \lambda_n^{n+m}
| |
| \end{bmatrix}</math>
| |
| | |
| The new system has determinant:
| |
| | |
| :<math>\det(V(\lambda_3,\ldots,\lambda_n))=\prod_{4\le j\le n}(\lambda_j-\lambda_3)^3\prod_{4\le i<j\le n}(\lambda_j-\lambda_i)</math>
| |
| | |
| If it were that the multiplicity of the eigenvalue was even higher, then the next
| |
| row would<br>be differentiated three times and multiplied by {{nowrap| '''1/3!'''. }}
| |
| The progression is '''1/s! f<sup>(s)</sup>''', with the<br>constant coming from the
| |
| coefficients of the derivatives in the ''Taylor'' expansion. This<br>being done for
| |
| each ''eigenvalue'' of ''algebraic multiplicity'' greater than '''1'''.
| |
| | |
| '''example'''
| |
| | |
| The matrix
| |
| <math>A=\begin{bmatrix}
| |
| 1 & 0 & 0 & 0 & 0 \\
| |
| 3 & 1 & 0 & 0 & 0 \\
| |
| 6 & 3 & 2 & 0 & 0 \\
| |
| 10 & 6 & 3 & 2 & 0 \\
| |
| 15 & 10 & 6 & 3 & 2
| |
| \end{bmatrix}</math>
| |
| | |
| has ''characteristic polynomial''
| |
| '''p(x) = (x − 1)<sup>2</sup>(x − 2)<sup>3</sup>'''.
| |
| | |
| The
| |
| '''b<sub>m, 0</sub>, b<sub>m, 1</sub>, b<sub>m, 2</sub>,'''
| |
| ''' b<sub>m, 3</sub>, b<sub>m, 4</sub>, '''
| |
| for which
| |
| | |
| '''A<sup>5+m</sup> = '''
| |
| '''b<sub>m, 0</sub> I + b<sub>m, 1</sub> A + b<sub>m, 2</sub> A<sup>2</sup> +'''
| |
| '''b<sub>m, 3</sub> A<sup>3</sup> + b<sub>m, 4</sub> A<sup>4</sup>''',<br>
| |
| <br>satisfy the ''confluent'' Vandermonde system next.
| |
| | |
| :<math>\begin{bmatrix}
| |
| 1 & 1 & 1^2 & 1^3 & 1^4 \\
| |
| 0 & 1 & 2\cdot 1 & 3 \cdot 1^2 & 4\cdot 1^3 \\
| |
| 1 & 2 & 2^2 & 2^3 & 2^4 \\
| |
| 0 & 1 & 2\cdot 2 & 3\cdot 2^2 & 4\cdot 2^3 \\
| |
| 0 & 0 & 1 & 3\cdot 2 & 6\cdot 2^2
| |
| \end{bmatrix}
| |
| \begin{bmatrix} b_{m,0} \\ b_{m,1} \\ b_{m,2} \\ b_{m,3} \\ b_{m,4} \end{bmatrix} =
| |
| \begin{bmatrix}
| |
| 1^{5+m} \\
| |
| (5+m)\cdot 1^{5+m-1} \\
| |
| 2^{5+m} \\
| |
| (5+m)\cdot 2^{5+m-1} \\
| |
| \tfrac{1}{2}(5+m)(5+m-1)\cdot 2^{5+m-2}
| |
| \end{bmatrix}</math>
| |
| | |
| :<math>\begin{bmatrix} b_{m,0} \\ b_{m,1} \\ b_{m,2} \\ b_{m,3} \\ b_{m,4} \end{bmatrix} =
| |
| \begin{bmatrix}
| |
| -16 & -8 & 17 & -10 & 4 \\
| |
| 48 & 20 & -48 & 29 & -12 \\
| |
| -48 & -18 & 48 & -30 & 13 \\
| |
| 20 & 7 & -20 & 13 & -6 \\
| |
| -3 & -1 & 3 & -2 & 1
| |
| \end{bmatrix}
| |
| \begin{bmatrix}
| |
| 1 \\
| |
| (5+m) \\
| |
| 32 \cdot 2^m \\
| |
| 16(5+m) \cdot 2^m \\
| |
| 4(5+m)(5+m-1) \cdot 2^m
| |
| \end{bmatrix}</math>
| |
| | |
| ====Using difference equations====
| |
| | |
| <p style="line-height:150%;">
| |
| Returning to the recurrence relation for
| |
| '''b<sub>m, 0</sub>, b<sub>m, 1</sub>, b<sub>m, 2</sub>,'''
| |
| '''..., b<sub>m, n-1</sub>,'''
| |
| <br>'''b<sub>m, 0</sub> = −a<sub>0</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''b<sub>m, 1</sub> = b<sub>m-1, 0</sub> − a<sub>1</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''b<sub>m, 2</sub> = b<sub>m-1, 1</sub> − a<sub>2</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''...,'''
| |
| <br>'''b<sub>m, n-1</sub> = b<sub>m-1, n-2</sub> − a<sub>n-1</sub> b<sub>m-1, n-1</sub>'''
| |
| <br>with
| |
| '''b<sub>0, 0</sub> = b<sub>0, 1</sub> = b<sub>0, 2</sub>'''
| |
| '''= ... = b<sub>0, n-2</sub> = 0, '''and''' b<sub>0, n-1</sub> = 1'''.<br>
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| Upon substituting the first relation into the second,
| |
| <br>'''b<sub>m, 1</sub> = −a<sub>0</sub> b<sub>m-2, n-1</sub> − a<sub>1</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>and now this one into the {{nowrap|next }}
| |
| '''b<sub>m, 2</sub> = b<sub>m-1, 1</sub> − a<sub>2</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''b<sub>m, 2</sub> =
| |
| −a<sub>0</sub> b<sub>m-3, n-1</sub> − a<sub>1</sub> b<sub>m-2, n-1</sub>
| |
| − a<sub>2</sub> b<sub>m-1, n-1</sub>, '''
| |
| <br>'''...,''' and so on, the following difference equation is found.
| |
| <br>'''b<sub>m, n-1</sub> =
| |
| <br>'''
| |
| −a<sub>0</sub> b<sub>m-n, n-1</sub> − a<sub>1</sub> b<sub>m-n+1, n-1</sub>
| |
| − a<sub>2</sub> b<sub>m-n+2, n-1</sub>'''
| |
| − ... − a<sub>n-2</sub> b<sub>m-2, n-1</sub> − a<sub>n-1</sub> b<sub>m-1, n-1</sub>'''
| |
| <br>with
| |
| '''b<sub>0, n-1</sub> = b<sub>1, n-1</sub> = b<sub>2, n-1</sub>'''
| |
| '''= ... = b<sub>n-2, n-1</sub> = 0, '''and''' b<sub>n-1, n-1</sub> = 1'''.<br>
| |
| </p>
| |
| | |
| See the subsection on ''linear difference equations'' for more explanation.
| |
| | |
| ===Chains of generalized eigenvectors===
| |
| | |
| Some notation and results from previous sections are restated.
| |
| | |
| <ul style="margin-left:15;">
| |
| | |
| <li>
| |
| '''A''' is a {{nowrap|n × n}} matrix of complex numbers.
| |
| </li>
| |
| <li style="line-height:175%">
| |
| '''A<sub>λ, k</sub> = (A − λ I)<sup>k</sup>'''
| |
| </li>
| |
| <li style="line-height:175%">
| |
| '''N<sub>λ, k</sub> = N((A − λ I)<sup>k</sup>)
| |
| = N(A<sub>λ, k</sub>)'''
| |
| </li>
| |
| <li>
| |
| For
| |
| {{nowrap|'''V<sub>1</sub>''' {{resize|140%|'''∩'''}} '''V<sub>2</sub> {{=}} {0}'''}},
| |
| {{nowrap|'''V<sub>1</sub>''' {{resize|140%|'''⊕'''}} '''V<sub>2</sub>''' }}
| |
| '''= {v<sub>1</sub> + v<sub>2</sub> : {{nowrap|v<sub>1</sub> ∈ V<sub>1</sub>}}
| |
| '''and''' v<sub>2</sub> ∈ V<sub>2</sub>}'''.
| |
| </li>
| |
| | |
| </ul>
| |
| | |
| Assume '''A''' has
| |
| ''eigenvalues'' {{nowrap|'''λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>'''}}
| |
| <br>of ''algebraic multiplicities'' {{nowrap|'''k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>'''}}.
| |
| | |
| For each {{nowrap| '''i''' }} define '''α(λ<sub>i</sub>)''',
| |
| the ''null index'' of {{nowrap| '''λ<sub>i</sub>''', }} to be the<br>smallest
| |
| positive integer {{nowrap| '''α''' }} such that
| |
| '''N<sub>λ<sub>i</sub>, α</sub>
| |
| = N<sub>λ<sub>i</sub>, k<sub>i</sub></sub>'''.
| |
| | |
| It is always the case that {{nowrap| '''α(λ<sub>i</sub>) <span}} style="font-size:100%;">≤</span> k<sub>i</sub>'''.
| |
| | |
| When {{nowrap| '''α(λ) <span}} style="font-size:100%;">≥</span> 2 ''',
| |
| | |
| <p style="line-height:175%">
| |
| '''N<sub>λ, 1</sub> ⊂
| |
| N<sub>λ, 2</sub> ⊂ ... ⊂
| |
| N<sub>λ, α-1</sub> ⊂ N<sub>λ, α</sub>
| |
| = N<sub>λ, α+1</sub> = ...''',<br>
| |
| ''' N<sub>λ, α</sub> \ {0} = ∪
| |
| (N<sub>λ, m</sub> \ N<sub>λ, m-1</sub>)''',
| |
| for {{nowrap|'''m {{=}} 1, ..., α'''}} and {{nowrap|''' N<sub>λ, 0</sub> {{=}} {0}'''.}}
| |
| </p>
| |
| | |
| '''x ∈ N<sub>λ, m</sub> \ N<sub>λ, m-1</sub>''',
| |
| if and only if
| |
| '''A<sub>λ, 1</sub> x ∈ N<sub>λ, m-1</sub> \ N<sub>λ, m-2</sub>'''
| |
| | |
| Define a '''''chain''''' of ''generalized eigenvectors'' to be a set<br>
| |
| '''{ x<sub>1</sub>, x<sub>2</sub>, {{nowrap|..., x<sub>m</sub> }'''}}
| |
| such that
| |
| '''x<sub>1</sub> ∈ N<sub>λ, m</sub> \ N<sub>λ, m-1</sub>''',
| |
| and
| |
| '''x<sub>i+1</sub> = A<sub>λ, 1</sub> x<sub>i</sub>'''.
| |
| | |
| Then {{nowrap| '''x<sub>m</sub> ≠ 0''' }} and
| |
| '''A<sub>λ, 1</sub> x<sub>m</sub> = 0'''.
| |
| | |
| When
| |
| '''x<sub>1</sub> ∈ N<sub>λ, 1</sub> \ {0}''',
| |
| '''{x<sub>1</sub>}''' can be, for the sake of not requiring extra<br>
| |
| terminology, considered ''trivially'' a ''chain''.
| |
| | |
| When a ''disjoint'' collection of ''chains'' combined form a ''basis set''
| |
| for {{nowrap|'''N<sub>λ, α(λ)</sub>''' ,<br>they}} are often referred
| |
| to as ''Jordan chains'' and are the vectors used for<br> the columns of a
| |
| ''transformation'' matrix in the ''Jordan canonical form''.
| |
| | |
| When a ''disjoint'' collection of ''chains'' that combined form a ''basis set'',
| |
| <br>is needed that satisfy
| |
| '''β<sub>i+1</sub>x<sub>i+1</sub> = A<sub>λ, 1</sub> x<sub>i</sub>''',
| |
| for some scalars {{nowrap| '''β<sub>i</sub>''', ''chains''<br>as}} already defined
| |
| can be scaled for this purpose.
| |
| | |
| What will be proven here is that such a ''disjoint'' collection of ''chains''
| |
| <br>can always be constructed.
| |
| | |
| <p style="line-height:175%">
| |
| Before the proof is started, recall a few facts about ''direct sums''.<br>
| |
| When the notation {{nowrap|'''V<sub>1</sub>''' ⊕ V<sub>2</sub>''' '''}}
| |
| is used, it is assumed
| |
| {{nowrap|'''V<sub>1</sub>''' {{resize|140%|'''∩'''}} '''V<sub>2</sub> {{=}} {0}'''.}}
| |
| <br>For
| |
| '''x = v<sub>1</sub> + v<sub>2</sub> ''' with '''v<sub>1</sub> ∈ V<sub>1</sub>
| |
| '''and''' v<sub>2</sub> ∈ V<sub>2</sub>''' ,
| |
| then {{nowrap| '''x {{=}} 0''',<br>if}} and only if
| |
| '''v<sub>1</sub> = v<sub>2</sub> = 0'''.
| |
| </p>
| |
| | |
| In the discussion below<br>
| |
| '''δ<sub>i</sub> ='''
| |
| ''' dim(N<sub>λ, i</sub>) −'''
| |
| ''' dim(N<sub>λ, i−1</sub>)''', with
| |
| '''δ<sub>1</sub> ='''
| |
| ''' dim(N<sub>λ, 1</sub>)'''.
| |
| | |
| <p style="line-height:175%">
| |
| First consider when
| |
| ''' N<sub>λ, 2</sub> \ N<sub>λ, 1</sub>
| |
| ≠ {0} ''', Then a ''basis'' for
| |
| ''' N<sub>λ, 1</sub> ''' can be<br>''extended'' to a ''basis'' for
| |
| ''' N<sub>λ, 2</sub>'''. If
| |
| '''δ<sub>2</sub> = 1''', then there exists
| |
| '''x<sub>1</sub> ∈ N<sub>λ, 2</sub> \ N<sub>λ, 1</sub>''',
| |
| <br>such that
| |
| ''' N<sub>λ, 2</sub> ='''
| |
| ''' N<sub>λ, 1</sub> ⊕ span{x<sub>1</sub>}'''.
| |
| Let
| |
| '''x<sub>2</sub> = A<sub>λ, 1</sub> x<sub>1</sub>'''.
| |
| Then<br>
| |
| '''x<sub>2</sub> ∈ N<sub>λ, 1</sub> \ {0}''',
| |
| with '''x<sub>1</sub>''' and '''x<sub>2</sub>''' ''linearly independent''.
| |
| If {{nowrap|'''dim(N<sub>λ, 2</sub>) {{=}} 2''',<br>since}}
| |
| '''{x<sub>1</sub>, x<sub>2</sub>}''' is a ''chain'' we are through.
| |
| Otherwise {{nowrap|'''x<sub>1</sub>, x<sub>2</sub>''' }} can be extended<br>to a ''basis''
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>1</sub></sub>'''
| |
| for {{nowrap| '''N<sub>λ, 2</sub>'''. }} The sets
| |
| '''{x<sub>1</sub>, x<sub>2</sub>}, {x<sub>3</sub>}, ..., {x<sub>δ<sub>1</sub></sub>}'''
| |
| <br>form a ''disjoint'' collection of ''chains''.
| |
| In the case that
| |
| '''δ<sub>2</sub> > 1''', then there exist<br>
| |
| ''linearly independent''
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub> ∈ N<sub>λ, 2</sub> \ N<sub>λ, 1</sub>''',
| |
| such that<br>
| |
| ''' N<sub>λ, 2</sub> ='''
| |
| ''' N<sub>λ, 1</sub> ⊕ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub>}'''.
| |
| Let
| |
| '''y<sub>i</sub> = A<sub>λ, 1</sub> x<sub>i</sub>'''.
| |
| <br>Then
| |
| '''y<sub>i</sub> ∈ N<sub>λ, 1</sub> \ {0}''',
| |
| for '''i = {{nowrap|1, 2, ..., δ<sub>2</sub>'''}}.
| |
| To see the {{nowrap| '''y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>2</sub></sub>'''}}
| |
| <br>are ''linearly independent'', assume that for some
| |
| '''β<sub>1</sub>, β<sub>2</sub>, ..., β<sub>δ<sub>2</sub></sub>''',
| |
| <br>that
| |
| '''β<sub>1</sub>y<sub>1</sub> + β<sub>2</sub>y<sub>2</sub> + ... + β<sub>δ<sub>2</sub></sub>y<sub>δ<sub>2</sub></sub> = 0''',
| |
| Then for
| |
| '''x = β<sub>1</sub>x<sub>1</sub> + β<sub>2</sub>x<sub>2</sub> + ... + β<sub>δ<sub>2</sub></sub>x<sub>δ<sub>2</sub></sub>''',
| |
| <br>'''x ∈ N<sub>λ, 1</sub> ''',
| |
| and
| |
| '''x ∈ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub>}''',
| |
| which implies that {{nowrap|'''x {{=}} 0'''}}, and<br>
| |
| '''β<sub>1</sub>= β<sub>2</sub>= ... = β<sub>δ<sub>2</sub></sub> = 0'''.
| |
| Since '''span{y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>2</sub></sub>}'''
| |
| '''⊆ N<sub>λ, 1</sub>''', the vectors<br>
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub>'''
| |
| ''', y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>2</sub></sub>'''
| |
| are a ''linearly independent'' set.<br>
| |
| If {{nowrap| '''δ<sub>2</sub></sub> {{=}} '''δ<sub>1</sub>'''</sub>''', }}
| |
| then the sets
| |
| '''{x<sub>1</sub>, y<sub>1</sub>}, {x<sub>2</sub>, y<sub>2</sub>}, ..., {x<sub>δ<sub>2</sub></sub>, y<sub>δ<sub>2</sub></sub>}'''
| |
| form a<br>''disjoint'' collection of ''chains'' that when combined are a ''basis set''
| |
| for {{nowrap|''' N<sub>λ, 2</sub>'''.}}<br>
| |
| If {{nowrap| '''δ<sub>1</sub></sub> > '''δ<sub>2</sub>'''</sub>''', }}
| |
| then
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>'''δ'''<sub>2</sub></sub>'''
| |
| ''', y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>2</sub></sub>'''
| |
| can be extended to a ''basis''<br>for {{nowrap|''' N<sub>λ, 2</sub>''' }}
| |
| by some vectors
| |
| '''x<sub>δ<sub>2</sub>+1</sub>, ..., x<sub>δ<sub>1</sub></sub>'''
| |
| in {{nowrap|''' N<sub>λ, 1</sub>''', }} so that<br>
| |
| '''{x<sub>1</sub>, y<sub>1</sub>}, {x<sub>2</sub>, y<sub>2</sub>}, ..., {x<sub>δ<sub>2</sub></sub>, y<sub>δ<sub>2</sub></sub>}'''
| |
| ''', {x<sub>δ<sub>2</sub>+1</sub>}, ..., {x<sub>δ<sub>1</sub></sub>}'''
| |
| <br>forms a ''disjoint'' collection of ''chains''.
| |
| </p>
| |
| | |
| To reduce redundancy, in the next paragraph, when {{nowrap| '''δ {{=}} 1''' }}
| |
| the notation<br>
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ</sub>'''
| |
| will be understood simply to mean just '''x<sub>1</sub>''' and when
| |
| '''δ = 2''' <br>to mean
| |
| '''x<sub>1</sub>, x<sub>2</sub>'''.
| |
| | |
| <p style="line-height:175%">
| |
| So far it has been shown that, if ''linearly independent''<br>
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub> ∈ N<sub>λ, 2</sub> \ N<sub>λ, 1</sub>''',
| |
| are chosen, such that<br>
| |
| ''' N<sub>λ, 2</sub> ='''
| |
| ''' N<sub>λ, 1</sub> ⊕ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub>}''',
| |
| then there exists a ''disjoint''<br>collection of ''chains'' with each of the
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>2</sub></sub> '''
| |
| being the first member or ''top''<br>of one of the ''chains''. Furthermore, this
| |
| collection of ''vectors'', when combined,<br>forms a ''basis'' for
| |
| ''' N<sub>λ, 2</sub>'''.
| |
| </p>
| |
| | |
| <p style="line-height:175%">
| |
| Now, let the ''induction hypothesis'' be that, if ''linearly independent''<br>
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m</sub></sub> ∈ N<sub>λ, m</sub> \ N<sub>λ, m−1</sub>''',
| |
| are chosen, such that<br>
| |
| ''' N<sub>λ, m</sub> ='''
| |
| ''' N<sub>λ, m−1</sub> ⊕ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m</sub></sub>}''',
| |
| then there exists a ''disjoint''<br>collection of ''chains'' with each of the
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m</sub></sub> '''
| |
| being the first member or ''top''<br>of one of the ''chains''. Furthermore, this
| |
| collection of ''vectors'', when combined,<br>forms a ''basis'' for
| |
| ''' N<sub>λ, m</sub>'''.
| |
| </p>
| |
| | |
| <p style="line-height:175%">
| |
| Consider {{nowrap| '''m < α(λ)'''. }} A ''basis'' for
| |
| ''' N<sub>λ, m</sub>''' can always be ''extended'' to a ''basis''
| |
| for<br>''' N<sub>λ, m+1</sub>'''. So ''linearly independent''
| |
| '''x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m+1</sub></sub> ∈ N<sub>λ, m+1</sub> \ N<sub>λ, m</sub>''',
| |
| such that<br>
| |
| ''' N<sub>λ, m+1</sub> ='''
| |
| ''' N<sub>λ, m</sub> ⊕ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m+1</sub></sub>}''',
| |
| can be chosen.
| |
| Let
| |
| '''y<sub>i</sub> = A<sub>λ, 1</sub> x<sub>i</sub>'''.
| |
| <br>Then
| |
| '''y<sub>i</sub> ∈ N<sub>λ, m</sub> \ N<sub>λ, m−1</sub>''',
| |
| for '''i = {{nowrap|1, 2, ..., δ<sub>m+1</sub>'''}}.
| |
| To see the {{nowrap| '''y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>'''}}
| |
| <br>are ''linearly independent'', assume that for some
| |
| '''β<sub>1</sub>, β<sub>2</sub>, ..., β<sub>δ<sub>m+1</sub></sub>''',
| |
| <br>that
| |
| '''β<sub>1</sub>y<sub>1</sub> + β<sub>2</sub>y<sub>2</sub> + ... + β<sub>δ<sub>m+1</sub></sub>y<sub>δ<sub>m+1</sub></sub> = 0''',
| |
| Then for<br>
| |
| '''x = β<sub>1</sub>x<sub>1</sub> + β<sub>2</sub>x<sub>2</sub> + ... + β<sub>δ<sub>m+1</sub></sub>x<sub>δ<sub>m+1</sub></sub>''',
| |
| '''x ∈ N<sub>λ, 1</sub> ''',
| |
| and<br>
| |
| '''x ∈ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m+1</sub></sub>}''',
| |
| which implies that {{nowrap|'''x {{=}} 0'''}}, and<br>
| |
| '''β<sub>1</sub>= β<sub>2</sub>= ... = β<sub>δ<sub>m+1</sub></sub> = 0'''.
| |
| In addition,
| |
| ''' span{y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>}'''
| |
| '''∩ N<sub>λ, m−1</sub> = {0}'''.
| |
| <br>To see this assume that for some
| |
| '''β<sub>1</sub>, β<sub>2</sub>, ..., β<sub>δ<sub>m+1</sub></sub>''',
| |
| <br>that
| |
| '''β<sub>1</sub>y<sub>1</sub> + β<sub>2</sub>y<sub>2</sub> + ... + β<sub>δ<sub>m+1</sub></sub>y<sub>δ<sub>m+1</sub></sub> ∈ '''
| |
| '''N<sub>λ, m−1</sub> '''
| |
| Then for<br>
| |
| '''x = β<sub>1</sub>x<sub>1</sub> + β<sub>2</sub>x<sub>2</sub> + ... + β<sub>δ<sub>m+1</sub></sub>x<sub>δ<sub>m+1</sub></sub>''',
| |
| '''x ∈ N<sub>λ, m</sub> ''',
| |
| and<br>
| |
| '''x ∈ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m+1</sub></sub>}''',
| |
| which implies that {{nowrap|'''x {{=}} 0'''}}, and<br>
| |
| '''β<sub>1</sub>= β<sub>2</sub>= ... = β<sub>δ<sub>m+1</sub></sub> = 0'''.
| |
| The proof is nearly done.
| |
| </p>
| |
| | |
| <p style="line-height:175%">
| |
| At this point suppose that
| |
| '''b<sub>1</sub>, {{nowrap|b<sub>2</sub>, ..., b<sub>d<sub>m−1</sub></sub>'''}}
| |
| is any ''basis'' for '''N<sub>λ, m−1</sub>'''.<br>
| |
| Then '''''B'' = '''
| |
| '''span{b<sub>1</sub>, b<sub>2</sub>, ..., b<sub>d<sub>m−1</sub></sub>} ⊕'''
| |
| '''span{y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>}'''
| |
| <br>is a ''subspace'' of '''N<sub>λ, m</sub>'''.
| |
| If '''''B'' ≠ N<sub>λ, m</sub>''',
| |
| then<br>
| |
| '''b<sub>1</sub>, b<sub>2</sub>, ..., b<sub>d<sub>m−1</sub></sub>, '''
| |
| '''y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>'''
| |
| can be ''extended'' to a ''basis'' for '''N<sub>λ, m</sub>''',
| |
| <br>by some set of vectors
| |
| '''z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>(δ<sub>m</sub>− δ<sub>m+1</sub>)</sub>''' ,
| |
| in which case<br>
| |
| '''N<sub>λ, m</sub> ='''
| |
| ''' N<sub>λ, m−1</sub> ⊕ span{y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>}'''
| |
| '''⊕ span{z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>(δ<sub>m</sub>− δ<sub>m+1</sub>)</sub>}'''.
| |
| </p>
| |
| | |
| <p style="line-height:175%">
| |
| If {{nowrap| '''δ<sub>m</sub> {{=}} δ<sub>m+1</sub>''', }} then<br>
| |
| ''' N<sub>λ, m</sub> ='''
| |
| ''' N<sub>λ, m−1</sub> ⊕ span{y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>}'''
| |
| <br>or if {{nowrap| '''δ<sub>m</sub> > δ<sub>m+1</sub>''', }} then<br>
| |
| ''' N<sub>λ, m</sub> ='''
| |
| ''' N<sub>λ, m−1</sub> ⊕ span{z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>(δ<sub>m</sub>− δ<sub>m+1</sub>)</sub>'''
| |
| ''', y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub>}'''
| |
| <br>In either case apply the ''induction hypothesis'' to get that
| |
| there exists a ''disjoint''<br>collection of ''chains'' with each of the
| |
| '''y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>δ<sub>m+1</sub></sub> '''
| |
| being the first member or ''top''<br>of one of the ''chains''. Furthermore, this
| |
| collection of ''vectors'', when combined,<br>forms a ''basis'' for
| |
| ''' N<sub>λ, m</sub>'''. Now,
| |
| '''y<sub>i</sub> = A<sub>λ, 1</sub> x<sub>i</sub>''',
| |
| for '''i = {{nowrap|1, 2, ..., δ<sub>m+1</sub>'''}}, so each of
| |
| the<br>''chains'' beginning with {{nowrap| '''y<sub>i</sub>''' }} can be extended upwards into
| |
| ''' N<sub>λ, m+1</sub> \ N<sub>λ, m</sub>'''
| |
| to a ''chain''<br>beginning with {{nowrap| '''x<sub>i</sub>'''. }} Since
| |
| ''' N<sub>λ, m+1</sub> ='''
| |
| ''' N<sub>λ, m</sub> ⊕ span{x<sub>1</sub>, x<sub>2</sub>, ..., x<sub>δ<sub>m+1</sub></sub>}''',
| |
| <br>the ''combined vectors'' of the ''new chains'' form a ''basis'' for
| |
| ''' N<sub>λ, m+1</sub>'''.
| |
| </p>
| |
| | |
| ====Differential equations ''y′= Ay''====
| |
| | |
| <p style="line-height:175%;">
| |
| Let {{nowrap| '''A''' }} be a '''n×n''' matrix of complex numbers and
| |
| '''λ''' an ''eigenvalue'' of {{nowrap| '''A''' }},
| |
| with<br>''associated eigenvector'' {{nowrap| '''x''' }}.
| |
| Suppose {{nowrap| '''y(t)''' }} is a '''n''' ''dimensional vector valued<br>
| |
| function'', sufficiently smooth, so that {{nowrap| '''y′(t)''' }} is continuous. The restriction that '''y(t)'''<br> be ''smooth'' can be relaxed somewhat, but is not the main focus of this discussion.
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| The solutions to the equation '''''y′(t) = Ay(t)''''' are sought.
| |
| The first observation
| |
| is that<br> '''y(t) = e<sup>λt</sup>x''' will be a solution.
| |
| When {{nowrap| '''A''' }} does not have {{nowrap| '''n''' }} ''linearly independent''
| |
| <br>''eigenvectors'', solutions of this kind will not provide the
| |
| total of {{nowrap| '''n''' }} needed for a<br>''fundamental basis set''.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| In view of the existence of ''chains'' of ''generalized eigenvectors'' seek a
| |
| solution of<br>the form
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t''''''e<sup>λt</sup>x<sub>2</sub>''' ,
| |
| then<br>
| |
| '''y′(t) =''''''
| |
| λ e<sup>λt</sup>x<sub>1</sub> + e<sup>λt</sup>x<sub>2</sub>
| |
| + λ t e<sup>λt</sup>x<sub>2</sub> =''''''
| |
| e<sup>λt</sup>(λ </sup>'''x<sub>1</sub> + x<sub>2</sub>)'''''''''
| |
| + t e<sup>λt</sup>(λx <sub>2</sub>)'''
| |
| <br>and<br>
| |
| '''Ay(t) = e<sup>λt</sup>A x<sub>1</sub> + t e<sup>λt</sup>A x<sub>2</sub>''' .
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| In view of this, {{nowrap| '''y(t)''' }} will be a solution to '''y′(t) = Ay(t)''' , when
| |
| '''A x<sub>1</sub> = λ '''</sup>x<sub>1</sub> + x<sub>2</sub>'''''' and<br>
| |
| '''A x<sub>2</sub> = λ x<sub>2</sub>''' .
| |
| That is when
| |
| '''(A − λ I)x<sub>1</sub> = x<sub>2</sub>''' and
| |
| '''(A − λ I)x<sub>2</sub> = 0''' .
| |
| Equivalently,<br>when '''{x<sub>1</sub>, x<sub>2</sub>}''' is a ''chain'' of
| |
| ''generalized eigenvectors''.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| Continuing with this reasoning seek a solution of
| |
| the form<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>''' ,
| |
| then<br>
| |
| '''y′(t) =
| |
| λ e<sup>λt</sup>x<sub>1</sub> + e<sup>λt</sup>x<sub>2</sub>
| |
| + λ t e<sup>λt</sup>x<sub>2</sub>
| |
| + 2 t e<sup>λt</sup>x<sub>3</sub>
| |
| + λ t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>
| |
| <br> =
| |
| e<sup>λt</sup>(λ '''</sup>x<sub>1</sub> + x<sub>2</sub>)
| |
| + t e<sup>λt</sup>(λ x<sub>2</sub> + 2 x<sub>3</sub>)
| |
| + t<sup>2</sup> e<sup>λt</sup>(λ x<sub>3</sub>)''''''
| |
| and<br>
| |
| '''Ay(t) = e<sup>λt</sup>A x<sub>1</sub> + t e<sup>λt</sup>A x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>A x<sub>3</sub>''' .
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| Like before, {{nowrap| '''y(t)''' }} will be a solution to
| |
| '''y′(t) = Ay(t)''' , when
| |
| '''A x<sub>1</sub> = λ '''</sup>x<sub>1</sub> + x<sub>2</sub>'''''' ,<br>
| |
| '''A x<sub>2</sub> = λ '''x<sub>2</sub> + 2 x<sub>3</sub>'''''' , and
| |
| '''A x<sub>3</sub> = λ x<sub>3</sub>''' .
| |
| That is when
| |
| '''(A − λ I)x<sub>1</sub> = x<sub>2</sub>''' ,<br>
| |
| '''(A − λ I)x<sub>2</sub> = 2 x<sub>3</sub>''' , and
| |
| '''(A − λ I)x<sub>3</sub> = 0''' .
| |
| Since it will hold '''(A − λ I)(2 x<sub>3</sub>) = 0''' ,<br>
| |
| also, equivalently, when '''{x<sub>1</sub>, x<sub>2</sub>, 2 x<sub>3</sub>}'''
| |
| is a ''chain'' of ''generalized eigenvectors''.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| More generally, to find the progression, seek a solution of the form<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>
| |
| + t<sup>3</sup> e<sup>λt</sup>x<sub>4</sub> + ...
| |
| + t<sup>m−2</sup> e<sup>λt</sup>x<sub>m−1</sub>
| |
| + t<sup>m−1</sup> e<sup>λt</sup>x<sub>m</sub>''' ,
| |
| <br>then<br>
| |
| '''y′(t) =
| |
| λ e<sup>λt</sup>x<sub>1</sub> + e<sup>λt</sup>x<sub>2</sub>
| |
| + λ t e<sup>λt</sup>x<sub>2</sub>''''''
| |
| + 2 t e<sup>λt</sup>x<sub>3</sub>''''''
| |
| + λ t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>''''''
| |
| + 3 t<sup>2</sup> e<sup>λt</sup>x<sub>4</sub>''''''
| |
| + λ t<sup>3</sup> e<sup>λt</sup>x<sub>4</sub>''''''<br>
| |
| + ...''''''
| |
| + (m−2)t<sup>m−3</sup>e<sup>λt</sup>x<sub>m−1</sub>''''''
| |
| + λ t<sup>m−2</sup> e<sup>λt</sup>x<sub>m−1</sub>''''''
| |
| + (m−1)t<sup>m−2</sup> e<sup>λt</sup>x<sub>m</sub>''''''
| |
| + λ t<sup>m−1</sup> e<sup>λt</sup>x<sub>m</sub>
| |
| <br> =''''''
| |
| e<sup>λt</sup>(λ '''</sup>x<sub>1</sub> + x<sub>2</sub>)
| |
| + t e<sup>λt</sup>(λ x<sub>2</sub> + 2 x<sub>3</sub>)
| |
| + t<sup>2</sup> e<sup>λt</sup>(λ x<sub>3</sub> + 3 x<sub>4</sub>)
| |
| + t<sup>3</sup> e<sup>λt</sup>(λ x<sub>4</sub> + 4 x<sub>5</sub>)
| |
| '''<br>'''+ ...'''<br>'''
| |
| + t<sup>m−3</sup> e<sup>λt</sup>(λ x<sub>m−2</sub> + (m−2) x<sub>m−1</sub>)
| |
| + t<sup>m−2</sup> e<sup>λt</sup>(λ x<sub>m−1</sub> + (m−1) x<sub>m</sub>)
| |
| + t<sup>m−1</sup> e<sup>λt</sup>(λ x<sub>m</sub>)''''''
| |
| <br>and<br>
| |
| '''Ay(t) =<br> e<sup>λt</sup>A x<sub>1</sub> + t e<sup>λt</sup>A x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>A x<sub>3</sub>
| |
| + t<sup>3</sup> e<sup>λt</sup>A x<sub>4</sub> + ...
| |
| + t<sup>m−2</sup> e<sup>λt</sup>A x<sub>m−1</sub>
| |
| + t<sup>m−1</sup> e<sup>λt</sup>A x<sub>m</sub>''' .
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| Again, {{nowrap| '''y(t)''' }} will be a solution to '''y′(t) = Ay(t)''' , when<br>
| |
| '''A x<sub>1</sub> = λ '''</sup>x<sub>1</sub> + x<sub>2</sub>'''''' ,
| |
| '''A x<sub>2</sub> = λ x<sub>2</sub> + 2 x<sub>3</sub>''' ,
| |
| '''A x<sub>3</sub> = λ x<sub>3</sub> + 3 x<sub>4</sub>''' ,
| |
| '''A x<sub>4</sub> = λ x<sub>4</sub> + 4 x<sub>5</sub>''' ,<br>
| |
| '''A x<sub>m−2</sub> = λ x<sub>m−2</sub> + (m−2) x<sub>m−1</sub>''' ,
| |
| '''A x<sub>m−1</sub> = λ x<sub>m−1</sub> + (m−1) x<sub>m</sub>''' ,
| |
| <br>and '''A x<sub>m</sub> = λ x<sub>m</sub>''' .
| |
| <br>That is when<br>
| |
| '''(A − λ I)x<sub>1</sub> = x<sub>2</sub>''' ,
| |
| '''(A − λ I)x<sub>2</sub> = 2 x<sub>3</sub>''' ,
| |
| '''(A − λ I)x<sub>3</sub> = 3 x<sub>4</sub>''' ,
| |
| '''(A − λ I)x<sub>4</sub> = 4 x<sub>5</sub>''' ,<br>
| |
| '''...,''' <br>
| |
| '''(A − λ I)x<sub>m−2</sub> = (m−2) x<sub>m−1</sub>''' ,
| |
| '''(A − λ I)x<sub>m−1</sub> = (m−1) x<sub>m</sub>''' , and<br>
| |
| '''(A − λ I)x<sub>m</sub> = 0''' .
| |
| <br>Since it will hold '''(A − λ I)((m−1)! x<sub>3</sub>) = 0''' , also,
| |
| equivalently, when<br>
| |
| '''{x<sub>1</sub>, 1! x<sub>2</sub>, 2! x<sub>3</sub>, 3! x<sub>4</sub>, ...,
| |
| (m−2)! x<sub>m−1</sub>, (m−1)! x<sub>m</sub>}'''
| |
| <br>is a ''chain'' of ''generalized eigenvectors''.
| |
| </p>
| |
| | |
| Now, the ''basis set'' for all solutions will be found through a
| |
| ''disjoint collection<br>of chains of generalized eigenvectors'' of
| |
| the matrix '''A'''.
| |
| | |
| <p style="line-height:150%;">
| |
| Assume '''A''' has
| |
| ''eigenvalues'' {{nowrap|'''λ<sub>1</sub>, λ<sub>2</sub>, ..., λ<sub>r</sub>'''}}
| |
| <br>of ''algebraic multiplicities'' {{nowrap|'''k<sub>1</sub>, k<sub>2</sub>, ..., k<sub>r</sub>'''}}.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| For a given ''eigenvalue'' '''λ<sub>i</sub>''' there is a ''collection'' of
| |
| '''s''', with '''s''' depending on '''i''',<br>''disjoint chains'' of
| |
| ''generalized eigenvectors''<br>
| |
| '''
| |
| ''C<sub>i,1</sub>'' = {<sup>1</sup>z<sub>1</sub>, <sup>1</sup>z<sub>2</sub>, ...,<sup>1</sup>z<sub>j1</sub>},
| |
| ''C<sub>i,2</sub>'' = {<sup>2</sup>z<sub>1</sub>, <sup>2</sup>z<sub>2</sub>, ...,<sup>2</sup>z<sub>j2</sub>},
| |
| ...,
| |
| ''C<sub>i,js(i)</sub>'' = {<sup>s</sup>z<sub>1</sub>, <sup>s</sup>z<sub>2</sub>, ...,<sup>s</sup>z<sub>js</sub>},
| |
| '''
| |
| <br>that when ''combined'' form a ''basis set'' for '''N<sub>λ<sub>i</sub>, k<sub>i</sub></sub>'''.
| |
| The total number of ''vectors''<br>in this set will be
| |
| '''j1 + j2 + ... + js = k<sub>i</sub>'''.
| |
| Sets in this collection may have only one<br>or two members so in this discussion understand
| |
| the notation
| |
| '''{<sup>β</sup>z<sub>1</sub>, <sup>β</sup>z<sub>2</sub>, ...,<sup>β</sup>z<sub>jβ</sub>}'''
| |
| <br>will mean '''{<sup>β</sup>z<sub>1</sub>}''' when '''jβ = 1''', and
| |
| '''{<sup>β</sup>z<sub>1</sub>, <sup>β</sup>z<sub>2</sub>}''' when '''jβ = 2''',
| |
| and so forth.<br>
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| Being that this ''notation'' is cumbersome with many ''indices'', in the next
| |
| paragraphs<br> any particular '''''C<sub>i,β</sub>''''', when more explanation
| |
| is not needed, may just be notated as<br>
| |
| '''
| |
| ''C'' = {z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>j</sub>}.
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| For each such of these ''chain'' sets,
| |
| '''''C'' = {z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>j</sub>}'''<br>
| |
| the sets '''{x<sub>j</sub>}''', '''{x<sub>j−1</sub>, x<sub>j</sub>},'''
| |
| '''{x<sub>j−2</sub>, x<sub>j−1</sub>, x<sub>j</sub>}, ...,'''
| |
| '''{z<sub>2</sub>, z<sub>3</sub>, ..., z<sub>j</sub>},'''
| |
| '''{z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>j</sub>}'''<br>
| |
| are also ''chains''. This notation being understood to mean when<br>
| |
| '''''C'' = {z<sub>1</sub>}''' just '''{z<sub>1</sub>}''',
| |
| when '''''C'' = {z<sub>1</sub>, z<sub>2</sub>}''' just '''{z<sub>2</sub>},'''
| |
| '''{z<sub>1</sub>, z<sub>2</sub>}''' and when<br>
| |
| '''''C'' = {z<sub>1</sub>, z<sub>2</sub>, z<sub>2</sub>}''' just '''{z<sub>3</sub>},'''
| |
| '''{z<sub>2</sub>, z<sub>3</sub>},''' '''{z<sub>1</sub>, z<sub>2</sub>, z<sub>3</sub>}''',
| |
| and so on.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| The conclusion of the top of the discussion was that<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub>''', is a solution when '''{x<sub>1</sub>}'''
| |
| is a ''chain''.<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>''',
| |
| is a solution when '''{x<sub>1</sub>, 1! x<sub>2</sub>}'''
| |
| is a ''chain''.<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>''',
| |
| is a solution when '''{x<sub>1</sub>, 1! x<sub>2</sub>, , 2! x<sub>3</sub>}'''
| |
| is a ''chain''.<br>
| |
| The progression continues to<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>
| |
| + t<sup>3</sup> e<sup>λt</sup>x<sub>4</sub> + ...
| |
| + t<sup>m−2</sup> e<sup>λt</sup>x<sub>m−1</sub>
| |
| + t<sup>m−1</sup> e<sup>λt</sup>x<sub>m</sub>''',<br>
| |
| is a solution when
| |
| '''{x<sub>1</sub>, 1! x<sub>2</sub>, 2! x<sub>3</sub>, 3! x<sub>4</sub>, ...,
| |
| (m−2)! x<sub>m−1</sub>, (m−1)! x<sub>m</sub>}''',
| |
| <br>is a ''chain'' of ''generalized eigenvectors''.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| In light of the preceding calculations, all that must be done is to provide
| |
| the proper<br> ''scaling'' for each of the ''chains'' arising from the
| |
| set '''''C'' = {z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>j</sub>}'''.
| |
| <br>The progression for the ''solutions'' is given by<br>
| |
| '''y(t) = e<sup>λt</sup>z<sub>j</sub>''', for ''chain'' '''{z<sub>j</sub>}'''<br>
| |
| '''y(t) = e<sup>λt</sup>z<sub>j−1</sub> + (1 ⁄ 1!) t e<sup>λt</sup>z<sub>j</sub>''',
| |
| for ''chain'' '''{z<sub>j−1</sub>, 1!(1 ⁄ 1!) z<sub>j</sub>}'''<br>
| |
| '''y(t) = e<sup>λt</sup>z<sub>j−2</sub> + (1 ⁄ 1!) t e<sup>λt</sup>z<sub>j−1</sub>
| |
| + (1 ⁄ 2!) t<sup>2</sup> e<sup>λt</sup>z<sub>j</sub>''',
| |
| <br>for ''chain'' '''{z<sub>j−2</sub>, 1!(1 ⁄ 1!) z<sub>j−1</sub>, 2!(1 ⁄ 2!) z<sub>j</sub>}'''<br>
| |
| '''y(t) = e<sup>λt</sup>z<sub>j−3</sub> + (1 ⁄ 1!) t e<sup>λt</sup>z<sub>j−2</sub>
| |
| + (1 ⁄ 2!) t<sup>2</sup> e<sup>λt</sup>z<sub>j−1</sub> + (1 ⁄ 3!) t<sup>3</sup> e<sup>λt</sup>z<sub>j</sub>''',
| |
| <br>for ''chain'' '''{z<sub>j−3</sub>, 1!(1 ⁄ 1!) z<sub>j−2</sub>, 2!(1 ⁄ 2!) z<sub>j−1</sub>, 3!(1 ⁄ 3!) z<sub>j</sub>}''',<br>
| |
| and so on until,<br>
| |
| '''y(t) = e<sup>λt</sup>z<sub>1</sub> + (1 ⁄ 1!) t e<sup>λt</sup>z<sub>2</sub>
| |
| + (1 ⁄ 2!) t<sup>2</sup> e<sup>λt</sup>z<sub>3</sub> + ... + (1 ⁄ (j−1)!) t<sup> j−1</sup> e<sup>λt</sup>z<sub>j</sub>''',
| |
| <br>for the ''chain'' of ''generalized eigenvectors'',
| |
| <br>
| |
| '''{z<sub>1</sub>, 1!(1 ⁄ 1!) z<sub>2</sub>, 2!(1 ⁄ 2!) z<sub>3</sub>, ...,
| |
| (j−2)!(1 ⁄ (j−2)!) x<sub>j−1</sub>, (j−1)!(1 ⁄ (j−1)!) z<sub>j</sub>}'''.
| |
| </p>
| |
| | |
| What is left to show is that when all the ''solutions'' constructed from the
| |
| ''chain sets'',<br>as described, are considered, they form a ''fundamental set''
| |
| of ''solutions''.<br> To do this it has to be shown that there are '''n''' of them
| |
| and that they are<br>''linearly independent''.
| |
| | |
| <p style="line-height:175%;">
| |
| Reiterating, for a given ''eigenvalue'' '''λ<sub>i</sub>''' there is a ''collection'' of
| |
| '''s''', with '''s''' depending on '''i''',<br>''disjoint chains'' of
| |
| ''generalized eigenvectors''<br>
| |
| '''
| |
| ''C<sub>i,1</sub>'' = {<sup>1</sup>z<sub>1</sub>, <sup>1</sup>z<sub>2</sub>, ...,<sup>1</sup>z<sub>j1(i)</sub>},
| |
| ''C<sub>i,2</sub>'' = {<sup>2</sup>z<sub>1</sub>, <sup>2</sup>z<sub>2</sub>, ...,<sup>2</sup>z<sub>j2(i)</sub>},
| |
| <br>...,
| |
| ''C<sub>i,js(i)</sub>'' = {<sup>s(i)</sup>z<sub>1</sub>, <sup>s(i)</sup>z<sub>2</sub>, ...,<sup>s(i)</sup>z<sub>js(i)</sub>},
| |
| '''
| |
| <br>that when ''combined'' form a ''basis set'' for '''N<sub>λ<sub>i</sub>, k<sub>i</sub></sub>'''.
| |
| The total number of ''vectors''<br>in this set will be
| |
| '''j1(i) + j2(i) + ... + js(i) = k<sub>i</sub>'''.
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| Thus the total number of all such ''basis vectors'' and so ''solutions'' is
| |
| <br>'''k<sub>1</sub> + k<sub>2</sub> + ... + k<sub>r</sub> = n'''.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| Each solution is one of the forms
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub>''',
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>''',<br>
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub>''',
| |
| '''y(t) = e<sup>λt</sup>x<sub>1</sub> + t e<sup>λt</sup>x<sub>2</sub>
| |
| + t<sup>2</sup> e<sup>λt</sup>x<sub>3</sub> + ...'''.<br>
| |
| Now each ''basis vector'' '''v<sub>j</sub>''', for '''j = 1, 2, ..., n''';
| |
| of the ''combined'' set of<br>''generalized eigenvectors'', occurs as
| |
| '''x<sub>1</sub>''' in one of the expressions immediately<br>
| |
| above '''''precisely once'''''. That is, for each '''j''', there is one
| |
| '''y<sub>j</sub>(t) = e<sup>λt</sup>v<sub>j</sub> + ...'''<br>
| |
| Since '''y<sub>j</sub>(0) = e<sup>λ0</sup>v<sub>j</sub> = v<sub>j</sub>''',
| |
| the set of ''solutions'' are ''linearly independent'' at '''t = 0'''.<br>
| |
| </p>
| |
| | |
| ====Revisiting the powers of a matrix====
| |
| | |
| As a notational convenience {{nowrap| '''A<sub>λ, 0</sub> {{=}} I'''}}.
| |
| | |
| Note that
| |
| '''A = '''
| |
| '''λ I'''
| |
| + '''A<sub>λ, 1</sub> '''.
| |
| and apply the ''binomial theorem''.
| |
| | |
| :<math>A^s=(\lambda I+A_{\lambda,1})^s=\sum_{r=0}^s\binom{s}{r}\lambda^{s-r}A_{\lambda,r}</math>
| |
| | |
| Assume '''λ''' is an ''eigenvalue'' of {{nowrap| '''A''', }} and let
| |
| '''{ x<sub>1</sub>, x<sub>2</sub>, {{nowrap|..., x<sub>m</sub> }'''}}
| |
| <br>be a '''''chain''''' of ''generalized eigenvectors'' such that
| |
| '''x<sub>1</sub> ∈ N<sub>λ, m</sub> \ N<sub>λ, m-1</sub> ,'''
| |
| <br> '''x<sub>i+1</sub> = A<sub>λ, 1</sub> x<sub>i</sub> ,'''.
| |
| '''x<sub>m</sub> ≠ 0 ,''' and
| |
| '''A<sub>λ, 1</sub> x<sub>m</sub> = 0'''.
| |
| | |
| Then {{nowrap|'''x<sub>r+1</sub> {{=}} A<sub>λ, r</sub> x<sub>1</sub>'''}},
| |
| for '''r = 0, 1, ..., m-1'''.
| |
| | |
| :<math>A^s x_1=\sum_{r=0}^s\binom{s}{r}\lambda^{s-r}A_{\lambda,r}x_1=\sum_{r=0}^s\binom{s}{r}\lambda^{s-r}x_{r+1}</math>
| |
| | |
| So for '''s ≤ m − 1'''
| |
| | |
| :<math>A^s x_1=\sum_{r=0}^s\binom{s}{r}\lambda^{s-r}x_{r+1}</math>
| |
| | |
| and for '''s ≥ m − 1''', since
| |
| '''A<sub>λ, m</sub> x<sub>1</sub> = 0''',
| |
| | |
| :<math>A^s x_1=\sum_{r=0}^{m-1}\binom{s}{r}\lambda^{s-r}x_{r+1}</math>
| |
| | |
| ===Ordinary linear difference equations===
| |
| | |
| <p style="line-height:150%;">
| |
| Ordinary ''linear difference equations'' are equations of the sort:<br>
| |
| '''
| |
| y<sub>n</sub> = a y<sub>n−1</sub> + b<br>
| |
| '''
| |
| '''
| |
| y<sub>n</sub> = a y<sub>n−1</sub> + b y<sub>n−2</sub> {{nowrap| + c}}<br>
| |
| '''
| |
| or more generally,<br>
| |
| '''
| |
| y<sub>n</sub> = a<sub>m</sub>y<sub>n−1</sub> + a<sub>m−1</sub>y<sub>n−2</sub> + ... +
| |
| a<sub>2</sub>y<sub>n−m + 1</sub> + a<sub>1</sub>y<sub>n−m</sub> + a<sub>0</sub><br>
| |
| '''
| |
| with initial conditions<br>
| |
| '''
| |
| y<sub>0</sub>, y<sub>1</sub>, y<sub>2</sub>, ..., y<sub>m−2</sub>, y<sub>m−1</sub>.
| |
| '''
| |
| </p>
| |
| | |
| A case with {{nowrap| '''a<sub>1</sub> {{=}} 0''' }} can be excluded, since
| |
| it represents an equation of less degree.
| |
| | |
| <p style="line-height:175%;">
| |
| They have a characteristic polynomial<br>
| |
| '''
| |
| p(x) =
| |
| x<sup>m</sup> − a<sub>m</sub>x<sup>m−1</sup> − a<sub>m−1</sub>x<sup>m−2</sup> − ... −
| |
| a<sub>2</sub>x − a<sub>1</sub>.<br>
| |
| '''
| |
| To solve a ''difference equation'' it is first observed, if
| |
| '''y<sub>n</sub>''' and {{nowrap| '''z<sub>n</sub>''' }} are both solutions,<br>then
| |
| '''(y<sub>n</sub> − z<sub>n</sub>)''' is a solution of the ''homogeneous'' equation:<br>
| |
| '''
| |
| y<sub>n</sub> = a<sub>m</sub>y<sub>n−1</sub> + a<sub>m−1</sub>y<sub>n−2</sub> + ... +
| |
| a<sub>2</sub>y<sub>n−m + 1</sub> + a<sub>1</sub>y<sub>n−m</sub>.
| |
| '''
| |
| </p>
| |
| <p style="line-height:150%;">
| |
| So a ''particular'' solution to the ''difference equation'' must be
| |
| found together with<br>all solutions of the ''homogeneous'' equation to get the
| |
| ''general solution'' for the<br>''difference equation''.
| |
| Another observation to make is that, if {{nowrap|''' y<sub>n</sub>''' }} is a solution to<br>
| |
| the ''inhomogeneous'' equation, then<br>
| |
| '''z<sub>n</sub> = y<sub>n+1</sub> − y<sub>n</sub>'''<br>
| |
| is also a solution to the ''homogeneous'' equation.<br>
| |
| So all solutions of the ''homogeneous'' equation will be found first.
| |
| </p>
| |
| <p style="line-height:175%;">
| |
| When {{nowrap| '''β''' }} is a root of {{nowrap| '''p(x) {{=}} 0''', then}} it is easily seen<br>
| |
| '''y<sub>n</sub> = β<sup>n</sup>'''
| |
| is a solution to the ''homogeneous'' equation since<br>
| |
| '''
| |
| y<sub>n</sub> − a<sub>m</sub>y<sub>n−1</sub> − a<sub>m−1</sub>y<sub>n−2</sub> − ... −
| |
| a<sub>2</sub>y<sub>n−m + 1</sub> − a<sub>1</sub>y<sub>n−m</sub>,<br>
| |
| '''
| |
| becomes upon the substitution {{nowrap| '''y<sub>n</sub> {{=}} β<sup>n</sup>''',}}<br>
| |
| '''
| |
| β<sup>n</sup> − a<sub>m</sub>β<sup>n−1</sup> − a<sub>m−1</sub>β<sup>n−2</sup> − ... −
| |
| a<sub>2</sub>β<sup>n−m + 1</sup> − a<sub>1</sub>β<sup>n−m</sup><br>
| |
| '''
| |
| '''
| |
| = β<sup>n−m</sup>(β<sup>m</sup> − a<sub>m</sub>β<sup>m−1</sup> − a<sub>m−1</sub>β<sup>m−2</sup> − ... −
| |
| a<sub>2</sub>β − a<sub>1</sub>)<br>
| |
| '''
| |
| '''
| |
| = β<sup>n−m</sup>p(β) = 0.
| |
| </p>
| |
| <p style="line-height:175%;">
| |
| '''
| |
| When {{nowrap| '''β''' }} is a repeated root of {{nowrap| '''p(x) {{=}} 0''', then}}<br>
| |
| '''y<sub>n</sub> = nβ<sup>n−1</sup>'''
| |
| is a solution to the ''homogeneous'' equation since<br>
| |
| '''
| |
| nβ<sup>n−1</sup> − a<sub>m</sub>(n−1)β<sup>n−2</sup> − a<sub>m−1</sub>(n−2)β<sup>n−3</sup> − ... −
| |
| a<sub>2</sub>(n−m + 1)β<sup>n−m</sup> − a<sub>1</sub>(n−m)β<sup>n−m − 1</sup><br>
| |
| '''
| |
| '''
| |
| = (n−m)β<sup>n−m − 1</sup>(β<sup>m</sup> − a<sub>m</sub>β<sup>m−1</sup> − a<sub>m−1</sub>β<sup>m−2</sup> − ... −
| |
| a<sub>2</sub>β − a<sub>1</sub>)<br>
| |
| '''
| |
| '''
| |
| + β<sup>n−m − 1</sup>(mβ<sup>m−1</sup> − (m−1)a<sub>m</sub>β<sup>m−2</sup> − (m−2)a<sub>m−1</sub>β<sup>m−3</sup> − ... −
| |
| 2a<sub>3</sub>β − a<sub>2</sub>)<br>
| |
| '''
| |
| '''
| |
| = (n−m)β<sup>n−m − 1</sup>p(β) + β<sup>n−m − 1</sup>p′(β) == 0.
| |
| '''
| |
| </p>
| |
| <p style="line-height:175%;">
| |
| After reaching this point in the calculation the ''mystery'' is solved. Just notice when<br>
| |
| '''β''' is a root of {{nowrap| '''p(x) {{=}} 0'''}} with ''multiplicity'' '''k''',
| |
| then for {{nowrap| '''s {{=}} 1, 2, ..., k−1'''}}<br>
| |
| '''
| |
| d<sup>s</sup>(β<sup>n−m</sup>p(β))/dβ<sup>s</sup> = 0.<br>
| |
| '''
| |
| Referring this back to the original equation<br>
| |
| '''
| |
| β<sup>n</sup> − a<sub>m</sub>β<sup>n−1</sup> − a<sub>m−1</sub>β<sup>n−2</sup> − ... −
| |
| a<sub>2</sub>β<sup>n−m + 1</sup> − a<sub>1</sub>β<sup>n−m</sup><br>
| |
| '''
| |
| it is seen that<br>
| |
| '''
| |
| y<sub>n</sub> = d<sup>s</sup>(β<sup>n</sup>)/dβ<sup>s</sup><br>
| |
| '''
| |
| are solutions to the ''homogeneous'' equation. For example, if '''β''' is a root
| |
| of<br>''multiplicity'' '''3''', then '''y<sub>n</sub> = n(n−1)β<sup>n−2</sup>'''
| |
| is a solution. In any case this gives
| |
| '''m'''<br>''linearly independent'' solutions to the ''homogeneous'' equation.
| |
| </p>
| |
| <p style="line-height:175%;">
| |
| To look for a ''particular solution'' first consider the simpliest equation.<br>
| |
| '''
| |
| y<sub>n</sub> = a y<sub>n−1</sub> + b.<br>
| |
| '''
| |
| It has a ''particular'' solution {{nowrap| '''y<sub>p,n</sub>''' }} given by<br>
| |
| '''
| |
| y<sub>p,0</sub> = 0, y<sub>p,1</sub> = b, y<sub>p,2</sub> = (1 + a)b, ...,
| |
| y<sub>p,n</sub> = (1 + a + a<sup>2</sup> + ... + a<sup>n−1</sup>)b, ..., .<br>
| |
| '''
| |
| It's ''homogeneous'' equation {{nowrap| '''y<sub>n</sub> {{=}} a y<sub>n−1</sub>''' }} has
| |
| solutions {{nowrap| '''y<sub>n</sub> {{=}} a<sup>n</sup>y<sub>0</sub>'''.}}<br>
| |
| So {{nowrap| '''z<sub>n</sub> {{=}} y<sub>n+1</sub> − y<sub>n</sub> {{=}} a<sup>n</sup>b'''}}<br>
| |
| can be ''telescoped'' to get<br>
| |
| '''
| |
| y<sub>n</sub> = (y<sub>n</sub> − y<sub>n−1</sub>) + (y<sub>n−1</sub> − y<sub>n−2</sub>)
| |
| + ... + (y<sub>2</sub> − y<sub>1</sub>) + (y<sub>1</sub> − y<sub>0</sub>) + y<sub>0</sub>
| |
| '''
| |
| <br>
| |
| '''
| |
| = z<sub>n−1</sub> + z<sub>n−2</sub> + ... + z<sub>1</sub> + z<sub>0</sub> + y<sub>0</sub><br>
| |
| = (1 + a + a<sup>2</sup> + ... + a<sup>n−1</sup>)b
| |
| ''',
| |
| <br>
| |
| the ''particular'' solution with {{nowrap|'''y<sub>0</sub> {{=}} 0'''}}.
| |
| </p>
| |
| <p style="line-height:150%;">
| |
| Now, returning to the general problem, the equation<br>
| |
| '''
| |
| y<sub>n</sub> = a<sub>m</sub>y<sub>n−1</sub> + a<sub>m−1</sub>y<sub>n−2</sub> + ... +
| |
| a<sub>2</sub>y<sub>n−m + 1</sub> + a<sub>1</sub>y<sub>n−m</sub> + a<sub>0</sub>.<br>
| |
| '''
| |
| When '''y<sub>p,n</sub>''' is a ''particular'' solution with
| |
| '''y<sub>p,0</sub> = 0''', then<br>
| |
| '''z<sub>n</sub> = y<sub>p,n+1</sub> − y<sub>p,n</sub> '''<br>
| |
| is a solution to the ''homogeneous'' equation with
| |
| '''z<sub>0</sub> = y<sub>p,1</sub> '''.<br>
| |
| So {{nowrap| '''z<sub>n</sub> {{=}} y<sub>p,n+1</sub> − y<sub>p,n</sub> '''}}<br>
| |
| can be ''telescoped'' to get<br>
| |
| '''
| |
| y<sub>p,n</sub> = (y<sub>p,n</sub> − y<sub>p,n−1</sub>) + (y<sub>p,n−1</sub> − y<sub>p,n−2</sub>)
| |
| + ... + (y<sub>p,2</sub> − y<sub>p,1</sub>) + (y<sub>p,1</sub> − y<sub>p,0</sub>) + y<sub>p,0</sub>
| |
| '''
| |
| <br>
| |
| '''
| |
| = z<sub>n−1</sub> + z<sub>n−2</sub> + ... + z<sub>1</sub> + z<sub>0</sub> <br>
| |
| '''
| |
| Considering<br>
| |
| '''
| |
| y<sub>p,m</sub> = a<sub>m</sub>y<sub>p,m−1</sub> + a<sub>m−1</sub>y<sub>p,m−2</sub> + ... +
| |
| a<sub>2</sub>y<sub>p,1</sub> + a<sub>1</sub>y<sub>p,0</sub> + a<sub>0</sub>.<br>
| |
| '''
| |
| and rewriting the equation in the '''z<sub>i</sub>'''<br>
| |
| '''
| |
| z<sub>m−1</sub> + z<sub>m−2</sub> + ... + z<sub>1</sub> + z<sub>0</sub> <br>
| |
| '''
| |
| '''
| |
| = (a<sub>m</sub>) (
| |
| z<sub>m−2</sub> + z<sub>m−3</sub> + ... + z<sub>1</sub> + z<sub>0</sub>)
| |
| '''
| |
| '''
| |
| + (a<sub>m−1</sub>) (
| |
| z<sub>m−3</sub> + z<sub>m−4</sub> + ... + z<sub>1</sub> + z<sub>0</sub>) <br>
| |
| '''
| |
| '''
| |
| + (a<sub>m−2</sub>) (
| |
| z<sub>m−4</sub> + z<sub>m−5</sub> + ... + z<sub>1</sub> + z<sub>0</sub>) <br>
| |
| '''
| |
| '''
| |
| + · · ·<br>
| |
| '''
| |
| '''
| |
| + (a<sub>3</sub>) ( z<sub>1</sub> + z<sub>0</sub>)
| |
| '''
| |
| '''
| |
| + (a<sub>2</sub>) ( z<sub>0</sub>)
| |
| '''
| |
| '''
| |
| + (a<sub>0</sub>)<br>
| |
| '''
| |
| and<br>
| |
| '''
| |
| z<sub>m−1</sub><br>
| |
| '''
| |
| '''
| |
| = (a<sub>m</sub> − 1) z<sub>m−2</sub>
| |
| '''
| |
| '''
| |
| + (a<sub>m</sub> + a<sub>m−1</sub> − 1) z<sub>m−3</sub>
| |
| '''
| |
| '''
| |
| + (a<sub>m</sub> + a<sub>m−1</sub> +
| |
| a<sub>m−2</sub> − 1) z<sub>m−4</sub><br>
| |
| '''
| |
| '''
| |
| + · · ·<br>
| |
| '''
| |
| '''
| |
| + (a<sub>m</sub> + a<sub>m−1</sub> + ... +
| |
| a<sub>4</sub> + a<sub>3</sub> − 1) z<sub>1</sub>
| |
| '''
| |
| '''
| |
| + (a<sub>m</sub> + a<sub>m−1</sub> + ... +
| |
| a<sub>3</sub> + a<sub>2</sub> − 1) z<sub>0</sub><br>
| |
| '''
| |
| '''
| |
| + (a<sub>0</sub>).<br>
| |
| '''
| |
| </p>
| |
| <p style="line-height:150%;">
| |
| Since a solution of the ''homogeneous'' equation can be found for any ''initial conditions''<br>
| |
| '''
| |
| z<sub>0</sub>, z<sub>1</sub>, z<sub>2</sub>, ..., z<sub>m−2</sub>, z<sub>m−1</sub>.
| |
| '''
| |
| <br>reasoning ''conversely'' find such '''z<sub>i</sub>''' satisfying the
| |
| equation,<br> just before and define '''y<sub>p,n</sub>''' by the relation<br>
| |
| '''
| |
| y<sub>p,0</sub> = 0, y<sub>p,n</sub>
| |
| = z<sub>n−1</sub> + z<sub>n−2</sub> + ... + z<sub>1</sub> + z<sub>0</sub> <br>
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| One choice is, for example,
| |
| '''
| |
| z<sub>m−1</sub> = a<sub>0</sub>,
| |
| z<sub>0</sub> = z<sub>1</sub> = z<sub>2</sub> = ... = z<sub>m−2</sub> = 0.
| |
| '''
| |
| <br>This solution solves the problem for all ''initial values'' equal to ''zero''.
| |
| </p>
| |
| | |
| <p style="line-height:150%;">
| |
| The ''general solution'' to the ''inhomogeneous'' equation is given by<br>
| |
| '''y<sub>n</sub> = y<sub>p,n</sub> + γ<sub>1</sub> w(1)<sub>n</sub>
| |
| + γ<sub>2</sub> w(2)<sub>n</sub> + ... + γ<sub>m−1</sub> w(m−1)<sub>n</sub>
| |
| + γ<sub>m</sub> w(m)<sub>n</sub>'''<br>
| |
| where<br>
| |
| '''w(1)<sub>n</sub>, w(2)<sub>n</sub>, ..., w(m−1)<sub>n</sub>, w(m)<sub>n</sub>'''<br>
| |
| are a ''basis'' for the ''homogeneous'' equation, and<br>
| |
| '''γ<sub>1</sub>, γ<sub>2</sub>, ..., γ<sub>m−1</sub>, γ<sub>m</sub>'''<br>
| |
| are ''scalars''.
| |
| </p>
| |
| | |
| '''example'''
| |
| | |
| <p style="line-height:150%;">
| |
| '''
| |
| y<sub>n</sub> = 8 y<sub>n−1</sub> − 25 y<sub>n−2</sub> +
| |
| 38 y<sub>n−3</sub> − 28 y<sub>n−4</sub>
| |
| + 8 y<sub>n−5</sub> + 1<br>
| |
| '''
| |
| with initial conditions<br>
| |
| '''
| |
| y<sub>0</sub> = 0, y<sub>1</sub> = 0, y<sub>2</sub> = 0,
| |
| y<sub>3</sub> = 0, '''and''' y<sub>4</sub> = 0.
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| The ''characteristic polynomial'' for the equation is<br>
| |
| '''
| |
| p(x) =
| |
| x<sup>5</sup> − 8x<sup>4</sup> + 25x<sup>3</sup> −
| |
| 38x<sup>2</sup> + 28x − 8
| |
| = (x − 1)<sup>2</sup>(x − 2)<sup>3</sup>.<br>
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| The ''homogeneous'' equation has ''independent'' solutions<br>
| |
| '''
| |
| w1<sub>n</sub> = 1<sup>n</sup> = 1,
| |
| w2<sub>n</sub> = n·1<sup>n−1</sup> = n,
| |
| and<br>
| |
| w3<sub>n</sub> = 2<sup>n</sup>,
| |
| w4<sub>n</sub> = n·2<sup>n−1</sup>,
| |
| w5<sub>n</sub> = n(n−1)·2<sup>n−2</sup>.<br>
| |
| '''
| |
| The solution to the ''homogeneous'' equation<br>
| |
| '''
| |
| z<sub>n</sub> = −3 w1<sub>n</sub> − w2<sub>n</sub> +
| |
| 3 w3<sub>n</sub> − 2 w4<sub>n</sub> + ½ w5<sub>n</sub>
| |
| '''
| |
| <br>satisfies the ''initial conditions''<br>
| |
| '''
| |
| z<sub>4</sub> = 1,
| |
| z<sub>0</sub> = z<sub>1</sub> = z<sub>2</sub> = z<sub>3</sub> = 0.
| |
| '''
| |
| <br>A ''particular solution'' can be found by<br>
| |
| '''y<sub>p,0</sub> = 0, y<sub>p,n</sub> '''
| |
| '''
| |
| = z<sub>n−1</sub> + z<sub>n−2</sub> + ... + z<sub>1</sub> + z<sub>0</sub> .<br>
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| Calculating sums:<br>
| |
| '''
| |
| ∑w1 = w1<sub>n−1</sub> + w1<sub>n−2</sub> + ... + w1<sub>1</sub> + w1<sub>0</sub>
| |
| = n .<br>
| |
| '''
| |
| '''
| |
| ∑w2 = w2<sub>n−1</sub> + w2<sub>n−2</sub> + ... + w2<sub>1</sub> + w2<sub>0</sub>
| |
| = (n−1)n / 2 .<br>
| |
| '''
| |
| '''
| |
| ∑w3 = w3<sub>n−1</sub> + w3<sub>n−2</sub> + ... + w3<sub>1</sub> + w3<sub>0</sub>
| |
| = 2<sup>n</sup> − 1 .<br>
| |
| '''
| |
| Sums of these kinds are found by differentiating {{nowrap|'''(x<sup>n</sup> − 1) / (x − 1)'''}}.<br>
| |
| '''
| |
| ∑w4 = w4<sub>n−1</sub> + w4<sub>n−2</sub> + ... + w4<sub>1</sub> + w4<sub>0</sub>
| |
| = (n−2)2<sup>n−1</sup> + 1 .<br>
| |
| '''
| |
| '''
| |
| ∑w5 = w5<sub>n−1</sub> + w5<sub>n−2</sub> + ... + w5<sub>1</sub> + w5<sub>0</sub>
| |
| = (n<sup>2</sup> − 5n + 8)2<sup>n−2</sup> − 2 .<br>
| |
| '''
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| Now,<br>
| |
| '''
| |
| y<sub>p,n</sub> = −3 ∑w1<sub>n</sub> − ∑w2<sub>n</sub> +
| |
| 3 ∑w3<sub>n</sub> − 2 ∑w4<sub>n</sub> + ½ ∑w5<sub>n</sub>
| |
| '''
| |
| <br>solves the ''initial value problem'' of this example.
| |
| </p>
| |
| | |
| <p style="line-height:175%;">
| |
| At this point it is worthwhile to notice that all the ''terms'' that are combinations of
| |
| <br>''scalar multiples'' of ''basis elements'' can be removed. These are any multiples of<br>
| |
| '''
| |
| 1, n, 2<sup>n</sup>, n·2<sup>n−1</sup>,'''
| |
| and '''n<sup>2</sup>·2<sup>n−2</sup>.
| |
| '''
| |
| <br>So instead the ''particular'' solution next, may be preferred.<br>
| |
| '''
| |
| y<sub>p,n</sub> = −½ n<sup>2</sup> .<br>
| |
| '''
| |
| This solution has ''non zero'' initial values, which must be taken into account.<br>
| |
| '''
| |
| y<sub>0</sub> = 0, y<sub>1</sub> = −1 ⁄ 2, y<sub>2</sub> = −2,
| |
| y<sub>3</sub> = −9 ⁄ 2, '''and''' y<sub>4</sub> = −8.
| |
| '''
| |
| </p>
| |
| | |
| == References ==
| |
| *{{Cite book
| |
| | last = Axler
| |
| | first = Sheldon
| |
| | title = Linear Algebra Done Right
| |
| | publisher = Springer
| |
| | year = 1997
| |
| | edition = 2nd
| |
| | isbn = 978-0-387-98258-8}}
| |
| | |
| {{DEFAULTSORT:Generalized Eigenvector}}
| |
| [[Category:Linear algebra]]
| |