One very useful application of the Cayley-Hamilton theorem is finding explicit formulas for matrix functions. While this may seem like an arbitrary exercise, matrix functions are extremely helpful in solving systems of differential equations.

Consider an n x n square matrix A with characteristic equation (-1)^{n}λ^{n} + c_{n-1}λ^{n-1} + ... + c_{1}λ + c_{0} = 0. By the Cayley-Hamilton theorem, the matrix A satisfies its own characteristic equation, so:

(-1)^{n}A^{n}+c_{n-1}A^{n-1} + ... + c_{1}A + c_{0}I = 0.

This means that A^{n} can be expressed as a polynomial in A of degree at most n-1, and likewise λ^{n} can be expressed as the same polynomial in λ for each eigenvalue λ of A. So we can find an explicit formula for A^{n} by solving the system of equations that arises from substituting the various eigenvalues. In the case of an eigenvalue repeated k times, the equation for λ^{n} can be differentiated up to k-1 times with respect to λ, which ensures a total of n linearly independent equations so the system may be solved.
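As a minimal sketch of this procedure, consider a hypothetical 2 x 2 matrix (the matrix and power below are illustrative choices, not from the source). For n = 2, the method says A^5 = a_{1}A + a_{0}I, where λ^5 = a_{1}λ + a_{0} holds at each eigenvalue:

```python
# Sketch: computing A^5 via the Cayley-Hamilton method for a hypothetical
# 2x2 matrix. For n = 2, A^5 = a1*A + a0*I, where lambda^5 = a1*lambda + a0
# must hold at each eigenvalue.

A = [[2, 1],
     [0, 3]]          # upper triangular, so the eigenvalues 2 and 3 are on the diagonal

l1, l2 = 2, 3         # distinct eigenvalues

# Solve the linear system:
#   l1^5 = a1*l1 + a0
#   l2^5 = a1*l2 + a0
a1 = (l2**5 - l1**5) / (l2 - l1)
a0 = l1**5 - a1 * l1

# Assemble a1*A + a0*I
f_A = [[a1 * A[i][j] + (a0 if i == j else 0) for j in range(2)] for i in range(2)]

# Cross-check against a direct matrix power
def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

P = [[1, 0], [0, 1]]
for _ in range(5):
    P = matmul(P, A)

print(f_A)  # [[32.0, 211.0], [0.0, 243.0]]
print(P)    # [[32, 211], [0, 243]]
```

Both computations agree, but the Cayley-Hamilton route only required solving a 2 x 2 linear system rather than four matrix multiplications.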

We can extend this idea to obtain explicit formulas for other matrix functions such as e^{A}, sin A, log A, or even A^{k}.

Consider a function f(x) with Taylor series f(x) = ∑a_{k}x^{k}, convergent for all x, where the summation runs from 0 to infinity. The matrix function is then defined by the corresponding series f(A) = ∑a_{k}A^{k}.

Now, given that f(A) = ∑a_{k}A^{k}, divide this power series by the characteristic polynomial to write f(A) = q(A){(-1)^{n}A^{n}+c_{n-1}A^{n-1} + ... + c_{1}A + c_{0}I} + r(A), where q(A) is the quotient and the remainder r(A) has degree at most n-1. Notice that the bracketed factor is the characteristic polynomial of A, which is identically zero by the Cayley-Hamilton theorem. Hence we obtain the formula f(A) = s_{n-1}A^{n-1} + ... + s_{1}A + s_{0}I, and similarly f(λ) = s_{n-1}λ^{n-1} + ... + s_{1}λ + s_{0} at each eigenvalue λ. For an n x n matrix, an explicit formula for f(A) can therefore be found by solving the system of equations for f(λ).
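To make this concrete, here is a sketch computing e^{A} via the remainder formula f(A) = s_{1}A + s_{0}I for a hypothetical 2 x 2 matrix, cross-checked against a direct truncation of the Taylor series (the matrix and the 30-term cutoff are illustrative assumptions):

```python
import math

# Sketch: e^A for a hypothetical 2x2 matrix via f(A) = s1*A + s0*I,
# with s1, s0 found from f(lambda) = s1*lambda + s0 at each eigenvalue.

A = [[2, 1],
     [0, 3]]
l1, l2 = 2, 3     # eigenvalues of the (triangular) example matrix

s1 = (math.exp(l2) - math.exp(l1)) / (l2 - l1)
s0 = math.exp(l1) - s1 * l1

expA = [[s1 * A[i][j] + (s0 if i == j else 0) for j in range(2)] for i in range(2)]

# Cross-check against a truncated Taylor series sum_k A^k / k!
def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

series = [[0.0, 0.0], [0.0, 0.0]]
term = [[1.0, 0.0], [0.0, 1.0]]       # A^0 / 0!
for k in range(30):
    series = [[series[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    term = matmul(term, A)
    term = [[term[i][j] / (k + 1) for j in range(2)] for i in range(2)]

print(expA)
print(series)   # the two should agree to high precision
```

The two results coincide to machine precision, even though the Cayley-Hamilton formula uses only a first-degree polynomial in A.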

Having such formulas helps reduce solving a system of differential equations to something similar to solving a single one. For example, the system dX/dt = AX can be solved by a number of methods, but by comparing it to a linear homogeneous first-order differential equation, we can immediately notice that the solution involves the matrix exponential e^{At}.
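A brief sketch of this connection, using the Cayley-Hamilton formula for e^{At} on a hypothetical 2 x 2 system (the matrix, initial condition, and finite-difference check are illustrative assumptions):

```python
import math

# Sketch: solving dX/dt = A X with X(t) = e^{At} X(0), where e^{At} is
# built from the Cayley-Hamilton formula e^{At} = s1(t)*A + s0(t)*I.
# Hypothetical example matrix with eigenvalues -1 and -2.

A = [[0, 1],
     [-2, -3]]
x0 = [1.0, 0.0]

def X(t):
    l1, l2 = -1.0, -2.0
    s1 = (math.exp(l1 * t) - math.exp(l2 * t)) / (l1 - l2)
    s0 = math.exp(l1 * t) - s1 * l1
    eAt = [[s1 * A[i][j] + (s0 if i == j else 0) for j in range(2)] for i in range(2)]
    return [sum(eAt[i][j] * x0[j] for j in range(2)) for i in range(2)]

# Verify the ODE dX/dt = A X at t = 0.5 with a central difference
t, h = 0.5, 1e-6
lhs = [(X(t + h)[i] - X(t - h)[i]) / (2 * h) for i in range(2)]
rhs = [sum(A[i][j] * X(t)[j] for j in range(2)) for i in range(2)]
print(lhs, rhs)   # the two sides should agree closely
```

The numerical derivative of X(t) matches AX(t), confirming that the Cayley-Hamilton construction of e^{At} does solve the system.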

Matrix functions inherit the noncommutativity of matrix multiplication: e^{A}e^{B} need not equal e^{A+B} or e^{B}e^{A} unless A and B commute. Note also that for functions whose Taylor series does not converge for all x, every eigenvalue must satisfy |λ| less than the radius of convergence.
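A quick illustration of the noncommutativity point, with a hypothetical pair of non-commuting matrices (the matrices and the 40-term series cutoff are illustrative assumptions):

```python
# Sketch: e^A e^B != e^{A+B} when A and B do not commute.

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def expm(A, terms=40):
    # truncated Taylor series sum_k A^k / k!
    S = [[0.0, 0.0], [0.0, 0.0]]
    T = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(terms):
        S = [[S[i][j] + T[i][j] for j in range(2)] for i in range(2)]
        T = matmul(T, A)
        T = [[T[i][j] / (k + 1) for j in range(2)] for i in range(2)]
    return S

A = [[0, 1], [0, 0]]
B = [[0, 0], [1, 0]]          # AB != BA

lhs = matmul(expm(A), expm(B))                            # e^A e^B
S = [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]
rhs = expm(S)                                             # e^{A+B}

print(lhs)   # [[2.0, 1.0], [1.0, 1.0]]
print(rhs)   # approximately [[1.543, 1.175], [1.175, 1.543]]
```

Here e^{A}e^{B} has a 2 in the top-left entry while e^{A+B} has cosh(1) ≈ 1.543, so the two genuinely differ.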

In sum, given a matrix and its eigenvalues, it is possible to derive a formula for a function of that matrix. The process is very simple and involves solving one system of linear equations. For clarity, let us take a 3 x 3 matrix A with eigenvalues λ_{1}, λ_{2}, λ_{3} as an example. To find an expression for a function of A, we must first ensure that the eigenvalues lie within the radius of convergence of that function's Taylor series; that is, the largest |λ_{i}| must be less than the radius of convergence, otherwise the system may not have solutions. Also, for obvious reasons, each f(λ_{i}) must be defined. The formula can then be derived by determining the coefficients of the system:

f(λ_{1}) = a_{2}λ_{1}^{2} + a_{1}λ_{1} + a_{0}

f(λ_{2}) = a_{2}λ_{2}^{2} + a_{1}λ_{2} + a_{0}

f(λ_{3}) = a_{2}λ_{3}^{2} + a_{1}λ_{3} + a_{0}

In the case of repeated eigenvalues, we can differentiate the equation with respect to λ to obtain a new, distinct equation. Supposing λ_{1} = λ_{2}, our system would be:

f(λ_{1}) = a_{2}λ_{1}^{2} + a_{1}λ_{1} + a_{0}

(d/dλ)f(λ_{1}) = 2a_{2}λ_{1} + a_{1}

f(λ_{3}) = a_{2}λ_{3}^{2} + a_{1}λ_{3} + a_{0}

And in the case of three repeated eigenvalues (λ_{1} = λ_{2} = λ_{3}):

f(λ_{1}) = a_{2}λ_{1}^{2} + a_{1}λ_{1} + a_{0}

(d/dλ)f(λ_{1}) = 2a_{2}λ_{1} + a_{1}

(d^{2}/dλ^{2})f(λ_{1}) = 2a_{2}

Solving for a_{2}, a_{1}, and a_{0} we obtain the expression:

f(A) = a_{2}A^{2} + a_{1}A + a_{0}I
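The whole 3 x 3 procedure, including the repeated-eigenvalue case, can be sketched numerically. The matrix below is a hypothetical example with eigenvalue 1 repeated twice and eigenvalue 2 once, and the function is f = exp:

```python
import math

# Sketch: e^A = a2*A^2 + a1*A + a0*I for a hypothetical 3x3 matrix
# with eigenvalues 1, 1, 2 (the repeated root contributes a derivative equation).

A = [[1, 1, 0],
     [0, 1, 0],
     [0, 0, 2]]
e1, e2 = math.e, math.e ** 2

# System (middle equation is d/dlambda of f(lambda) at the repeated root):
#   f(1)  = a2 + a1 + a0      = e
#   f'(1) = 2*a2 + a1         = e
#   f(2)  = 4*a2 + 2*a1 + a0  = e^2
a2 = e2 - 2 * e1
a1 = e1 - 2 * a2
a0 = e1 - a1 - a2

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

A2 = matmul(A, A)
expA = [[a2 * A2[i][j] + a1 * A[i][j] + (a0 if i == j else 0)
         for j in range(3)] for i in range(3)]
print(expA)   # block structure: e on the Jordan block, e^2 in the corner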

Source: Zill, D. G., and Cullen, M. R., Advanced Engineering Mathematics, 3rd ed., Jones and Bartlett.