Deep Learning Essentials
Matrix properties
In this section, we will look at some important properties of matrices that are very useful in deep learning applications.
- Norm: A norm is an important property of a vector or a matrix that measures its size. Geometrically, it can also be interpreted as the distance of a point, x, from the origin. The Lp norm is defined as follows:
$$\|x\|_p = \left(\sum_i |x_i|^p\right)^{1/p}$$
Although a norm can be computed for any order p ≥ 1, the best-known norms are the L1 and L2 norms. The L1 norm is usually considered a good choice for sparse models:
$$\|x\|_1 = \sum_i |x_i|$$
Another norm popular in the deep learning community is the max norm, also referred to as L∞. It is simply the largest absolute value among the elements of the vector:
$$\|x\|_\infty = \max_i |x_i|$$
The norms mentioned so far apply to vectors. To compute the size of a matrix, we use the Frobenius norm, defined as follows:
$$\|A\|_F = \sqrt{\sum_{i,j} a_{i,j}^2}$$
Norms are also useful because the dot product of two vectors can be written directly in terms of their L2 norms and the angle θ between them:
$$x^\top y = \|x\|_2 \, \|y\|_2 \cos\theta$$
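The norms described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the helper names (`lp_norm`, `max_norm`, `frobenius_norm`, `dot`) and the sample values are assumptions chosen for this example, and vectors and matrices are represented as ordinary lists.

```python
import math

def lp_norm(x, p):
    """L^p norm of a vector x (a list of numbers), assuming p >= 1."""
    return sum(abs(v) ** p for v in x) ** (1.0 / p)

def max_norm(x):
    """L-infinity (max) norm: the largest absolute value in x."""
    return max(abs(v) for v in x)

def frobenius_norm(A):
    """Frobenius norm of a matrix A given as a list of rows."""
    return math.sqrt(sum(v * v for row in A for v in row))

def dot(x, y):
    """Dot product of two vectors of equal length."""
    return sum(a * b for a, b in zip(x, y))

# Hypothetical sample data for illustration only.
x = [3.0, -4.0]
y = [3.0, 0.0]
A = [[1.0, 2.0], [3.0, 4.0]]

l1 = lp_norm(x, 1)        # |3| + |-4|
l2 = lp_norm(x, 2)        # sqrt(3^2 + 4^2)
linf = max_norm(x)        # max(|3|, |-4|)
fro = frobenius_norm(A)   # sqrt(1 + 4 + 9 + 16)

# Rearranging the dot-product identity gives the cosine of the angle
# between x and y.
cos_theta = dot(x, y) / (lp_norm(x, 2) * lp_norm(y, 2))
```

In practice, a numerical library would normally be used instead. For example, NumPy's `np.linalg.norm` computes these quantities through its `ord` argument.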
- Trace: The trace is an operator defined as the sum of all the diagonal elements of a matrix:
$$\mathrm{Tr}(A) = \sum_i a_{i,i}$$
The trace operator is quite useful for computing the Frobenius norm of a matrix, as follows:
$$\|A\|_F = \sqrt{\mathrm{Tr}(A A^\top)}$$
Another interesting property of the trace operator is that it is invariant to the matrix transpose operation. Hence, it is often used to manipulate matrix expressions to yield meaningful identities:
$$\mathrm{Tr}(A) = \mathrm{Tr}(A^\top)$$
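The trace properties above can be checked numerically with a short sketch. The helper functions (`trace`, `transpose`, `matmul`) and the sample matrix are assumptions made for this illustration, and matrices are stored as lists of rows.

```python
import math

def trace(A):
    """Sum of the diagonal elements of a square matrix A."""
    return sum(A[i][i] for i in range(len(A)))

def transpose(A):
    """Transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*A)]

def matmul(A, B):
    """Matrix product of A and B (lists of rows)."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

# Hypothetical sample matrix for illustration only.
A = [[1.0, 2.0], [3.0, 4.0]]

tr_A = trace(A)              # 1 + 4
tr_AT = trace(transpose(A))  # the diagonal is unchanged by transposition

# Frobenius norm computed two ways: via the trace of A A^T and
# directly from the squared elements.
fro_from_trace = math.sqrt(trace(matmul(A, transpose(A))))
fro_direct = math.sqrt(sum(v * v for row in A for v in row))
```

Both Frobenius computations should agree up to floating-point rounding, since the diagonal of A Aᵀ holds the squared length of each row of A.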
- Determinant: The determinant of a square matrix is a scalar value equal to the product of all the eigenvalues of the matrix. Determinants are generally very useful in the analysis and solution of systems of linear equations. For instance, according to Cramer's rule, a system of n linear equations in n unknowns has a unique solution if and only if the determinant of its coefficient matrix is non-zero.
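As a small illustration of the determinant, the sketch below works with a 2×2 matrix only. The function names (`det2`, `eigenvalues_2x2`, `cramer_solve2`) and the sample system are assumptions made for this example. The eigenvalues are obtained from the characteristic polynomial, which is valid here because the sample matrix is symmetric and therefore has real eigenvalues.

```python
import math

def det2(A):
    """Determinant of a 2x2 matrix given as a list of rows."""
    (a, b), (c, d) = A
    return a * d - b * c

def eigenvalues_2x2(A):
    """Roots of lambda^2 - Tr(A) * lambda + det(A) = 0.

    Assumes the eigenvalues are real (true for symmetric matrices).
    """
    tr = A[0][0] + A[1][1]
    disc = math.sqrt(tr * tr - 4.0 * det2(A))
    return (tr + disc) / 2.0, (tr - disc) / 2.0

def cramer_solve2(A, b):
    """Solve A x = b for a 2x2 system using Cramer's rule."""
    d = det2(A)
    if d == 0:
        raise ValueError("determinant is zero: no unique solution")
    # Replace one column of A with b at a time and divide by det(A).
    x0 = det2([[b[0], A[0][1]], [b[1], A[1][1]]]) / d
    x1 = det2([[A[0][0], b[0]], [A[1][0], b[1]]]) / d
    return x0, x1

# Hypothetical sample system: 2*x0 + x1 = 3 and x0 + 3*x1 = 5.
A = [[2.0, 1.0], [1.0, 3.0]]
b = [3.0, 5.0]

lam1, lam2 = eigenvalues_2x2(A)
product_of_eigenvalues = lam1 * lam2  # should match det2(A) up to rounding
solution = cramer_solve2(A, b)
```

Cramer's rule is mainly of theoretical interest. For larger systems, a library routine such as NumPy's `np.linalg.solve` is the usual choice.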