贡献者: JierPeter; addis
It is known that a complex numbers can be defined as odered pairs of real numbers, or the field extension of $\mathbb{R}$ by adding the new element $ \mathrm{i} $.
Complex numbers are broadly applied, because they have almost perfect algebraic and analytic properties, making many difficult calculations simple or even possible. John Derbyshire addressed in his book Unknown Quantity: Real and Imaginary History of Algebra that "If passing from the real numbers, which are merely onedimensional, to the complex numbers, which are two-dimensional, gives us such a huge increase in power and insight, why stop there? Might there not be other kinds of numbers waiting to be discovered-hyper-complex numbers, so to speak—whose natural representation is three-dimensional? And might those numbers not bring with them a vast new increase in our mathematical understanding?"1
The Irish mathematician from the early 19th century, William Hamilton, was in search of three-dimensional numbers with a well-defined multiplication, hopefully corresponding to the three dimensional coordinates. However, he exerted years of efforts, and still couldn't find a multiplication on the ordered triple of real numbers that satisfies associativity and distributivity. The fact is, there is no such three dimensional algebra, but Hamilton didn't know at that time.
The day was October 16, 1843, when Hamilton was walking to the Council of the Royal Irish Academy with his wife, "a spark flashed forth the herald of many long years to come of definitely directed thought and work", he realized what he should have been chasing after. He couldn't "resist the impulse—unphilosophical as it may have been—to cut with a knife on a stone of Brougham Bridge" the fundamental formula with the symbols $ \mathrm{i} , \mathrm{J}, \mathrm{K}$:
Today, we have a much deeper understanding in quaternions. In this section, we will use a modern version of definition to discuss the properties of quaternions, which naturally leads us to probably the most commen application of quaternions: representing the rotations in $\mathbb{R}^3$. The power of quaternions is not just confined to the rotations; in fact, it is also useful in describing physics models, including electrodynamics, special relativity theory and classical mechanics. We may delve into that topic in the future.
The modern version of the definition takes each quaternion as a combinition of one real number and one three-dimensional real vector: $(a_0, \boldsymbol{\mathbf{a}} )$. In such a context, $ \mathrm{i} $, $\mathrm{j}$, $\mathrm{k}$ are interpreted as vectors that forms an orthonormal basis. To avoid potential confusions, we rephrase the three base vectors as $ \hat{\boldsymbol{\mathbf{i}}} , \hat{\boldsymbol{\mathbf{j}}} $ and $ \hat{\boldsymbol{\mathbf{k}}} $, and define the operation of quaternions as follows:
Note that though the addition of quaternions apparently commutes, the multiplication does not, because generally the cross product of vectors doesn't commute. Only when $ \boldsymbol{\mathbf{a}} $ is parallel to $ \boldsymbol{\mathbf{b}} $ (including the case where one of them is $ \boldsymbol{\mathbf{0}} $) can we say $ \boldsymbol{\mathbf{a}} \times \boldsymbol{\mathbf{b}} = \boldsymbol{\mathbf{0}} = \boldsymbol{\mathbf{b}} \times \boldsymbol{\mathbf{a}} $. Nonetheless, it is safe to say that the multiplication satisfies the distributivity.
For a quaternion $A=(a_0, \boldsymbol{\mathbf{a}} )$, the real number $a_0$ is called the real part , and the vector $ \boldsymbol{\mathbf{a}} $ is called the vector part. For convenience, sometimes we also write such a quaternion as $a_0+ \boldsymbol{\mathbf{a}} $, comparable to the representation of a complex number $(a, b)=a+b \mathrm{i} $. When the real part is $0$, we cal also just write the quaternion as $ \boldsymbol{\mathbf{a}} $; similarly, when the vector part is $ \boldsymbol{\mathbf{0}} $, we can write the quaternion as $a_0$. In that notation, we can interpret $a_0+ \boldsymbol{\mathbf{a}} $ as the addition of two quaternions. Similar to the case for complex numbers, we take the quaternions whose realparts are $0$ as real numbers.
If the direction for the vector parts is stricted along a long passing the origin, then the set of quaternions is actually the set of all complex numbers. You may see why by comparing the multiplications of restricted quaternions and complex numbers--note that in this restriction, the quaternions commute.
Instead of interpreting the quaternions as combinitions of a real number and a three-dimensional vector, we can also think of them as four-dimensional vectors. It is the same as the case when a complex number can be either the combinition of a real nunber and a one-dimensional vector,or a two-dimensional vector. In fact, the set of quaternions form a four-dimensional algebra3.
Similar to complex numbers, we can define the conjugates and modulus of quaternions.
Let $ \mathrm{e} $ is the base of the natural logarithm, then for any real number $x$, we have $ \mathrm{e} ^x = \sum^\infty\limits_{n=0} x^n / n!$. Now substitute $x$ with other objects that we can do multiplications, say matrices or complex numbers, we can extend the concept of exponential to more objects and construct the exponential of complex numbers or matrices.
Since we can multiply quaternions, we can also define the exponential of quaternions:
Since the addition of the quaternions commute and the multiplication generally doesn't , the condition $ \mathrm{e} ^{A+B} = \mathrm{e} ^A \mathrm{e} ^B$ is not always satisfied. Therefore, the exponential representation of quaternions is not as broadly applied as that of the complex numbers.
For a similar topic, please refer to 矩阵指数(matrix exponential). The property of the matrix exponential correspond to that of the quaternions, related by the matrix representation of quaternions.
The quaternions we just defined form a four-dimensional vector space with a base $\{1, \hat{\boldsymbol{\mathbf{i}}} , \hat{\boldsymbol{\mathbf{j}}} , \hat{\boldsymbol{\mathbf{k}}} \}$. If we merely extend the choice of the field from $\mathbb{R}$ to $\mathbb{C}$, then we get the so-called double quaternions or complex quaternions. A double quaternion is also interpreted as the combinition of a complex number and a complex vector.
For a double quaternion $A=a_0+a_1 \hat{\boldsymbol{\mathbf{i}}} +a_2 \hat{\boldsymbol{\mathbf{j}}} +a_3 \hat{\boldsymbol{\mathbf{k}}} $, the concept of conjugate is subdivided into the follow categories:
The modulus of a double quaternion is defined with the quaternion conjugate: $||A||=A\widetilde{A}=\widetilde{A}A=\sum^{3}_{n=0}a^2_n$. Differently from the case of quaternions, the modulus of a double quaternion can take the value of any complex number.
Unlike the "real" quaternions, there are non-zero complex quaternions whoses modulus is $0$. An example can be the $u$ and $v$ in ex. 1 .
Similarly, we can even let the coefficients of a quaternion be real quaternions, and that will be the secondary quaternions. However, since quaternions don't form a field, the properties of secondary quaternions are not as good as the complex coefficients, hence the application is limited and we will go too far into it.
The complex quaternions are used in representing special relativity and classical field theories. They also directly correspond to the Pauli matrices.
If we rephrase the four base vectors of a quaternion $1, \hat{\boldsymbol{\mathbf{i}}} , \hat{\boldsymbol{\mathbf{j}}} $ and $ \hat{\boldsymbol{\mathbf{k}}} $ with the following matrices respectively:
By direct calculations, you can check that the multiplication and addition of quaternions are isomorphic to the multiplication and addition of matrices after we associate the quaternions with the matrices.
In terms of the matrix representations, the transpose of $A$ is the transpose of the corresponding matrix; the complex conjugate is taking the conjugate of the matrix entries; the hermitian conjugate corresponds to the hermitian conjugate of the matrix (transpose and take the conjugate); the modulus square of a quaternion is the determinante of its matrix: $||A||= \operatorname {det}A$. Therefore, the (complex) quaternions whose modulus are not $0$ have inverses, which correspond to the reverse matrices.
One significant application of quaternions is the representation of rotations in a three dimensional real space. The application of quaternions largely conserved the amount of calculation needed.
Represent a three-dimensional real vector with a quaternion $v= \begin{pmatrix}0, \boldsymbol{\mathbf{v}} \end{pmatrix} $. If we rotate $ \boldsymbol{\mathbf{V}} $ regarding the unit vector $\hat{ \boldsymbol{\mathbf{n}} }$ by the angle $\theta$, what should the result be? Take the quaternion $q= \begin{pmatrix}\cos{\frac{\theta}{2}}, \hat{ \boldsymbol{\mathbf{n}} }\sin{\frac{\theta}{2}}\end{pmatrix} $, then the resultant vector is simply
Let's check it with a trivial example. Let $v=(0, 1, 0, 0)$, which is a unit vector on the $x$ axis. Now we rotate it regarding the a unit vector aloing the positive $z$ direction by $\pi/2$, then the quaternion representation of the resultant vector is$(0, 0, 1, 0)$. Check: the rotation is represented by $q=(\sqrt{2}/2, 0, 0, \sqrt{2}/2)$, and direct calculations give us $qvq^{-1}=(0,0,1,0)$.
Now we will prove the theorem so it works in all cases.
Take the quaternion for rotation $q= \begin{pmatrix}\cos{\frac{\theta}{2}}, \hat{ \boldsymbol{\mathbf{n}} }\sin{\frac{\theta}{2}}\end{pmatrix} $ and an arbitrary vector $v=(0, \boldsymbol{\mathbf{v}} )$. Direct calculations yield:
If we take $ \boldsymbol{\mathbf{v}} \perp\hat{ \boldsymbol{\mathbf{n}} }$, considering that $\hat{ \boldsymbol{\mathbf{n}} }\times(\hat{ \boldsymbol{\mathbf{n}} }\times \boldsymbol{\mathbf{v}} )=- \boldsymbol{\mathbf{v}} $ and $\hat{ \boldsymbol{\mathbf{n}} }\cdot \boldsymbol{\mathbf{v}} =0$, the result of eq. 12 becomes:
The is just the rotation of $ \boldsymbol{\mathbf{v}} $ pivoting on $\hat{ \boldsymbol{\mathbf{n}} }$ by $\theta$.
If we take $ \boldsymbol{\mathbf{v}} \parallel\hat{ \boldsymbol{\mathbf{n}} }$, considering that $\hat{ \boldsymbol{\mathbf{n}} }\times \boldsymbol{\mathbf{v}} =0$ and $(\hat{ \boldsymbol{\mathbf{n}} }\cdot \boldsymbol{\mathbf{v}} ) \boldsymbol{\mathbf{v}} = \boldsymbol{\mathbf{v}} $, then the result of eq. 12 becomes:
This is also the rotation of $ \boldsymbol{\mathbf{v}} $ pivoting on $\hat{ \boldsymbol{\mathbf{n}} }$ by $\theta$.
Summing up, if we take an arbitrary vector $ \boldsymbol{\mathbf{v}} $ and decompose it into two parts: $ \boldsymbol{\mathbf{v}} _\perp$ which is perpendicular to $ \hat{\boldsymbol{\mathbf{n}}} $ and $ \boldsymbol{\mathbf{v}} _\parallel$ parallel to it, then $q v_\perp q^{-1}$ and $q v_\parallel q^{-1}$ are the results of the rotation for these two vectors, hence their combinition, $qvq^{-1}$, is the rotation of $v$ because rotation is a linear transformation.
A comment: since the quaternions become the complex numbers, the case where $ \boldsymbol{\mathbf{v}} \parallel\hat{ \boldsymbol{\mathbf{n}} }$ can simply be deduced from the commutivity of complex multiplications, namely $qvq^{-1}=qq^{-1}v=v$.
1. ^ John Derbyshire, Unknown Quantity: A Real and Imaginary History of Algebra, 2006, Joseph Henry Press, Ch. 8, pp. 148.
2. ^ Derbyshire, pp. 151.
3. ^ In terms of algebra, the problem that Hamilton was dying to solve can be rephrased as "is there a three-dimensional algebra over the field $\mathbb{R}$". Now we know that the "algebra with multiplicity and the existence of multiplication inverses over the field $\mathbb{R}$" only occurs in one, two, four or eight dimensions, where quaternions already lose their commutivity and the octonions (eight dimensions) lose their associativity. Here the "multiplicity" refers to the property that $ \left\lvert a \right\rvert \left\lvert b \right\rvert = \left\lvert ab \right\rvert $.