Assumptions of SR
We begin with Einstein's introduction to his theory of Special Relativity (SR):
"There is hardly a simpler law in physics than that according to which light is propagated in empty space. Every child at school knows, or believes he knows, that this propagation takes place in straight lines with velocity $c=300,000 ~km/sec$. Who would imagine that this simple law has plunged the conscientiously thoughtful physicist into the greatest intellectual difficulties?"
The elaboration of these intellectual difficulties arising from this seemingly "simple law" is the subject of our discussion today.
What are these difficulties? Namely to reconcile the following axiomatic assumptions of SR:
(A1) that the laws of physics are the same in all nonaccelerated reference frames, i.e. if $K'$ is a coordinate system moving uniformly (and devoid of rotation) with respect to a coordinate system $K$, then natural phenomena run their course with respect to $K'$ according to exactly the same laws as with respect to $K$.
(A2) that light in vacuum propagates along straight lines with constant velocity $c\approx 300,000 ~km/sec$.
Apparent Difficulties in (A12)
The difficulties arise from the conjunction (A1) and (A2), which we denote by (A12). $$(A12):=(A1) \wedge (A2).$$
If Assumption (A2) is indeed a Law of nature, then according to (A1) this law of propagation should be invariant in every inertial reference frame. But (A12) appears contradictory to the principles of classical mechanics, e.g. Fizeau law of addition of velocities.
Einstein alleges that these difficulties are only apparent incompatibilities, all of which are reconciled by postulating Lorentz-Fitzgerald formulae for length contractions and time dilations.
The assertion that the Lorentz formulae reconcile these difficulties contains two claims:
(i) that inertial frames $K, K'$ are related by Lorentz transformations.
(ii) that the law of propagation of light (A2) is Lorentz invariant. I.e. if $(A2)$ is satisfied in $K$, then $(A2)$ is satisfied in every Lorentz translate $K'=\lambda.K$.
We critically examine these claims below. In what follows we will address both wave and particle models of light. Ultimately we argue that (A12) is incompatible with both models, i.e. incompatible with Bohr's complementarity principle.
Lorentz Transformations
The linear algebra of Minkowski space and Lorentz transformations plays a definitive role in SR.
Lorentz transformations in the setting of SR can be defined as the group of linear transformations $\lambda: {\bf{R}}^{4} \to {\bf{R}}^4$ which satisfy $\lambda^*(h)=h$ where $h=ds^2$ is the Minkowski-Lorentz quadratic form $$h:=ds^2=-c^2dt^2+dx^2+dy^2+dz^2.$$ Here $c$ is the constant luminal velocity in vacuum posited by (A2).
The Lorentz invariance of $h$ says $$\xi^2+\eta^2+\zeta^2-c^2 \tau^2$$ is numerically equal to $$x^2+y^2+z^2-c^2 t^2$$ for every Lorentz transform $\lambda$ satisfying $$(\xi, \eta, \zeta, \tau)=\lambda.(x,y,z,t).$$ Thus it is assumed that $h$ is a scalar invariant for all inertial observers.
The group of Lorentz transformations was hypothesized as an attempt to explain the observed null result of Michelson-Morley's interferometer experiment. The experiment was intended to measure variations of the speed of light relative to the aether. No such variations were discovered, and it was postulated that the usual space and time coordinates $(x,y,z,t)$ and $(\xi, \eta, \zeta, \tau)$ of two inertial observers $K$ and $K'$, respectively, were not related by Galilean transformations, but related by the Lorentz-Fitzgerald formulae. Incredibly and contrary to all expectations, the material arm of the interferometer contracted in the direction of motion and simultaneously the time parameter was contracted by the same ratio, namely the so-called beta factor $\beta = 1/\sqrt{1-v^2/c^2}$.
Uniqueness of Lorentz Invariant Quadratic Forms
It's interesting result of Elton and Arminjon [insert ref] that Minkowski's form $h$ is the only Lorentz invariant quadratic form on ${\bf{R}}^4$ modulo homothety. This has useful consequence for the homogeneous wave equation, as we discuss below.
Minkowski Space and Null Cone
Now we say something about the photon model of light as treated in Levi-Civita's book [III.XI.6, pp.301]. If light satisfies (A2), then in a reference frame $K$, light is something $$\gamma(t)=(x(t),y(t),z(t))$$ that travels through space with time, and whose velocity $\gamma'$ if it could be materially measured as a function of $t$ would satisfy
- [Eq1] $$||\gamma'||^2=(dx/dt)^2+(dy/dt)^2+(dz/dt)^2=c^2.$$
Thus it is argued that light trajectories are constrained to the null cone $$N:=\{h=0\} \subset {\bf{R}}^4$$ of Minkowski's metric $h$. Obviously the null cone $N$ is Lorentz invariant subspace, and defined by the equation $$x^2+y^2+z^2=c^2 t^2$$ in any reference frame $K$ with coordinates $(x,y,z,t)$.
What does [Eq1] say about the Propagation of Light?
Again if light is something $\gamma$ that travels, then [Eq1] implies the speed of $\gamma$ is identically equal to $c$ as measured in $K$.
But the speed of $\gamma$ is independant of the direction of motion of $\gamma$ constrained to $N$.
Einstein's assumption (A2) attempts to prescribe the velocity by assuming the propagation is along straight lines with constant speed. But again the velocity is still underdetermined, since the direction of the particle is not determined by it's belonging to a straight line.
We are proposing that the direction and magnitude are not determined uniquely by (A2). Moreover the Lorentz transformations do not satisfy Claim (ii), i.e. (A2) is not Lorentz invariant.
The problem is that the schoolboy's image of the propagation of light in vacuum is a Riemannian perspective. But this Riemannian perspective is not Lorentz invariant. Indeed in Riemannian geometry, if a particle is travelling in a straight line and with constant velocity, then the propagation of the particle, namely it's position as a function of time, \emph{is} uniquely determined. However, in Lorentzian geometry, a particle which is travelling in a straight line along the null cone will always have a constant speed, regardless of its trajectory. Supposing that the trajectory is confined to a straight line, there still remains the question of the position of the particle as a function of time. The problem is that the \emph{uniformity} of the straight line propagation is meaningless on the null cone of Lorentzian geometry.
Levi-Civita's Variational Equation is Trivial on the Null Cone
Here we introduce Levi-Civita's approach as represented in his excellent textbook. Levi-Civita modifies (A2) somewhat by asserting that "the propagation of light is rectilinear, uniform, and with velocity $c$".
The term "uniform" does not feature in Einstein's formulation of (A2), although it speaks to the hidden assumption that the light rays have a canonical parameter describing the \emph{uniform} motion of the light ray. The above remarks are directly related to Levi-Civita's characterization of geometric optics in the following two equations (see [III.XI.16]):
[Eq2] $$\delta \int ds=0 $$ and
[Eq3] $$ds^2=0.$$
The first equation [Eq2] says the variational derivative of the functional $\gamma \mapsto \int_\gamma ds$ vanishes on the light trajectories, and the second equation [Eq3] says the trajectory is constrained to the null cone.
In the Riemannian setting where $ds$ is positive definite, the equation [Eq2] is essentially equivalent to the geodesic equation $\nabla_{\gamma'} \gamma'=0$.
However in the Lorentzian setting we find [Eq2]reduces to $0=0$ on the null cone $N$. Thus the usual Riemannian $ds>0$ argument does \emph{not} establish the corresponding "geodesic" equation on $N$. This is acknowledged in Levi-Civita [III.XI.14], but Levi-Civita argues that zero length geodesics are limits of Riemannian geodesics ($ds>0$) and that "there is a process of passing to the limit (in conditions of complete analytical regularity) from ordinary geodesics". Levi-Civita maintains that the variational equation [Eq2] somehow "implies" the geodesic-type equation $\nabla_{\gamma'} \gamma'=0$ for light rays.
Our viewpoint is that $\nabla_{\gamma'} \gamma'=0$ is an independant hypothesis, and by no means a formal consequence of [Eq2]. This is related to our contention that contrary to Levi-Civita's claims, the variational equation [Eq2] on $N=\{ds=0\}$ is trivial. In Riemannian geometric terms, we find straight lines on $N$ have no canonical parameterizations, even affine. This reveals a clear distinction between Riemannian straight lines which do have a canonical arclength parameter $ds$, and the null lines $\ell \subset N$ which do not admit canonical $ds$ arclength parameter except the trivial $ds=0$.