General Relativity: Part I: Mathematical Background
General Relativity: Part I: Mathematical Background
General Relativity
The paper presents non-quantum physics with emphasis on the general theory of relativity. The entire
theory but not many applications or approximation techniques are included. Required mathematics is
rigorously developed.
Part I: Mathematical Background
Section 1: Special Mathematical Notation
To keep formulas simple, special notation is used. When a statement has free indexes
(superscripts and subscripts that are variable and not assigned a particular value), the statement is
assumed true for all values of the indexes. For example, “xi=cit where i can be 1, 2, or 3” means
“x1=c1t, x2=c2t, and x3=c3t”. Because of the extensive need for indexes, superscripts of variables are
typically interpreted as indexes instead of powers. Instead, use parenthesis to express power. For
example, x2≠x*x=(x)2. Sequences are denoted as follows: x1, x2, …, xn is x1, x2, …, xn- 1, xn if n>1, is x1
if n=1, and is nothing when n=0.
Summation convention states that when an index variable appears twice in an expression,
summation over the allowed values of the index is assumed. For example, the chain rule simplifies to
z z yi
. To avoid using the summation convention, state “no summation” or enclose the indexes
x yi x
in parenthesis. The expression must be expanded before applying the summation:
(xi+yi)*(xi+yi)=xixi+2xiyi+yiyi=(x1+y1)2+(x2+y2)2 with i=1, 2. The summation is not assumed when the
indexes (after the expression is expanded) are separated by addition or subtraction: ai+bi is not
summed up over all i. The summation occurs as close to the indexes as reasonable: (xiyi)2 with i=1, 2
is (x1y1+x2y2)2. When manipulating expressions using the summation convention, you can usually
ignore implicit summation sign.
decreased by 1.) Inner product is an outer product followed by contraction(s). Addition, subtraction,
multiplication by a scalar, contraction, outer product, and thus inner product of tensors can be shown to
transform under coordinate change as tensors would by the definition. For example, C=A⊗B⇔
C A B where the bars indicate a new coordinate system. Tensor differentiation and integration, if
i
done component by component, does not transform properly: often, T x T x i . Section 6
describes how to differentiate tensors.
When a quantity with proper components is defined in multiple coordinate systems or in terms
of tensors, the quantity is said to be a tensor if it always obeys the transformation equation. Tensors
are usually preferred over “nontensors” because a tensor needs to be defined in only one coordinate
system and because change of tensor components under coordinate transformations is predictable and
depend only on the coordinates and the nature of the tensor.
Otherwise, if a1a2…an is an even permutation of b1b2…bn, the component is one, odd permutation—the
component is -1. Otherwise (that is if ∃i (1≤i≤n) ∀j (1≤j≤n) aj≠bi), the component is zero. T is
called is symmetric with respect to ix and iy (x≠y) if exchange of values of ix and iy does not affect any
of the components, and anti-symmetric if the exchange multiplies every component by -1. (Both ix
and iy must be either subscripts or superscripts.) For example, δij is a symmetric tensor; b11b22...bnn is an
a a ...a
anti-symmetric tensor with respect to any subscripts or superscripts. Note that if T is anti-symmetric
with respect to ix and iy, then the value of every component where ix=iy is zero. Permutation tensor, e,
is anti-symmetric with respect to any two components, and e12..n=e12..n=1. Weight of eij..k, eij..k, δij, and δij
is not zero.
A tensor of rank (0, 0) is called a scalar and has only one component. A tensor of rank (1, 0) is
called contravariant vector or simply vector. For example, position and velocity are vectors. A
contravariant vector can be displayed as a sequence such as (a1, a2, …, an). The transformation
equation implies that in smaller coordinates vector components are larger. A tensor of rank (0, 1) is
called covariant vector or a 1-form. Covariant vectors can be interpreted as assigning for each
contravariant vector a real number in a linear way through a dot product. The transformation equation
implies that larger coordinates cause larger components.
ij ...k
Let EiEj…EkElEm…En be defined such that (EiEj…EkElEm…En) lm...n =1 and other components are
ij ...k
zero. EiEj…EkElEm…En form a basis since an arbitrary tensor A= Alm...n EiEj…EkElEm…En (The
summation is over all components.) (Visualization of 1-forms: xiEi as a set of yiEi such that xiyi=1.)
A multilinear form is a function that takes n vectors and returns a scalar in a linear way with
respect to each of the vectors. For example, f(cE1, E2) = cf(E1, E2), f(E1, E2+E3) = f(E1, E2)+f(E1, E3).
Multilinear forms are, informally, essentially tensors of rank (0, n). That is T ij…k*uivj…wk=f(u, v, …, w)
for an appropriate absolute tensor T.
ij ...k
Note: Determinant of a square matrix A, det A= e A1i * A2 j *...* Ank .
made of metric tensors. Geometry of any space with inner product—including distance, angles, and
curvature—is entirely determined by the metric. Length squared, ||u||2≡<u, u>. Note that for some
spaces ||u||2 can be zero or even negative for nonzero u. Metric is called positive definitive if in all
cases <u, u> ≤ 0. An associate tensor to tensor T is a tensor to which T can be transformed through
inner product(s) with metric tensor and/or conjugate metric tensor. Example: gabTa=Tb.
Note: Outside of relativity, inner product is defined so that <u, u> ≥ 0.
Theorem: (a) gab is an absolute tensor of rank (0, 2); (b) gab=gba; (c) if metric tensor exists, conjugate
metric tensor also exists; (d) gabuavb = <u, v>. (e) For every symmetric tensor g such that the matrix g ij
is invertible, there exists an inner product such that gabuavb = <u, v>. (f) If metric is positive definitive,
<u, u> = 0⇔u=0.
Section 5: Manifolds
Definition: n-dimensional smooth manifold is a set of points combined with a set of coordinate
systems such that:
1. Every coordinate system (also called smooth chart) consists of a subset R of the manifold and (x1,
x2, …, xn) for each point in R. All xi are real numbers.
2. In every coordinate system, different points have different coordinates.
3. Every coordinate system is open and connected. Open means that for each (x1, x2, …, xn) in the
system there exists r>0 such that every (y1, y2, …, yn) belongs to the coordinate system provided
that (xi-yi)*(xi-yi)<r*r. Connected means that for every pair of coordinates y=(y1, y2, …, yn)
and z=(z1, z2, …, zn), there exists a continuous function f(t)= (x1, x2, …, xn)(t) (where 0≤t≤1,
and each (x1, x2, …, xn) are valid coordinates for the coordinate system) such that f(0)=y and
f(1)=z.
4. Union of all R is the set of points that make up the manifold.
5. For every 2 coordinate systems that share some of the points, the transformation between the
coordinate systems must be smooth (that is infinitely differentiable) and thus have a nonzero
Jacobian.
6. For every 2 points A and B, there exists A1, A2, …, An such that A1=A, An=B, and for every integer i
(0<i<n) there exists a coordinate system such that Ai and Ai+1 belong to the system. In other
words, the manifold is connected.
Theorem: For every point in every pseudo-Riemannian manifold, there exists a coordinate system
(called geodesic coordinate system) for which ∂gij/∂xk (and thus all of the Christoffel symbols) are
zero at the point.
Part II: General Theory of Relativity
Section 1: Geometry of space-time.
Our world is modeled by a 4-dimensional pseudo-Riemannian manifold called space-time. For
every point, there exists a frame of reference, that is a coordinate system, such that in the frame at that
point, the metric, g=η where η≡E1E1+E2E2+E3E3-E4E4. Such frame of reference is called locally
normal. If g=η and ∂gab/∂xc=0, the frame of reference (for each point, such frame can be proven to
exist) is called locally inertial. MCRF is a locally normal frame of reference where the point object is
currently at rest.
To be a complete theory, the mathematical model must be related to human perception. If g= η,
x4=A4E4 refers to a point in time and A iEi = (x1, x2, x3) (i=1, 2, 3) refers to a location in space. Let an
observer be at rest in a locally inertial frame of reference. The observer will locally define distance in
space between (x1, x2, x3) and (y1, y2, y3) as √((x1-y1)2+(x2-y2)2+(x3-y3)2) and interval in time between x4
and y4 as |x4-y4|. Observers disagreeing on who is at rest may disagree about distance in space and
interval in time. An observer at time t tends to consider space at time T>t as the future (and not yet
occurring), space at T=t to be the present real world, and space at T<t as the past (what has already
happened) provided that the direction of time is positive; otherwise, reverse ‘<’ and ‘>’.
If g→η at xi and d→0, then V(R)/(d)3→1, where V(R) is volume in space of all points ri such that
xi<ri<xi+d (i=1, 2, 3). For a property F based on a region of space, its density is dF/dV.
Exercises
1. Show that C=A*B⇔Cik=AijBjk for all matrices A, B, and C.
2. For a tensor of rank (l, m) and weight W in n-dimensional space increase of coordinates in ‘a’ times
increases the tensor components in ‘b’ times. Prove: W = (l-m+logba)/n.
3. How does coordinate change affect
(a) scalars (b) contravariant vectors (c) covariant vectors
(d) tensors of rank (0, 2) (e) tensors of rank (2, 0) (f) tensors of rank (1, 1).
4. Prove that (a) b11b22...bnn =det A where Aij= b ji (b) b11b22...bnn e 1 2 n eb1b2 ...bn .
a a ...a a a a ...a a a ...a