From Wikipedia, the free encyclopedia  View original article

In geometry, curvilinear coordinates are a coordinate system for Euclidean space in which the coordinate lines may be curved. These coordinates may be derived from a set of Cartesian coordinates by using a transformation that is locally invertible (a onetoone map) at each point. This means that one can convert a point given in a Cartesian coordinate system to its curvilinear coordinates and back. The name curvilinear coordinates, coined by the French mathematician Lamé, derives from the fact that the coordinate surfaces of the curvilinear systems are curved.
Wellknown examples of curvilinear coordinate systems in threedimensional Euclidean space (R^{3}) are Cartesian, cylindrical and spherical polar coordinates. A Cartesian coordinate surface in this space is a plane; for example z = 0 defines the xy plane. In the same space, the coordinate surface r = 1 in spherical polar coordinates is the surface of a unit sphere, which is curved. The formalism of curvilinear coordinates provides a unified and general description of the standard coordinate systems.
Curvilinear coordinates are often used to define the location or distribution of physical quantities which may be, for example, scalars, vectors, or tensors. Mathematical expressions involving these quantities in vector calculus and tensor analysis (such as the gradient, divergence, curl, and Laplacian) can be transformed from one coordinate system to another, according to transformation rules for scalars, vectors, and tensors. Such expressions then become valid for any curvilinear coordinate system.
Depending on the application, a curvilinear coordinate system may be simpler to use than the Cartesian coordinate system. For instance, a physical problem with spherical symmetry defined in R^{3} (for example, motion of particles under the influence of central forces) is usually easier to solve in spherical polar coordinates than in Cartesian coordinates. Equations with boundary conditions that follow coordinate surfaces for a particular curvilinear coordinate system may be easier to solve in that system. One would for instance describe the motion of a particle in a rectangular box in Cartesian coordinates, whereas one would prefer spherical coordinates for a particle in a sphere. Spherical coordinates are one of the most used curvilinear coordinate systems in such fields as Earth sciences, cartography, and physics (in particular quantum mechanics, relativity), and engineering.
For now, consider 3d space. A point P in 3d space can be defined using Cartesian coordinates (x, y, z) [equivalently written (x_{1}, x_{2}, x_{3})], or in another system (q_{1}, q_{2}, q_{3}), as shown in Fig. 1. The latter is a curvilinear coordinate system, and (q_{1}, q_{2}, q_{3}) are the curvilinear coordinates of the point P.
The surfaces q_{1} = constant, q_{2} = constant, q_{3} = constant are called the coordinate surfaces; and the space curves formed by their intersection in pairs are called the coordinate curves. The coordinate axes are determined by the tangents to the coordinate curves at the intersection of three surfaces. They are not in general fixed directions in space, which happens to be the case for simple Cartesian coordinates.
A basis whose vectors change their direction and/or magnitude from point to point is called local basis. All bases associated with curvilinear coordinates are necessarily local. Basis vectors that are the same at all points are global bases, and can be associated only with linear or affine coordinate systems.
Note: usually all basis vectors are denoted by e, for this article e is for the standard basis (Cartesian) and b is for the curvilinear basis.
The relation between the coordinates is given by the invertible transformations:
Any point can be written as a position vector r in Cartesian coordinates:
where x, y, z are the coordinates of the position vector with respect to the standard basis vectors e_{x}, e_{y}, e_{z}.
However, in a general curvilinear system, there may well not be any natural global basis vectors. Instead, we note that in the Cartesian system, we have the property that
We can apply the same idea to the curvilinear system to determine a system of basis vectors at P. We define
These may not have unit length, and may also not be orthogonal. In the case that they are orthogonal at all points where the derivatives are welldefined, we define the Lamé coefficients (after Gabriel Lamé) by
and the curvilinear orthonormal basis vectors by
It is important to note that these basis vectors may well depend upon the position of P; it is therefore necessary that they are not assumed to be constant over a region. (They technically form a basis for the tangent bundle of at P, and so are local to P.)
In general, curvilinear coordinates allow the generality of basis vectors not all mutually perpendicular to each other, and not required to be of unit length: they can be of arbitrary magnitude and direction. The use of an orthogonal basis makes vector manipulations simpler than for nonorthogonal. However, some areas of physics and engineering, particularly fluid mechanics and continuum mechanics, require nonorthogonal bases to describe deformations and fluid transport to account for complicated directional dependences of physical quantities. A discussion of the general case appears later on this page.
Since the total differential change in r is
so scale factors are
They can also be written for each component of r:
However, this designation is very rarely used, largely replaced with the components of the metric tensor g_{ik} (see below).
The basis vectors, gradients, and scale factors are all interrelated within a coordinate system by two methods:
So depending on the method by which they are built, for a general curvilinear coordinate system there are two sets of basis vectors for every point: {b_{1}, b_{2}, b_{3}} is the covariant basis, and {b^{1}, b^{2}, b^{3}} is the contravariant basis.
A vector v can be given in terms either basis, i.e.,
The basis vectors relate to the components by^{[2]}^{(pp30–32)}
and
where g is the metric tensor (see below).
A vector is covariant or contravariant if, respectively, its components are covariant (lowered indices, written v_{k}) or contravariant (raised indices, written v^{k}). From the above vector sums, it can be seen that contravariant vectors are represented with covariant basis vectors, and covariant vectors are represented with contravariant basis vectors.
A key convention in the representation of vectors and tensors in terms of indexed components and basis vectors is invariance in the sense that vector components which transform in a covariant manner (or contravariant manner) are paired with basis vectors that transform in a contravariant manner (or covariant manner).
Consider the onedimensional curve shown in Fig. 3. At point P, taken as an origin, x is one of the Cartesian coordinates, and q^{1} is one of the curvilinear coordinates (Fig. 3). The local (nonunit) basis vector is b_{1} (notated h_{1} above, with b reserved for unit vectors) and it is built on the q^{1} axis which is a tangent to that coordinate line at the point P. The axis q^{1} and thus the vector b_{1} form an angle α with the Cartesian x axis and the Cartesian basis vector e_{1}.
It can be seen from triangle PAB that
where e_{1}, b_{1} are the magnitudes of the two basis vectors, i.e., the scalar intercepts PB and PA. Note that PA is also the projection of b_{1} on the x axis.
However, this method for basis vector transformations using directional cosines is inapplicable to curvilinear coordinates for the following reasons:
The angles that the q^{1} line and that axis form with the x axis become closer in value the closer one moves towards point P and become exactly equal at P.
Let point E be located very close to P, so close that the distance PE is infinitesimally small. Then PE measured on the q^{1} axis almost coincides with PE measured on the q^{1} line. At the same time, the ratio PD/PE (PD being the projection of PE on the x axis) becomes almost exactly equal to cos α.
Let the infinitesimally small intercepts PD and PE be labelled, respectively, as dx and dq^{1}. Then
Thus, the directional cosines can be substituted in transformations with the more exact ratios between infinitesimally small coordinate intercepts. It follows that the component (projection) of b_{1} on the x axis is
If q^{i} = q^{i}(x_{1}, x_{2}, x_{3}) and x_{i} = x_{i}(q^{1}, q^{2}, q^{3}) are smooth (continuously differentiable) functions the transformation ratios can be written as and . That is, those ratios are partial derivatives of coordinates belonging to one system with respect to coordinates belonging to the other system.
Doing the same for the coordinates in the other 2 dimensions, b_{1} can be expressed as:
Similar equations hold for b_{2} and b_{3} so that the standard basis {e_{1}, e_{2}, e_{3}} is transformed to a local (ordered and normalised) basis {b_{1}, b_{2}, b_{3}} by the following system of equations:
By analogous reasoning, one can obtain the inverse transformation from local basis to standard basis:
The above systems of linear equations can be written in matrix form as
This coefficient matrix of the linear system is the Jacobian matrix (and its inverse) of the transformation. These are the equations that can be used to transform a Cartesian basis into a curvilinear basis, and vice versa.
In three dimensions, the expanded forms of these matrices are
In the inverse transformation (second equation system), the unknowns are the curvilinear basis vectors. For all points there can only exist one and only one set of basis vectors (else vectors are not well defined at those points). This condition is satisfied if and only if the equation system has a single solution, from linear algebra, a linear equation system has a single solution (nontrivial) only if the determinant of its system matrix is nonzero:
which shows the rationale behind the above requirement concerning the inverse Jacobian determinant.
The formalism extends to any finite dimension as follows.
Consider the real Euclidean ndimensional space, that is R^{n} = R × R × ... × R (n times) where R is the set of real numbers and × denotes the Cartesian product, which is a vector space.
The coordinates of this space can be denoted by: x = (x_{1}, x_{2},...,x_{n}). Since this is a vector (an element of the vector space), it can be written as:
where e^{1} = (1,0,0...,0), e^{2} = (0,1,0...,0), e^{3} = (0,0,1...,0),...,e^{n} = (0,0,0...,1) is the standard basis set of vectors for the space R^{n}, and i = 1, 2,...n is an index labelling components. Each vector has exactly one component in each dimension (or "axis") and they are mutually orthogonal (perpendicular) and normalized (has unit magnitude).
More generally, we can define basis vectors b_{i} so that they depend on q = (q_{1}, q_{2},...,q_{n}), i.e. they change from point to point: b_{i} = b_{i}(q). In which case to define the same point x in terms of this alternative basis: the coordinates with respect to this basis v_{i} also necessarily depend on x also, that is v_{i} = v_{i}(x). Then a vector v in this space, with respect to these alternative coordinates and basis vectors, can be expanded as a linear combination in this basis (which simply means to multiply each basis vector e_{i} by a number v_{i} – scalar multiplication):
The vector sum that describes v in the new basis is composed of different vectors, although the sum itself remains the same.
From a more general and abstract perspective, a curvilinear coordinate system is simply a coordinate patch on the differentiable manifold E^{n} (ndimensional Euclidean space) that is diffeomorphic to the Cartesian coordinate patch on the manifold.^{[3]} Note that two diffeomorphic coordinate patches on a differential manifold need not overlap differentiably. With this simple definition of a curvilinear coordinate system, all the results that follow below are simply applications of standard theorems in differential topology.
The transformation functions are such that there's a onetoone relationship between points in the "old" and "new" coordinates, that is, those functions are bijections, and fulfil the following requirements within their domains:
is not zero; meaning the transformation is invertible: x_{i}(q).
according to the inverse function theorem. The condition that the Jacobian determinant is not zero reflects the fact that three surfaces from different families intersect in one and only one point and thus determine the position of this point in a unique way.^{[4]}Elementary vector and tensor algebra in curvilinear coordinates is used in some of the older scientific literature in mechanics and physics and can be indispensable to understanding work from the early and mid1900s, for example the text by Green and Zerna.^{[5]} Some useful relations in the algebra of vectors and secondorder tensors in curvilinear coordinates are given in this section. The notation and contents are primarily from Ogden,^{[6]} Naghdi,^{[7]} Simmonds,^{[2]} Green and Zerna,^{[5]} Basar and Weichert,^{[8]} and Ciarlet.^{[9]}
A secondorder tensor can be expressed as
where denotes the tensor product. The components S^{ij} are called the contravariant components, S^{i} _{j} the mixed rightcovariant components, S_{i} ^{j} the mixed leftcovariant components, and S_{ij} the covariant components of the secondorder tensor. The components of the secondorder tensor are related by
At each point, one can construct a small line element dx, so the square of the length of the line element is the scalar product dx • dx and is called the metric of the space, given by:
and the symmetric quantity
is called the fundamental (or metric) tensor of the Euclidean space in curvilinear coordinates.
Indices can be raised and lowered by the metric:
Defining the scale factors h_{ij} by
gives a relation between the metric tensor and the Lamé coefficients. Note also that
where h_{ij} are the Lamé coefficients. For an orthogonal basis we also have:
If we consider polar coordinates for R^{2}, note that
(r, θ) are the curvilinear coordinates, and the Jacobian determinant of the transformation (r,θ) → (r cos θ, r sin θ) is r.
The orthogonal basis vectors are b_{r} = (cos θ, sin θ), b_{θ} = (−r sin θ, r cos θ). The normalized basis vectors are e_{r} = (cos θ, sin θ), e_{θ} = (−sin θ, cos θ) and the scale factors are h_{r} = 1 and h_{θ}= r. The fundamental tensor is g_{11} =1, g_{22} =r^{2}, g_{12} = g_{21} =0.
In an orthonormal righthanded basis, the thirdorder alternating tensor is defined as
In a general curvilinear basis the same tensor may be expressed as
It can also be shown that
where the comma denotes a partial derivative (see Ricci calculus). To express Γ_{ijk} in terms of g_{ij} we note that
Since
using these to rearrange the above relations gives
This implies that
Other relations that follow are
The scalar product of two vectors in curvilinear coordinates is^{[2]}^{(p32)}
The cross product of two vectors is given by^{[2]}^{(pp32–34)}
where is the permutation symbol and is a Cartesian basis vector. In curvilinear coordinates, the equivalent expression is
Adjustments need to be made in the calculation of line, surface and volume integrals. For simplicity, the following restricts to three dimensions and orthogonal curvilinear coordinates. However, the same arguments apply for ndimensional spaces. When the coordinate system is not orthogonal, there are some additional terms in the expressions.
Simmonds,^{[2]} in his book on tensor analysis, quotes Albert Einstein saying^{[10]}
The magic of this theory will hardly fail to impose itself on anybody who has truly understood it; it represents a genuine triumph of the method of absolute differential calculus, founded by Gauss, Riemann, Ricci, and LeviCivita.
Vector and tensor calculus in general curvilinear coordinates is used in tensor analysis on fourdimensional curvilinear manifolds in general relativity,^{[11]} in the mechanics of curved shells,^{[9]} in examining the invariance properties of Maxwell's equations which has been of interest in metamaterials^{[12]}^{[13]} and in many other fields.
Some useful relations in the calculus of vectors and secondorder tensors in curvilinear coordinates are given in this section. The notation and contents are primarily from Ogden,^{[14]} Simmonds,^{[2]} Green and Zerna,^{[5]} Basar and Weichert,^{[8]} and Ciarlet.^{[9]}
Let φ = φ(x) be a well defined scalar field and v = v(x) a welldefined vector field, and λ_{1}, λ_{2}... be parameters of the coordinates
is a tangent vector to C in curvilinear coordinates (using the chain rule). Using the definition of the Lamé coefficients, and that for the metric g_{ij} = 0 when i ≠ j, the magnitude is:
where is the permutation symbol. In determinant form:
Operator  Scalar field  Vector field 

Line integral  
Surface integral  
Volume integral 
The expressions for the gradient, divergence, and Laplacian can be directly extended to ndimensions, however the curl is only defined in 3d.
The vector field b_{i} is tangent to the q^{i} coordinate curve and forms a natural basis at each point on the curve. This basis, as discussed at the beginning of this article, is also called the covariant curvilinear basis. We can also define a reciprocal basis, or contravariant curvilinear basis, b^{i}. All the algebraic relations between the basis vectors, as discussed in the section on tensor algebra, apply for the natural basis and its reciprocal at each point x.
Operator  Scalar field  Vector field  2nd order tensor field 

Gradient  
Divergence  N/A  where a is an arbitrary constant vector. In curvilinear coordinates,  
Laplacian  
Curl  N/A  For vector fields in 3d only, where is the LeviCivita symbol.  N/A 
An inertial coordinate system is defined as a system of space and time coordinates x_{1}, x_{2}, x_{3}, t in terms of which the equations of motion of a particle free of external forces are simply d^{2}x_{j}/dt^{2} = 0.^{[15]} In this context, a coordinate system can fail to be “inertial” either due to nonstraight time axis or nonstraight space axes (or both). In other words, the basis vectors of the coordinates may vary in time at fixed positions, or they may vary with position at fixed times, or both. When equations of motion are expressed in terms of any noninertial coordinate system (in this sense), extra terms appear, called Christoffel symbols. Strictly speaking, these terms represent components of the absolute acceleration (in classical mechanics), but we may also choose to continue to regard d^{2}x_{j}/dt^{2} as the acceleration (as if the coordinates were inertial) and treat the extra terms as if they were forces, in which case they are called fictitious forces.^{[16]} The component of any such fictitious force normal to the path of the particle and in the plane of the path’s curvature is then called centrifugal force.^{[17]}
This more general context makes clear the correspondence between the concepts of centrifugal force in rotating coordinate systems and in stationary curvilinear coordinate systems. (Both of these concepts appear frequently in the literature.^{[18]}^{[19]}^{[20]}) For a simple example, consider a particle of mass m moving in a circle of radius r with angular speed w relative to a system of polar coordinates rotating with angular speed W. The radial equation of motion is mr” = F_{r} + mr(w + W)^{2}. Thus the centrifugal force is mr times the square of the absolute rotational speed A = w + W of the particle. If we choose a coordinate system rotating at the speed of the particle, then W = A and w = 0, in which case the centrifugal force is mrA^{2}, whereas if we choose a stationary coordinate system we have W = 0 and w = A, in which case the centrifugal force is again mrA^{2}. The reason for this equality of results is that in both cases the basis vectors at the particle’s location are changing in time in exactly the same way. Hence these are really just two different ways of describing exactly the same thing, one description being in terms of rotating coordinates and the other being in terms of stationary curvilinear coordinates, both of which are noninertial according to the more abstract meaning of that term.
When describing general motion, the actual forces acting on a particle are often referred to the instantaneous osculating circle tangent to the path of motion, and this circle in the general case is not centered at a fixed location, and so the decomposition into centrifugal and Coriolis components is constantly changing. This is true regardless of whether the motion is described in terms of stationary or rotating coordinates.
