Added to Favorites

Popular Searches

Definitions

Nearby Words

In physics, a covariant transformation is a rule (specified below), that describes how certain physical entities change under a change of coordinate system. In particular the term is used for vectors and tensors. The transformation that describes the new basis vectors in terms of the old basis, is defined as a covariant transformation. Conventionally, indices identifying the basis vectors are placed as lower indices and so are all entities that transform in the same way. The inverse of the covariant transformation is called the contravariant transformation. Vectors transform according to the covariant rule, but the components of a vector transform according to the contravariant rule. Conventionally, indices identifying the components of a vector are placed as upper indices and so are all indices of entities that transform in the same way. The summation over all indices of a product with the same lower and upper indices, are invariant to a transformation.

A vector itself is a geometrical quantity in principle independent (invariant) of the chosen coordinate system.
A vector v is given, say, in components v^{i} on a chosen basis e_{i}, related to a coordinate system x^{i} (the basis vectors are tangent vectors to the coordinate grid).
On another basis, say
$\{mathbf\; e\}\text{'}\_i$, related to a new coordinate system $\{x\text{'};\}^i$, the same vector v has different components
$v\text{'};^i$ and

- $\{mathbf\; v\}\; =\; sum\_i\; v^i\; \{mathbf\; e\}\_i\; =\; sum\_i\; \{v\text{'};\}^i\; \{mathbf\; e\}\text{'}\_i$

If, for example in a 2-dim Euclidean space, the new basis vectors are rotated to the right with respect to the old basis vectors, then it will appear in terms of the new system that the components of the vector look as if the vector was rotated to the left (see figure).

A vector v is described in a given coordinate grid (black lines) on a basis which are the tangent vectors to the (here rectangular) coordinate grid. The components are $v\_x$ and $v\_y$. In another coordinate system (dashed and red), the new basis are tangent vectors in the radial direction and perpendicular to it. They appear rotated clockwise with respect to the first basis. The covariant transformation here is a clockwise rotation. The components in red are indicated as $v\_r$ and $v\_phi$. If we view the new components with $v\_r$ pointed upwards, it appears as if the components are rotated to the left. The contravariant transformation is an anticlockwise rotation.

## Examples of covariant transformation

### The derivative of a function transforms covariantly

### Basis vectors transform covariantly

## Contravariant transformation

### Differential forms transform contravariantly

## Dual properties

## Co- and contravariant tensor components

### Without coordinates

### With coordinates

The explicit form of a covariant transformation is best introduced with the transformation properties of the derivative of a function. Consider a scalar function f (like the temperature in a space) defined on a set of points p, identifiable in a given coordinate system $x^i,;\; i=0,1,...$ (such a collection is called a manifold). If we adopt a new coordinates system $\{x\text{'},\}^j,\; j=0,1,...$ then for each i, the original coordinate $\{x\}^i$ can be expressed as function of the new system, so $\{x\}^i(\{x\text{'},\}^j),\; j=0,1,...$ One can express the derivative of f in new coordinates in terms of the old coordinates, using the chain rule of the derivative, as

- $$

- $$

A vector can be expressed in terms of basis vectors. For a certain coordinate system, we can choose the vectors tangent to the coordinate grid. This basis is called the coordinate basis.

To illustrate the transformation properties, consider again the set of points p, identifiable in a given coordinate system $x^i$ where $i=0,1,...$ (manifold). A scalar function f, that assigns a real number to every point p in this space, is a function of the coordinates $f;(x^0,x^1,...)$. A curve is a one-parameter collection of points c, say with curve parameter λ, c(λ). A tangent vector v to the curve is the derivative $df/dlambda$ along the curve with the derivative taken at the point p under consideration. Note that we can see the tangent vector v as an operator which can be applied to a function

- $\{mathbf\; v\}[f]\; stackrel\{mathrm\{def\}\}\{=\}\; frac\{df\}\{dlambda\}=\; frac\{d;;\}\{dlambda\}\; f(c(lambda))$

- $\{mathbf\; v\}[f]\; =\; sum\_i\; frac\{dx^i\}\{dlambda\}\; frac\{partial\; f\}\{partial\; x^i\}$

- $$

If we adopt a new coordinates system $\{x\text{'},\}^i,\; ;i=0,1,...$ then for each i, the old coordinate $\{x^i\}$ can be expressed as function of the new system, so $x^i(\{x\text{'},\}^j),\; j=0,1,...$ Let $\{mathbf\; e\}\text{'}\_i\; =\; \{partial;\}/\{partial\; \{x\text{'},\}^i\}$ be the basis, tangent vectors in this new coordinates system. We can express $\{mathbf\; e\}\_i$ in the new system by applying the chain rule on x. As a function of coordinates we find the following transformation

- $$

The components of a (tangent) vector transform in a different way, called contravariant transformation. Consider a tangent vector v and call its components $v^i$ on a basis $\{mathbf\; e\}\_i$. On another basis $\{mathbf\; e\}\text{'},\_i$ we call the components $\{v\text{'},\}^i$, so

- $$

- $v^i\; =\; frac\{dx^i\}\{dlambda;\}\; ;mbox\{\; and\; \};$

- $$

An example of a contravariant transformation is given by a differential form df. For f as a function of coordinates $x^i$, df can be expressed in terms of $dx^i$. The differentials dx transform according to the contravariant rule since

- $d\{x\text{'},\}^i\; =\; sum\_j\; frac\{partial\; \{x\text{'},\}^i\}\{partial\; \{x\}^j;;\}\; \{dx\}^j$

Entities that transform covariantly (like basis vectors) and the ones that transform contravariantly (like components of a vector and differential forms) are "almost the same" and yet they are different. They have "dual" properties. What is behind this, is mathematically known as the dual space that always goes together with a given linear vector space.

Take any linear vector space T and let $\{mathbf\; e\}\_i$ be a basis for this space. Consider a linear real function defined in this linear space. If v and w are two vectors in this vector space, then a real function f (with vectors as argument) is called a linear function if both (for any v, w and scalar α)

- $f(\{mathbf\; v\}+\{mathbf\; w\})\; =\; f(\{mathbf\; v\})\; +\; f(\{mathbf\; w\})$

- $f(alpha\; \{mathbf\; v\})\; =\; alpha\; f(\{mathbf\; v\})$

A simple example is the function which assigns the value of one of its components (the so called projection function). It has a vector as argument and assigns a real number, the value of a component.

All such linear functions together form a linear space by themselves. It is called the dual space of T. One can easily see that, indeed, the sum f+g is again a linear function for linear f and g by applying f+g to a sum v + w. And that the same holds for scalar multiplication αf.

We can define a basis, called the dual basis in this space in a natural way by taking the set of linear functions mentioned above: the projection functions. So those functions ω that produce the number 1 when they are applied to one of the basis vector $\{mathbf\; e\}\_i$. For example $\{omega\}^0$ gives a 1 on $\{mathbf\; e\}\_0$ and zero elsewhere. Applying this linear function $\{omega\}^0$ to a vector $\{mathbf\; v\}\; =v^i\; \{mathbf\; e\}\_i$, gives (using its linearity)

- $$

There are as many dual basis vectors $omega^i$ as there are basis vectors $\{mathbf\; e\}\_i$, so the dual space has the same dimension as the linear space itself. It is "almost the same space",except that the elements of the dual space (called dual vectors) transform contravariant and the elements of the tangent vector space transform covariant.

Sometimes an extra notation is introduced where the real value of a linear function σ on a tangent vector u is given as

- $sigma\; [\{mathbf\; u\}]\; :=\; langle\; sigma,\; \{mathbf\; u\}rangle$

With the aid of the section of dual space, a tensor of rank $(^r\_s)$ is simply defined as a real-valued multilinear function of r dual vectors and s vectors in a point p. So a tensor is defined in a point. It is a linear machine: feed it with vectors and dual vectors and it produces a real number. Since vectors (and dual vectors) are defined coordinate independently, this definition of a tensor is also free of coordinates and does not depend on the choice of a coordinate system. This is the main importance of tensors in physics.

The notation of a tensor is

- $T(sigma,\; ldots\; ,rho,\; \{mathbf\; u\},\; ldots,\; \{mathbf\; v\})$

Because a tensor depends linearly on its arguments, it is completely determined if one knows the values on a basis $omega^i\; ldots\; omega^j$ and $\{mathbf\; e\}\_k\; ldots\; \{mathbf\; e\}\_l$

- $T(omega^i,ldots,omega^j,\; \{mathbf\; e\}\_k\; ldots\; \{mathbf\; e\}\_l)\; =$

If we choose another basis (which are a linear combination of the original basis), we can use the linear properties of the tensor and we will find that the tensor components in the upper indices transform as dual vectors (so contravariant), whereas the lower indices will transform as the basis of tangent vectors and are thus covariant. For a tensor of rank 2, we can easily verify that

- $$

- $$

For a mixed co- and contravariant tensor of rank 2

- $$

Wikipedia, the free encyclopedia © 2001-2006 Wikipedia contributors (Disclaimer)

This article is licensed under the GNU Free Documentation License.

Last updated on Wednesday September 03, 2008 at 14:56:58 PDT (GMT -0700)

View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation

This article is licensed under the GNU Free Documentation License.

Last updated on Wednesday September 03, 2008 at 14:56:58 PDT (GMT -0700)

View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation

Copyright © 2015 Dictionary.com, LLC. All rights reserved.