nLab
differentiation

See also differentiable function.

Context

Analysis

Differential geometry

synthetic differential geometry

Introductions

from point-set topology to differentiable manifolds

geometry of physics: coordinate systems, smooth spaces, manifolds, smooth homotopy types, supergeometry

Differentials

V-manifolds

smooth space

Tangency

The magic algebraic facts

Theorems

Axiomatics

cohesion

tangent cohesion

differential cohesion

graded differential cohesion

id id fermionic bosonic bosonic Rh rheonomic reduced infinitesimal infinitesimal & étale cohesive ʃ discrete discrete continuous * \array{ && id &\dashv& id \\ && \vee && \vee \\ &\stackrel{fermionic}{}& \rightrightarrows &\dashv& \rightsquigarrow & \stackrel{bosonic}{} \\ && \bot && \bot \\ &\stackrel{bosonic}{} & \rightsquigarrow &\dashv& Rh & \stackrel{rheonomic}{} \\ && \vee && \vee \\ &\stackrel{reduced}{} & \Re &\dashv& \Im & \stackrel{infinitesimal}{} \\ && \bot && \bot \\ &\stackrel{infinitesimal}{}& \Im &\dashv& \& & \stackrel{\text{étale}}{} \\ && \vee && \vee \\ &\stackrel{cohesive}{}& ʃ &\dashv& \flat & \stackrel{discrete}{} \\ && \bot && \bot \\ &\stackrel{discrete}{}& \flat &\dashv& \sharp & \stackrel{continuous}{} \\ && \vee && \vee \\ && \emptyset &\dashv& \ast }

Models

Lie theory, ∞-Lie theory

differential equations, variational calculus

Chern-Weil theory, ∞-Chern-Weil theory

Cartan geometry (super, higher)

Contents

Idea

Differentiation is the process that assigns to a function f:XYf : X \to Y its derivative, sometimes denoted dfd f. The derivative is a function that, roughly speaking, assigns to each point xXx \in X the linear transformation df xd f_x that maps infinitesimal differences yxy - x (for points yy infinitesimally close to xx) to infinitesimal differences f(y)f(x)f(y) - f(x).

Definitions

Classical (analytic) definitions

Classically, if f: nf: \mathbb{R}^n \to \mathbb{R} is a function, the derivative df xd f_x at a point x nx \in \mathbb{R}^n is defined by a limiting process: it is the unique (when it exists) linear transformation L: nL: \mathbb{R}^n \to \mathbb{R} such that

lim h0f(x+h)f(x)Lh|h|=0\lim_{h \to 0} \frac{f(x + h) - f(x) - L h}{{|h|}} = 0

or in other words such that

f(y)=f(x)+L(yx)+g(y)f(y) = f(x) + L(y - x) + g(y)

where g(y)g(y) is o(|yx|)o({|y - x|}) (see o-notation?). We say in that case that ff is differentiable at xx, with derivative df x=Ld f_x = L. When n=1n = 1, such a linear transformation L:L: \mathbb{R} \to \mathbb{R} is multiplication by a scalar λ\lambda \in \mathbb{R}. This is often denoted f(x)f'(x), so that the derivative may be regarded as another function f:f': \mathbb{R} \to \mathbb{R}.

We say f: nf: \mathbb{R}^n \to \mathbb{R} is differentiable if it is differentiable at every point xx of its domain. In that case, we get a global derivative function n× n\mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R} which takes a pair (x,h)(x, h) to df x(h)d f_x(h). We can then ask whether this function is itself differentiable, i.e., whether the differentiation process can be iterated. The nn-th derivative is the result of differentiating nn times (the 00-th derivative of ff is just ff). We say that f: nf: \mathbb{R}^n \to \mathbb{R} is infinitely differentiable (or more briefly, that ff is smooth) if it has an nn-th derivative for each natural number nn.

Notice that a differentiable function is necessarily continuous (indeed, it is locally a Lipschitz function). One denotes the class of functions having a continuous1 n thn^{th} derivative by C n( n)C^n(\mathbb{R}^n). Thus continuous functions f: nf: \mathbb{R}^n \to \mathbb{R} form a filtration

C n( k)C 1( k)C 0( k)\ldots \subset C^n(\mathbb{R}^k) \subset \ldots \subset C^1(\mathbb{R}^k) \subset C^0(\mathbb{R}^k)

and the intersection C ( k) n0C n( k)C^\infty(\mathbb{R}^k) \coloneqq \bigcap_{n \geq 0} C^n(\mathbb{R}^k) coincides with the class of smooth functions.

Evidently differentiation gives a function D:C n+1( k)C n( k× k)D: C^{n+1}(\mathbb{R}^k) \to C^n(\mathbb{R}^k \times \mathbb{R}^k). The convenient setting for developing the differential calculus is to go directly to the smooth case, where we have D:C ( k)C ( k× k)D: C^\infty(\mathbb{R}^k) \to C^\infty(\mathbb{R}^k \times \mathbb{R}^k) without having to keep track of the degree of differentiability.

This development carries over more generally to smooth manifolds XX, which are locally (smoothly) isomorphic to Euclidean spaces k\mathbb{R}^k.

Differentiation as a functor

Differentiation in more conceptual terms is a functor d:DiffDiffd : Diff \to Diff on the category Diff of smooth manifolds that sends

  • a manifold XX to its tangent bundle TXT X; recall that the points of TXT X are ordered pairs (x,v)(x, v) where xXx \in X and vv is a tangent vector at xx, i.e., an (augmented) derivation v:C (X)v: C^\infty(X) \to \mathbb{R} on the algebra of smooth functions, augmented by evaluation ev x:C (X)ev_x: C^\infty(X) \to \mathbb{R} at xx;

  • a smooth function f:XYf : X \to Y to its differential df:TXTYd f : T X \to T Y:

    if γ:[1,1]X\gamma : [-1,1] \to X is a path in XX that represents a vector vT xXv \in T_x X, then (df)(v)T f(x)Y(d f)(v) \in T_{f(x)} Y is the vector represented by the path [1,1]γXfY[-1,1] \stackrel{\gamma}{\to} X \stackrel{f}{\to} Y.

Equivalently, a smooth map f:XYf: X \to Y induces an algebra map f *=f:C (Y)C (X)f^\ast = - \circ f: C^\infty(Y) \to C^\infty(X), and hence by composition sends a derivation v:C (X)v: C^\infty(X) \to \mathbb{R} to a derivation vf *:C (Y)v \circ f^\ast: C^\infty(Y) \to \mathbb{R} augmented by ev f(x):C (Y)ev_{f(x)}: C^\infty(Y) \to \mathbb{R}). Concretely: vf *(ϕ)=v(ϕf)v \circ f^\ast(\phi) = v(\phi \circ f) for ϕC (Y)\phi \in C^\infty(Y). Then the differential df:TXTYd f: T X \to T Y is defined by df(x,v)=(f(x),vf *)d f(x, v) = (f(x), v \circ f^\ast).

According to either of these descriptions, differentiation is manifestly functorial. In the English literature, this fact is known as the “chain rule”. In more traditional form:

d(fg) x=df g(x)dg x.d(f \circ g)_x = d f_{g(x)} \circ d g_x.

In a context HH of synthetic differential geometry (SDG) the functor dd is simply the internal hom out of the infinitesimal interval DD

d=[D,]:HH. d = [D, -] : H \to H \,.

where DD \hookrightarrow \mathbb{R} is the smooth locus defined by x 2=0x^2 = 0. (Classically, i.e., in the topos SetSet, this locus consists of just a single point2, but in toposes that model SDG it should be regarded as an infinitesimal neighborhood about a point.) This description as internally representable functor underlies the following classical fact:

Theorem

Differentiation is a product-preserving functor.

For example, the product of two smooth functions f,g:Xf, g: X \to \mathbb{R} is a composition

XΔX×Xf×g×multX \stackrel{\Delta}{\to} X \times X \stackrel{f \times g}{\to} \mathbb{R} \times \mathbb{R} \stackrel{mult}{\to} \mathbb{R}

and the product rule for differentiation derives by applying the functor dd which results in the composite

TXΔTX×TXdf×dgT×TdmultTT X \stackrel{\Delta}{\to} T X \times T X \stackrel{d f \times d g}{\to} T \mathbb{R} \times T \mathbb{R} \stackrel{d mult}{\to} T \mathbb{R}

which reduces to computing dmultd mult.

Exposition of differentiation via infinitesimals

We attempt to give an exposition of differentiation not via taking limits in the sense of analysis, but rather, by working with infinitesimal spaces as in synthetic differential geometry.

Setup

For that purpose we regard objects such as the real line \mathbb{R} and more generally any smooth manifold as being objects of a well adapted smooth topos of smooth spaces. But in fact much less is required for most of the discussion. For the basic theory, it is sufficient to work in the canonical site of such a topos, namely, the category SmoothLocSmoothLoc of smooth loci. This site is defined via a categorial equivalence with the opposite category of the category of smooth algebras:

C :SmoothLocSmoothAlg op C^\infty: SmoothLoc \stackrel{\sim}{\longrightarrow} SmoothAlg^{op}

Apart from the real line \mathbb{R}, the other object involved in our discussion of the basics of differentiation is the first order infinitesimal line segment DD (an infinitesimally thickened point), a subobject of \mathbb{R}. This subobject collects those points of \mathbb{R} that square to zero under the canonical multiplicative structure of \mathbb{R}

D={ϵ|ϵ 2=0}. D = \{\epsilon \in \mathbb{R} | \epsilon^2 = 0\} \,.

Although this is just {0}\{0\} when interpreted in the usual context of Set, here we are to interpret this in the ambient smooth topos, or equivalently just in our site SmoothLocSmoothLoc for it. That is to say, DD satisfies a pullback square in SmoothLocSmoothLoc:

D * 0 () 2 \array{ D &\to& * \\ \downarrow && \downarrow^{\mathrlap{0}} \\ \mathbb{R} &\stackrel{(-)^2}{\to}& \mathbb{R} }

Dually, the categorial equivalence C C^\infty from SmoothLocSmoothLoc to the opposite category of smooth algebras carries the above pullback square to a pushout square of smooth algebras:

C (D) C (*) ff(0) C () xx 2 C () \array{ C^\infty(D) &\leftarrow& C^\infty(*) \simeq \mathbb{R} \\ \uparrow && \uparrow^{\mathrlap{f \mapsto f(0)}} \\ C^\infty(\mathbb{R}) &\stackrel{x \mapsto x^2}{\leftarrow}& C^\infty(\mathbb{R}) }

Here along the bottom and on the right are the ordinary smooth algebras of smooth functions on the smooth manifolds 1=\mathbb{R}^1 = \mathbb{R} and 0=*\mathbb{R}^0 = *, whereas C (D)C^\infty(D) is new notation, suggestive and justified, for whatever the algebra of functions on the object DD is, where DD is defined by the fact that it has this algebra of functions. The bottom morphism sends a smooth function f:f : \mathbb{R} \to \mathbb{R} to the function f(() 2):f((-)^2) : \mathbb{R} \to \mathbb{R}. If we write x=Id:x = Id : \mathbb{R} \to \mathbb{R} for the canonical coordinate function, then this sends xx to the function x 2x^2.

Notice that the commutativity of this pushout square means that the image in C (D)C^\infty(D) of the squared coordinate function x 2C ()x^2 \in C^\infty(\mathbb{R}) is zero. This is because the former is the image under the bottom morphism of the coordinate function xx and that maps under the right vertical morphism to 0C (*)0 \in C^\infty(*) \simeq \mathbb{R}.

In fact one can show that the pushout C (D)C^\infty(D) is generated by the coordinate function xC ())x \in C^\infty(\mathbb{R})) subject to the relation x 2=0x^2 = 0: it is the ring of dual numbers

C (D) C ()/(x 2) [x]/(x 2). \begin{aligned} C^\infty(D) & \simeq C^\infty(\mathbb{R})/(x^2) \\ & \simeq \mathbb{R}[x]/(x^2) \end{aligned} \,.

(Here in the second step we have used Hadamard's lemma.)

This computation of the smooth algebra C (D)C^\infty(D) of smooth functions on DD expresses the intuitive idea that DD is such a small neighbourhood of 00 in \mathbb{R} that while the canonical coordinate function of \mathbb{R} restricted to DD is non-vanishing (it would vanish only when restricted entirely to the point 0) it is “so small that its square is always 0”. Readers who like this intuitive statement should keep it in mind, as it is accurate and useful, whereas readers who do not like it could stick with the above precise definition, which unwinds this slogan in the internal logic of the ambient smooth topos that we keep alluding to.

In order to remind us about the infinitesimal nature of the generator xx in C (D)C^\infty(D) we shall usually call this “dxdx” and write

C (D)=[dx]/((dx) 2)dx. C^\infty(D) = \mathbb{R}[dx]/((dx)^2) \simeq \mathbb{R} \oplus dx \mathbb{R} \,.

Functions and differential 1-forms

We discussion of differential 1-forms are functions on the space of infinitesimal paths. For more comprehensive such discussion see (Stel 13).

Define the smooth locus (Δ inf 1)\mathbb{R}^{(\Delta^1_{inf})} as the product ×D\mathbb{R} \times D of the real line with the first order infinitesimal line segment. We think of this as the space of infinitesimal paths in \mathbb{R} (see Spaces of infinitesimal k-simplicies ). We do so by thinking of a generalized element of (Δ inf 1)\mathbb{R}^{(\Delta^1_{inf})} in the category SmoothLocSmoothLoc, namely a pair (x,ϵ)×D(x, \epsilon) \in \mathbb{R} \times D, as the linear path in \mathbb{R} stretching from xx to x+ϵx + \epsilon (both regarded as generalized elements of \mathbb{R}).

We say a smooth differential 1-form on \mathbb{R} is a function (meaning: a morphism in SmoothLocSmoothLoc)

ω:X (Δ inf 1) \omega : X^{(\Delta^1_{inf})} \to \mathbb{R}

with the property that its restriction to \mathbb{R} vanishes, hence so that the composite morphism

0:X (Δ inf 1)ω 0 : \mathbb{R} \hookrightarrow X^{(\Delta^1_{inf})} \stackrel{\omega}{\to} \mathbb{R}

vanishes, where the first morphism is the inclusion of \mathbb{R} as the constant infinitesimal paths in \mathbb{R}, hence (Id,0):×D(Id, 0) : \mathbb{R} \to \mathbb{R} \times D.

We call the coproduct of smooth algebras their tensor product (classically it is the smoothly completed algebraic tensor product). For each smooth loci XX and YY, we have

C (X×Y)C (X)C (Y). C^\infty(X \times Y) \simeq C^\infty(X) \otimes C^\infty(Y) \,.

Using this property, we can dualize the above definition of a diferential 1-form. Namely, in the opposite category of smooth algebras, a 1-form on the line is a morphism

C ()[dx]/((dx) 2)C ():ω *, C^\infty(\mathbb{R}) \otimes \mathbb{R}[dx]/((dx)^2) \leftarrow C^\infty(\mathbb{R}) : \omega^* \,,

hence an assignment xg(x)+h(x)dx x \mapsto g(x) + h(x) dx such that g(x)=0g(x) = 0. In other words, it is any element in C (R)[dx]/((dx) 2)C^\infty(\mathrm{R}) \otimes \mathbb{R}[dx]/((dx)^2) that is proportional to the generator dxdx. We write ω=hdx \omega = h dx with hC ()h \in C^\infty(\mathbb{R}).

Derivatives as infinitesimal differences

We can now “synthetically” describe differentiation as the process of taking infinitesimal differences of functions.

Consider the two morphisms of smooth loci

+: (Δ inf 1)=×D + : \mathbb{R}^{(\Delta^1_{inf})} = \mathbb{R} \times D \to \mathbb{R}

and

p 1: (Δ inf 1)=×D, p_1 : \mathbb{R}^{(\Delta^1_{inf})} = \mathbb{R} \times D \to \mathbb{R} \,,

where the first is the restriction of the canonical addition operation +:×+ : \mathbb{R} \times \mathbb{R} \to \mathbb{R} of real numbers, and where the second is the projection on the first factor out of the product.

Given an element fC ()f \in C^\infty(\mathbb{R}), hence a smooth function f:f : \mathbb{R} \to \mathbb{R}, we can then form two functions on infinitesimal paths by precomposing with these two morphisms:

first we get

f:X (Δ inf 1)p 1f. f : X^{(\Delta^1_{inf})} \stackrel{p_1}{\to} \mathbb{R} \stackrel{f}{\to} \mathbb{R} \,.

This is the function on infinitesimal paths in the line that only depends on the starting point of a path, not on the length of the path, and which sends that starting point to the value of ff on that point. Therefore by slight abuse of notation we just keep writing “ff” for this function.

The second combination is

f˜:X (Δ inf 1)+f. \tilde f : X^{(\Delta^1_{inf})} \stackrel{+}{\to} \mathbb{R} \stackrel{f}{\to} \mathbb{R} \,.

This is the function on infinitesimal paths that sends any path (x)(x+ϵ)(x) \to (x + \epsilon) to the value of the function on the endpoint (x+ϵ)(x + \epsilon) of the paths: in terms of these generalized elements f˜\tilde f is the assignment

f˜:(x,ϵ)f(x+ϵ). \tilde f : (x , \epsilon) \mapsto f(x + \epsilon) \,.

For instance consider the function f:xx 2f : x \mapsto x^2. Then

f˜:(x,ϵ) (x+ϵ) 2 =x 2+2xϵ+ϵ 2 =x 2+2xϵ. \begin{aligned} \tilde f : (x, \epsilon) &\mapsto (x + \epsilon)^2 \\ & = x^2 + 2 x \epsilon + \epsilon^2 \\ & = x^2 + 2 x \epsilon \end{aligned} \,.

This is evident, if maybe still somewhat mysterious, in the internal language of the ambient smooth topos that we keep alluding to, but to make it entirely explicit and concrete notice that the functions here are dually morphisms of smooth algebras

C ()(dx)C ():(f˜) * C^\infty(\mathbb{R}) \otimes (\mathbb{R} \oplus dx \mathbb{R}) \leftarrow C^\infty(\mathbb{R}) : (\tilde f)^*

and the equation

(x+dx) 2=x 2+2xdx (x + dx)^2 = x^2 + 2 x dx

holds in C ()(dx)C^\infty(\mathbb{R}) \otimes (\mathbb{R} \oplus dx \mathbb{R}) by the above discussion, where now xC ()x \in C^\infty(\mathbb{R}) and dxC (D)=dxdx \in C^\infty(D) = \mathbb{R} \oplus dx \mathbb{R} are the two canonical generators.

But notice that the expression x 2+2xdxx^2 + 2 x dx is not a dfifferential 1-form by the above definition, equivalently the function f˜\tilde f does not vanish when restricted to constant infinitesimal paths.

To get such, we form the derivative or infinitesimal difference, the function

f˜f:X (Δ inf 1) \tilde f - f : X^{(\Delta^1_{inf})} \to \mathbb{R}

that acts on generalized elements of the space of infinitesimal paths in the line as

(f˜f):(x,ϵ)f(x+ϵ)f(x). (\tilde f - f) : (x, \epsilon) \mapsto f(x + \epsilon) - f(x) \,.

By construction on the right this now is something linear in the infinitesimal length ϵ\epsilon of the path. Write fC ()f' \in C^\infty(\mathbb{R}) for the coefficient:

f(x+ϵ)f(x)=ϵf(x). f(x + \epsilon) - f(x) = \epsilon f'(x) \,.

This function ff' is the differential of ff. It is the function that gives the change of the function ff along a “unit infinitesimal path”.

Dually, we have a morphism

C ()(dx)C ():(f˜f) * C^\infty(\mathbb{R})\otimes (\mathbb{R} \oplus dx \mathbb{R}) \leftarrow C^\infty(\mathbb{R}) : (\tilde f - f)^*

that sends

xf(x+dx)f(x)=f(x)dx. x \mapsto f(x + dx) - f(x) = f'(x) dx \,.

The 1-form on the right is the de Rham differential of ff, usually written

(df)(x)=f(x)dxC (×D). (d f)(x) = f'(x) dx \in C^\infty(\mathbb{R} \times D) \,.

As a map of tangent bundles

Equivalently we may rephrase this differential dfd f as a morphism of tangent bundles df:TTd f \colon T \mathbb{R} \longrightarrow T \mathbb{R} as follows.

First, by the Kock-Lawvere axiom of SDG we have the standard fact that T:= D×T \mathbb{R} := \mathbb{R}^D \simeq \mathbb{R} \times \mathbb{R}. Therefore given a function

×D \mathbb{R} \times D \longrightarrow \mathbb{R}

this is by the internal hom-adjunction equivalently a function of the form

D=T. \mathbb{R} \longrightarrow \mathbb{R}^D = T \mathbb{R} \,.

The condition on the original function makes this adjunct be a section of the tangent bundle of \mathbb{R}. This section is xf(x)x \mapsto f'(x), hence is the derivative of ff regarded as a tangent vector on \mathbb{R}. Moreover, since \mathbb{R} is a microlinear space, this induces by rescaling a function

T=×T. T \mathbb{R} = \mathbb{R} \times \mathbb{R} \longrightarrow T \mathbb{R} \,.

This is the differential of ff regarded as a map of tangent bundles.

References

Discussion of this history of the concept, with emphasis on its roots all the way back in Zeno's paradoxes of motion is in

  • Carl Benjamin Boyer, The history of the Calculus and its conceptual development, Dover 1949

Discussion of differential forms as functions on infinitesimal simplices is in

Examples of sequences of local structures

geometrypointfirst order infinitesimal\subsetformal = arbitrary order infinitesimal\subsetlocal = stalkwise\subsetfinite
\leftarrow differentiationintegration \to
smooth functionsderivativeTaylor seriesgermsmooth function
curve (path)tangent vectorjetgerm of curvecurve
smooth spaceinfinitesimal neighbourhoodformal neighbourhoodgerm of a spaceopen neighbourhood
function algebrasquare-0 ring extensionnilpotent ring extension/formal completionring extension
arithmetic geometry𝔽 p\mathbb{F}_p finite field p\mathbb{Z}_p p-adic integers (p)\mathbb{Z}_{(p)} localization at (p)\mathbb{Z} integers
Lie theoryLie algebraformal grouplocal Lie groupLie group
symplectic geometryPoisson manifoldformal deformation quantizationlocal strict deformation quantizationstrict deformation quantization

  1. Without the continuity assumption on derivatives, the class of (once-)differentiable functions exhibits non-trivial descriptive set-theoretic complexity, as may be surmised by this paper by Kechris and Woodin. Thus the continuity assumption eliminates troublesome pathologies.

  2. According to John Bell, the fact that classically this is merely a point bears comparison to Bishop Berkeley’s objections to the differential calculus he put forth in The Analyst.

Revised on June 7, 2017 10:37:15 by Urs Schreiber (185.46.137.5)