1.6. Interpolation (scipy.interpolate
)¶
Contents
There are several general interpolation facilities available in SciPy, for data in 1, 2, and higher dimensions:
- A class representing an interpolant (
interp1d
) in 1-D, offering several interpolation methods. - Convenience function
griddata()
offering a simple interface to interpolation in N dimensions (N = 1, 2, 3, 4, ...). Object-oriented interface for the underlying routines is also available. - Functions for 1- and 2-dimensional (smoothed) cubic-spline interpolation, based on the FORTRAN library FITPACK. There are both procedural and object-oriented interfaces for the FITPACK library.
- Interpolation using Radial Basis Functions.
1.6.1. 1-D interpolation (interp1d
)¶
The interp1d class in scipy.interpolate is a convenient method to create a function based on fixed data points which can be evaluated anywhere within the domain defined by the given data using linear interpolation. An instance of this class is created by passing the 1-d vectors comprising the data. The instance of this class defines a __call__ method and can therefore by treated like a function which interpolates between known data values to obtain unknown values (it also has a docstring for help). Behavior at the boundary can be specified at instantiation time. The following example demonstrates its use, for linear and cubic spline interpolation:
(Source code, png, hires.png, pdf)
1.6.2. Multivariate data interpolation (griddata()
)¶
Suppose you have multidimensional data, for instance for an underlying function f(x, y) you only know the values at points (x[i], y[i]) that do not form a regular grid.
(Source code, png, hires.png, pdf)
1.6.3. Spline interpolation¶
1.6.3.1. Spline interpolation in 1-d: Procedural (interpolate.splXXX)¶
Spline interpolation requires two essential steps: (1) a spline
representation of the curve is computed, and (2) the spline is
evaluated at the desired points. In order to find the spline
representation, there are two different ways to represent a curve and
obtain (smoothing) spline coefficients: directly and parametrically.
The direct method finds the spline representation of a curve in a two-
dimensional plane using the function splrep
. The
first two arguments are the only ones required, and these provide the
\(x\) and \(y\) components of the curve. The normal output is
a 3-tuple, \(\left(t,c,k\right)\) , containing the knot-points,
\(t\) , the coefficients \(c\) and the order \(k\) of the
spline. The default spline order is cubic, but this can be changed
with the input keyword, k.
For curves in \(N\) -dimensional space the function
splprep
allows defining the curve
parametrically. For this function only 1 input argument is
required. This input is a list of \(N\) -arrays representing the
curve in \(N\) -dimensional space. The length of each array is the
number of curve points, and each array provides one component of the
\(N\) -dimensional data point. The parameter variable is given
with the keyword argument, u, which defaults to an equally-spaced
monotonic sequence between \(0\) and \(1\) . The default
output consists of two objects: a 3-tuple, \(\left(t,c,k\right)\)
, containing the spline representation and the parameter variable
\(u.\)
The keyword argument, s , is used to specify the amount of smoothing to perform during the spline fit. The default value of \(s\) is \(s=m-\sqrt{2m}\) where \(m\) is the number of data-points being fit. Therefore, if no smoothing is desired a value of \(\mathbf{s}=0\) should be passed to the routines.
Once the spline representation of the data has been determined,
functions are available for evaluating the spline
(splev()
) and its derivatives
(splev()
, spalde()
) at any point
and the integral of the spline between any two points (
splint()
). In addition, for cubic splines ( \(k=3\)
) with 8 or more knots, the roots of the spline can be estimated (
sproot()
). These functions are demonstrated in the
example that follows.
(Source code, png, hires.png, pdf)
1.6.3.2. Spline interpolation in 1-d: Object-oriented (UnivariateSpline
)¶
The spline-fitting capabilities described above are also available via
an objected-oriented interface. The one dimensional splines are
objects of the UnivariateSpline class, and are created with the
\(x\) and \(y\) components of the curve provided as arguments
to the constructor. The class defines __call__
, allowing the object
to be called with the x-axis values at which the spline should be
evaluated, returning the interpolated y-values. This is shown in
the example below for the subclass InterpolatedUnivariateSpline.
The integral
,
derivatives
, and
roots
methods are also available
on UnivariateSpline objects, allowing definite integrals,
derivatives, and roots to be computed for the spline.
The UnivariateSpline class can also be used to smooth data by
providing a non-zero value of the smoothing parameter s, with the
same meaning as the s keyword of the splrep
function
described above. This results in a spline that has fewer knots
than the number of data points, and hence is no longer strictly
an interpolating spline, but rather a smoothing spline. If this
is not desired, the InterpolatedUnivariateSpline class is available.
It is a subclass of UnivariateSpline that always passes through all
points (equivalent to forcing the smoothing parameter to 0). This
class is demonstrated in the example below.
The LSQUnivariateSpline class is the other subclass of UnivariateSpline. It allows the user to specify the number and location of internal knots explicitly with the parameter t. This allows creation of customized splines with non-linear spacing, to interpolate in some domains and smooth in others, or change the character of the spline.
(Source code, png, hires.png, pdf)
1.6.3.3. Two-dimensional spline representation: Procedural (bisplrep()
)¶
For (smooth) spline-fitting to a two dimensional surface, the function
bisplrep()
is available. This function takes as required inputs
the 1-D arrays x, y, and z which represent points on the
surface \(z=f\left(x,y\right).\) The default output is a list
\(\left[tx,ty,c,kx,ky\right]\) whose entries represent
respectively, the components of the knot positions, the coefficients
of the spline, and the order of the spline in each coordinate. It is
convenient to hold this list in a single object, tck, so that it can
be passed easily to the function bisplev
. The
keyword, s , can be used to change the amount of smoothing performed
on the data while determining the appropriate spline. The default
value is \(s=m-\sqrt{2m}\) where \(m\) is the number of data
points in the x, y, and z vectors. As a result, if no smoothing is
desired, then \(s=0\) should be passed to
bisplrep
.
To evaluate the two-dimensional spline and it’s partial derivatives
(up to the order of the spline), the function
bisplev
is required. This function takes as the
first two arguments two 1-D arrays whose cross-product specifies
the domain over which to evaluate the spline. The third argument is
the tck list returned from bisplrep
. If desired,
the fourth and fifth arguments provide the orders of the partial
derivative in the \(x\) and \(y\) direction respectively.
It is important to note that two dimensional interpolation should not
be used to find the spline representation of images. The algorithm
used is not amenable to large numbers of input points. The signal
processing toolbox contains more appropriate algorithms for finding
the spline representation of an image. The two dimensional
interpolation commands are intended for use when interpolating a two
dimensional function as shown in the example that follows. This
example uses the mgrid
command in NumPy which is
useful for defining a “mesh-grid” in many dimensions. (See also the
ogrid
command if the full-mesh is not
needed). The number of output arguments and the number of dimensions
of each argument is determined by the number of indexing objects
passed in mgrid
.
(Source code, png, hires.png, pdf)
1.6.3.4. Two-dimensional spline representation: Object-oriented (BivariateSpline
)¶
The BivariateSpline
class is the 2-dimensional analog of the
UnivariateSpline
class. It and its subclasses implement
the FITPACK functions described above in an object oriented fashion,
allowing objects to be instantiated that can be called to compute
the spline value by passing in the two coordinates as the two
arguments.
1.6.4. Using radial basis functions for smoothing/interpolation¶
Radial basis functions can be used for smoothing/interpolating scattered data in n-dimensions, but should be used with caution for extrapolation outside of the observed data range.
1.6.4.1. 1-d Example¶
This example compares the usage of the Rbf and UnivariateSpline classes from the scipy.interpolate module.
(Source code, png, hires.png, pdf)
1.6.4.2. 2-d Example¶
This example shows how to interpolate scattered 2d data.
(Source code, png, hires.png, pdf)