
Polynomial Interpolation

Aaron Tian

25 May 2023

〜 ♣ 〜

Cursory introductions to AI/ML often reference the black box—that is, the notion of a function whose inner workings are uninterpretable—in describing machine learning models. They're not wrong; we truly don't understand how and why these models work—and not without reason. Real-world data is often multidimensional in nature and embeds complex relationships that can't be articulated using conventional statistics.

So there's a reason why we turn to the black box model. It's got clear advantages, of course; we never could've made so much progress in ML research if we hadn't embraced the "it works because it works" mentality. But there are also loads of drawbacks, especially when ML meets decision-making. When we turn to computers to make decisions, we suddenly hesitate to trust their judgement. That's because, when real-world consequences are involved, it's not enough to say "the computer made me do it" to justify our actions; clear, founded reasoning must follow. The black box isn't going to cut it for high-stakes usage.

So how do we begin to understand the models we've spent years building? It's a daunting task, but, as good scientific practice suggests, we could begin by simplifying the problem and solving it with a foundation of solid mathematical principles. Enter the interpolation problem, a "predecessor" of sorts to machine learning. First, some notation:

Let $X = \{x_0, x_1, \ldots, x_n\}$ and $Y = \{y_0, y_1, \ldots, y_n\}$ be sets such that $X, Y \subset \mathbb{R}$ and $x_i \neq x_j$ whenever $i \neq j$.

Now, suppose a function $f$ with $f(x_i) = y_i$ exists on the interval $[a, b] \supseteq X$.

The problem arises: how do we construct a tractable interpolant $p$ to approximate $f$ given the condition that $p(x_i) = y_i$ for all $i$?

A reasonable first approach leverages the fact that there exists a unique polynomial of degree at most $n$ for any collection of $n + 1$ points with distinct abscissae. Thus, we can express

$$f(x) \approx p(x) = a_0 + a_1 x + a_2 x^2 + \cdots + a_n x^n.$$
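One way to see this concretely: the interpolation conditions form a linear system $Va = y$, where $V$ is the Vandermonde matrix of the abscissae, and distinct abscissae make $V$ invertible. Below is a minimal sketch of this idea in NumPy; the sample data is made up purely for illustration.

import numpy as np

# Solve the Vandermonde system V @ a = y for the coefficients of the
# unique interpolating polynomial. The sample data is illustrative.
x = np.array([-1.0, 0.0, 0.5, 1.0])   # distinct abscissae
y = np.array([2.0, 1.0, 0.5, 3.0])    # arbitrary ordinates

V = np.vander(x, increasing=True)     # row i is [1, x_i, x_i**2, x_i**3]
a = np.linalg.solve(V, y)             # a[k] is the coefficient of x**k

# Sanity check: the polynomial passes through every sample point
assert np.allclose(np.polyval(a[::-1], x), y)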

Lagrange interpolation enables us to find explicit coefficients. Consider the following construction of a polynomial:

$$\ell_j(x) = \prod_{\substack{0 \le m \le n \\ m \neq j}} \frac{x - x_m}{x_j - x_m}$$

Then $f$ is interpolated by the linear combination

$$p(x) = \sum_{j=0}^{n} y_j \, \ell_j(x).$$
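For intuition, here's a direct, unoptimized transcription of these formulas into Python; it's a sketch of the definition rather than how one would compute this in practice.

import numpy as np

def lagrange_basis(x_nodes, j, x):
    # l_j(x): the product over all m != j of (x - x_m) / (x_j - x_m)
    terms = [(x - x_nodes[m]) / (x_nodes[j] - x_nodes[m])
             for m in range(len(x_nodes)) if m != j]
    return np.prod(terms, axis=0)

def lagrange_interpolant(x_nodes, y_nodes, x):
    # p(x): the linear combination sum_j y_j * l_j(x)
    return sum(y_nodes[j] * lagrange_basis(x_nodes, j, x)
               for j in range(len(x_nodes)))

Each basis polynomial $\ell_j$ equals 1 at $x_j$ and 0 at every other node, which is exactly why the combination reproduces each $y_j$.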

So how well does our interpolant perform? Let's put it to the test. I begin by sampling 16 evenly-spaced points on $[-1, 1]$ from the Legendre polynomial of degree 3 and computing the Lagrange polynomial $p$ that interpolates those points.

import numpy as np
import matplotlib.pyplot as plt
from scipy.special import legendre
from scipy.interpolate import lagrange

# Sample 16 evenly-spaced points on [-1, 1] from the degree-3 Legendre polynomial
f = legendre(3)
x = np.linspace(-1, 1, 16)
y = f(x)

# Construct the unique interpolating polynomial through the samples
p = lagrange(x, y)

# Left panel: the samples alone; right panel: the interpolant overlaid
fig, ax = plt.subplots(1, 2, figsize=[10, 3])
for i in (0, 1):
    ax[i].set_xlim([-1.25, 1.25])
    ax[i].set_ylim([-1.25, 1.25])
    ax[i].scatter(x, y)
d = np.linspace(-1.25, 1.25, 500)
ax[1].plot(d, p(d), 'g')
plt.show()
[Figure: interpolating the Legendre polynomial; sample points (left), interpolant overlaid (right)]

Here, the Lagrange interpolation works like a charm. But that's to be expected; we're using a polynomial to interpolate a polynomial. What if we test our method with a harder task, like interpolating a transcendental function?

import numpy as np
import matplotlib.pyplot as plt
from scipy.interpolate import lagrange

# Sample 16 evenly-spaced points on [-1, 1] from sech(5x)
f = lambda x: 1 / np.cosh(5 * x)
x = np.linspace(-1, 1, 16)
y = f(x)

# Construct the interpolant exactly as before
p = lagrange(x, y)

# Left panel: the samples alone; right panel: the interpolant overlaid
fig, ax = plt.subplots(1, 2, figsize=[10, 3])
for i in (0, 1):
    ax[i].set_xlim([-1.25, 1.25])
    ax[i].set_ylim([-0.25, 1.25])
    ax[i].scatter(x, y)
d = np.linspace(-1.25, 1.25, 500)
ax[1].plot(d, p(d), 'g')
plt.show()
[Figure: interpolating sech(5x); sample points (left), oscillating interpolant overlaid (right)]

Here, I interpolate the function $f(x) = \operatorname{sech}(5x)$ using 16 evenly-spaced points, as before. Notice that, unlike in the first example, $p$ diverges aggressively from $f$ in an oscillatory fashion near the ends of the interpolation interval. This behavior is known as Runge's phenomenon. The oscillation is encountered often in naïve Lagrange interpolation and is exacerbated by the use of equally-spaced points, although there are a number of numerical methods that can mitigate the problem. Unfortunately, these topics are beyond the scope of a 1 a.m. post.
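Still, for the curious, here's a hedged sketch of one standard mitigation: swapping the evenly-spaced abscissae for Chebyshev nodes, which cluster near $\pm 1$ and shrink the node polynomial $\prod_i (x - x_i)$ that drives the interpolation error. This reuses the setup above; the exact error you'll see depends on your SciPy version's conditioning warnings.

import numpy as np
from scipy.interpolate import lagrange

f = lambda x: 1 / np.cosh(5 * x)

# Chebyshev nodes of the first kind on [-1, 1]
k = np.arange(16)
x_cheb = np.cos((2 * k + 1) * np.pi / 32)
p_cheb = lagrange(x_cheb, f(x_cheb))

# The maximum error on [-1, 1] should now be modest rather than explosive
d = np.linspace(-1, 1, 500)
print(np.abs(p_cheb(d) - f(d)).max())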

The key takeaway is that, in numerical analysis as in machine learning, unexpected behavior can arise as a natural consequence of the method itself. By understanding the mathematical basis for these errors, we can develop robust mitigation techniques.

- Aaron