The Kolmogorov-Arnold Theorem

Created on 2025-04-13T14:34:03-05:00

Any function which is continuous can be represented as the sum of single variable functions within a bounded region.

Multiplication of two variables can be broken in to a sum of smaller functions.

"Phi" functions are added to transform a variable prior to summing it: t1 = phi(x) + phi(y)

Summed phis become "terms" of the Kolmogorov function.

Each Kolmogorov term is wrapped in a separate "capital phi" function, as in Phi(t1) + Phi(t2)

f(x,y) =

Every Kolmogorov function has 2n+1 Kolmogorov terms, where n is the number of variables in the function.

Kolmogorov functions can be drawn out as a graph similar to a neural network; demonstrating that Kolmogorov approximates using only two layers.

Also some basic discourse on how this is important to KAN networks.