Justus Polzin

Multivariate Polynomials mod m

I was asked about this the other day and I only talked about the univariate case in Part 3 of the MBA series. I will present the main result first and then bore you with the proof.

We will consider the polynomials to be over $\mathbb{Z}$ instead of $\mathbb{Z}/m\mathbb{Z}$. It makes some things slightly cleaner. We will write $x^{\underline{d}} = x(x-1)\cdots(x-d+1)$, which is also known as the falling factorial. Let $p \in \mathbb{Z}[x_1, \dots, x_n]$; we will write $\bar{p}\colon (\mathbb{Z}/m\mathbb{Z})^n \to \mathbb{Z}/m\mathbb{Z}$ for the induced function mod $m$.

Let $I$ be the ideal of polynomials in $\mathbb{Z}[x_1, \dots, x_n]$ that evaluate to zero (mod $m$) everywhere, i.e.

$$I = \{ p \in \mathbb{Z}[x_1, \dots, x_n] : \bar{p}(a) = 0 \text{ for all } a \in (\mathbb{Z}/m\mathbb{Z})^n \}$$

Generators of $I$
Define

$$p_{d_1, \dots, d_n} = c_{d_1, \dots, d_n} \cdot x_1^{\underline{d_1}} \cdots x_n^{\underline{d_n}}$$

Then $I = \left(p_{d_1, \dots, d_n} : d_1, \dots, d_n \geq 0\right)$ (i.e. $I$ is generated by the $p_{d_1, \dots, d_n}$s), where $c_{d_1, \dots, d_n}$ is the smallest positive integer such that $m$ divides $c_{d_1, \dots, d_n} \cdot d_1! \cdots d_n!$.

Note that this generating set is not at all minimal. We will discuss this at the end.
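To make the statement concrete, here is a small Python sketch (my own, not from the original derivation) that computes the constants $c_{d_1, \dots, d_n} = m / \gcd(m, d_1! \cdots d_n!)$ and brute-force checks that the generators really do vanish everywhere mod $m$; the modulus and degree bounds are arbitrary choices.

```python
from math import gcd, factorial, prod
from itertools import product

def falling(x, d):
    """Falling factorial x^(d) = x * (x - 1) * ... * (x - d + 1)."""
    r = 1
    for k in range(d):
        r *= x - k
    return r

def c(m, ds):
    """Smallest positive c such that m divides c * d_1! * ... * d_n!."""
    return m // gcd(m, prod(factorial(d) for d in ds))

def generator(m, ds):
    """The generator p_(d_1, ..., d_n), as a function on integer points."""
    return lambda xs: c(m, ds) * prod(falling(x, d) for x, d in zip(xs, ds))

# Brute-force check: every generator evaluates to 0 mod m at every point.
m = 12
for ds in product(range(5), repeat=2):
    p = generator(m, ds)
    assert all(p(xs) % m == 0 for xs in product(range(m), repeat=2))
```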

Polynomial Bases

(Multivariate) polynomials over a commutative ring $R$ form a free $R$-module. If you don't know what that means, then just think of it as a vector space. So in particular, they have a basis. Let's consider univariate polynomials in $\mathbb{Z}[x]$ for the moment. The obvious basis is the monomial basis: $1, x, x^2, x^3, \dots$ But for our purposes, it turns out that another basis is more useful: $1, x^{\underline{1}}, x^{\underline{2}}, x^{\underline{3}}, \dots$ These are the falling factorials from the beginning. I will call it the Newton basis. (I don't think this basis has an official name, but it is closely related to Newton's series, so I just made it up.) It is easy to see that this is a basis because the polynomials are monic and the degree increases by 1 from one to the next, so that we can iteratively do polynomial division with basis polynomials of lower and lower degree. That this is a basis means that every polynomial in $\mathbb{Z}[x]$ can be uniquely written as $\sum_i a_i x^{\underline{i}}$ for $a_i \in \mathbb{Z}$ that can be calculated with polynomial division.
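The iterative polynomial division can be sketched in a few lines of Python (my own illustration; coefficient lists are lowest-degree first):

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest degree first)."""
    r = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            r[i + j] += a * b
    return r

def falling_poly(d):
    """Monomial-basis coefficients of x^(d) = x * (x - 1) * ... * (x - d + 1)."""
    p = [1]
    for k in range(d):
        p = poly_mul(p, [-k, 1])  # multiply by (x - k)
    return p

def to_newton(p):
    """Newton-basis coefficients of p (given by its monomial coefficients)."""
    p = list(p)
    a = [0] * len(p)
    # Divide by falling factorials of decreasing degree; since x^(d) is
    # monic of degree d, each quotient is just the current leading coefficient.
    for d in range(len(p) - 1, -1, -1):
        a[d] = p[d]
        f = falling_poly(d)
        for i in range(d + 1):
            p[i] -= a[d] * f[i]
    return a

# x^2 = x^(2) + x^(1), so the Newton coefficients of x^2 are [0, 1, 1].
assert to_newton([0, 0, 1]) == [0, 1, 1]
```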

Now, consider bivariate polynomials $\mathbb{Z}[x, y]$.

Basis of bivariate polynomials
Let $(p_i)_i$ be a basis for $\mathbb{Z}[x]$ and $(q_j)_j$ be a basis for $\mathbb{Z}[y]$. Then $(p_i q_j)_{i,j}$ is a basis for $\mathbb{Z}[x, y]$.
Proof: It is well known that $\mathbb{Z}[x, y] = \mathbb{Z}[y][x]$, so we can write $p \in \mathbb{Z}[x, y]$ as $p = \sum_i a_i(y)\, p_i(x)$ where each $a_i(y) \in \mathbb{Z}[y]$. (Although intuitively clear, formally this step is not quite so simple, since we only know that the $p_i$ are a basis when the polynomials are over $\mathbb{Z}$, not $\mathbb{Z}[y]$. You can treat $y$ as an actual variable, then this works, e.g. in $\mathbb{Z}[y][x]$ you can write $p$ in the standard basis $1, x, x^2, \dots$, where $y$ is a completely independent variable.) Now, we can write each $a_i(y) = \sum_j b_{ij}\, q_j(y)$. So overall, $p = \sum_{i,j} b_{ij}\, p_i(x)\, q_j(y)$. So $(p_i q_j)_{i,j}$ is a basis.

We can inductively extend this to multivariate polynomials to get

Basis of multivariate polynomials
Let $(p^{(k)}_i)_i$ be bases for $\mathbb{Z}[x_k]$, $k = 1, \dots, n$. Then $\left(p^{(1)}_{i_1} \cdots p^{(n)}_{i_n}\right)_{i_1, \dots, i_n}$ is a basis for $\mathbb{Z}[x_1, \dots, x_n]$.

Thus the Newton basis for $\mathbb{Z}[x_1, \dots, x_n]$ is $\left(x_1^{\underline{i_1}} \cdots x_n^{\underline{i_n}}\right)_{i_1, \dots, i_n \geq 0}$.

Univariate

Generators of $I$

$$I = \left(c_d \cdot x^{\underline{d}} : d \geq 0\right)$$

where $c_d$ is the smallest positive integer such that $m$ divides $c_d \cdot d!$.

This result was already stated in Part 3 of the MBA series, but I didn't prove it and instead referred to Singmaster. We will prove it ourselves here, because (I think) Singmaster's proof is more difficult to generalize to multiple variables.

The idea is to first show that the $p_d = c_d \cdot x^{\underline{d}}$ are zero everywhere, and then that every polynomial that is zero everywhere is generated by the $p_d$.

The $p_d$ are zero everywhere

Let $a \in \mathbb{Z}$. We will show that the $c_d$ from the definition is actually optimal, because we will need it later. Notice that $a^{\underline{d}} = a(a-1)\cdots(a-d+1)$ is the product of $d$ consecutive integers. (If you think in $\mathbb{Z}/m\mathbb{Z}$ then they could wrap around, in which case 0 is in the product, so we are done.) It is a well known fact that the product of $d$ consecutive integers is divisible by $d!$, which can be proven in one line: $\binom{a}{d} = a^{\underline{d}} / d!$ is an integer. In particular $a^{\underline{d}} = k \cdot d!$ for some $k \in \mathbb{Z}$, so $c_d \cdot a^{\underline{d}} = c_d \cdot k \cdot d!$. The right-hand side being zero in $\mathbb{Z}/m\mathbb{Z}$ for every $a$ (in particular for $a = d$, where $k = 1$) means $c_d \cdot d!$ being a multiple of $m$ in $\mathbb{Z}$, so to choose $c_d$ as small as possible, we want the smallest $c_d$ with $m \mid c_d \cdot d!$. So $c_d = m / \gcd(m, d!)$, and $\overline{c_d \cdot x^{\underline{d}}} = 0$ for every $d$.
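We can sanity-check the optimality claim numerically (my own check, not part of the proof): the smallest $c$ that makes $c \cdot x^{\underline{d}}$ vanish everywhere mod $m$ should be exactly $m / \gcd(m, d!)$.

```python
from math import gcd, factorial

def falling(a, d):
    """Falling factorial a^(d) = a * (a - 1) * ... * (a - d + 1)."""
    r = 1
    for k in range(d):
        r *= a - k
    return r

def smallest_c(m, d):
    """Smallest c > 0 with c * a^(d) = 0 (mod m) for all residues a."""
    c = 1
    while any(c * falling(a, d) % m != 0 for a in range(m)):
        c += 1
    return c

# Brute force agrees with the closed form m / gcd(m, d!).
for m in [4, 12, 16, 30]:
    for d in range(6):
        assert smallest_c(m, d) == m // gcd(m, factorial(d))
```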

A polynomial that is zero everywhere is generated by the $p_d$

Let $p \in I$, i.e. such that $\bar{p}(a) = 0$ for all $a \in \mathbb{Z}/m\mathbb{Z}$. We can write $p$ in the Newton basis:

$$p = \sum_i a_i x^{\underline{i}}$$

We will show inductively that the coefficients have the given form, i.e. that actually $a_i = b_i c_i$ for some $b_i \in \mathbb{Z}$.

For $i = 0$, we will consider $\bar{p}(0)$. Obviously, $\bar{p}(0) = a_0$, since $0^{\underline{i}} = 0$ for $i > 0$. So $a_0$ has to be zero (or a multiple of the modulus $m$, if you think of the polynomials as being over $\mathbb{Z}$).

Similarly, for $i > 0$, we will consider $\bar{p}(i)$. From the induction hypothesis we get that we can write $p$ as:

$$p = \sum_{j < i} b_j c_j x^{\underline{j}} + \sum_{j \geq i} a_j x^{\underline{j}}$$

We have $i^{\underline{j}} = 0$ for $j > i$ and of course $\overline{c_j x^{\underline{j}}} = 0$, so $\bar{p}(i) = a_i \cdot i^{\underline{i}} = a_i \cdot i!$. But this is exactly the case we discussed when finding the $c_i$ for the $p_i$, so $a_i$ is a multiple of that $c_i$, i.e. $a_i = b_i c_i$, and $p$ can be written

$$p = \sum_{j \leq i} b_j c_j x^{\underline{j}} + \sum_{j > i} a_j x^{\underline{j}}$$

When $m$ divides $d!$, $c_d$ is 1, so further $p_e$ with $e > d$ are just $\mathbb{Z}[x]$-multiples of $p_d$, and thus not needed in the generating set.
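As a concrete sanity check of the univariate argument (my own example, with made-up numbers): $2x^2 + 2x$ is zero everywhere mod 4, and its Newton coefficients, computed here via Newton's forward-difference formula $a_i = \Delta^i p(0) / i!$, are indeed multiples of the corresponding $c_i$.

```python
from math import gcd, factorial

m = 4

def p(x):
    return 2 * x**2 + 2 * x

# p is a null polynomial mod 4.
assert all(p(a) % m == 0 for a in range(m))

# Newton-basis coefficients via forward differences: a_i = delta^i p(0) / i!
vals = [p(a) for a in range(3)]
coeffs = []
for i in range(3):
    coeffs.append(vals[0] // factorial(i))
    vals = [b - a for a, b in zip(vals, vals[1:])]

print(coeffs)  # → [0, 4, 2], i.e. p = 0 + 4 x^(1) + 2 x^(2)

# Each a_i is a multiple of c_i = m / gcd(m, i!), here c = [4, 4, 2].
assert all(a % (m // gcd(m, factorial(i))) == 0 for i, a in enumerate(coeffs))
```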

Multivariate

One idea you might have is whether the multivariate generators are just products of the univariate generators in each variable, but this is not true. For example, consider the polynomial $4 x^{\underline{2}} y^{\underline{2}}$, which is 0 everywhere mod 16, but $4 x^{\underline{2}}$ is not 0 everywhere mod 16 (and neither is $4 y^{\underline{2}}$). Nevertheless, the simple (but not quite as simple) generalization of the univariate case that I showed at the beginning is true.
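A quick numeric check of a counterexample of this kind (my own verification; the specific polynomial $4 x^{\underline{2}} y^{\underline{2}}$ is my reconstruction of the example):

```python
from itertools import product

def falling(a, d):
    """Falling factorial a^(d) = a * (a - 1) * ... * (a - d + 1)."""
    r = 1
    for k in range(d):
        r *= a - k
    return r

m = 16
# 4 * x^(2) * y^(2) vanishes at every point mod 16 ...
assert all(4 * falling(x, 2) * falling(y, 2) % m == 0
           for x, y in product(range(m), repeat=2))
# ... but the univariate factor 4 * x^(2) does not (at x = 2 it is 8).
assert 4 * falling(2, 2) % m == 8
```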

Again, let $p \in I$, i.e. such that $\bar{p}(a) = 0$ for all $a \in (\mathbb{Z}/m\mathbb{Z})^n$. Again, we can write $p$ in the Newton basis, where $\mathbf{i} = (i_1, \dots, i_n)$ ranges over all multi-indices:

$$p = \sum_{\mathbf{i}} a_{\mathbf{i}} \, x_1^{\underline{i_1}} \cdots x_n^{\underline{i_n}}$$

There very well may be a cleverer way to do this, but we are essentially going to do induction. The details of the induction actually require some thought, though. We want to follow the same logic as in the univariate case, i.e. to prove that $a_{\mathbf{d}}$ has the correct form (in this case $a_{\mathbf{d}} = b_{\mathbf{d}} c_{\mathbf{d}}$ for some $b_{\mathbf{d}} \in \mathbb{Z}$) by considering $\bar{p}(d_1, \dots, d_n)$. Let's look at $\bar{p}(d_1, \dots, d_n)$ to reverse-engineer what the induction hypothesis should look like.

If any of the $i_k > d_k$, then $d_k^{\underline{i_k}} = 0$ and so the whole term is zero. This means we can restrict the sum to multi-indices $\mathbf{i} \leq \mathbf{d}$, meaning $i_k \leq d_k$ for all $k$.

The other terms are not zero in general (i.e. for an arbitrary polynomial), but since we are formulating the induction hypothesis, we can assume those have the correct form, meaning we can write them as multiples of the $c_{\mathbf{i}}$. We just have to be careful to exclude $\mathbf{i} = \mathbf{d}$, since that is the term we're trying to show has the correct form. Overall, the induction hypothesis looks like this:

$p$ has the following form:

$$p = \sum_{\mathbf{i} \not\leq \mathbf{d}} a_{\mathbf{i}} \, x_1^{\underline{i_1}} \cdots x_n^{\underline{i_n}} + \sum_{\substack{\mathbf{i} \leq \mathbf{d} \\ \mathbf{i} \neq \mathbf{d}}} b_{\mathbf{i}} c_{\mathbf{i}} \, x_1^{\underline{i_1}} \cdots x_n^{\underline{i_n}} + a_{\mathbf{d}} \, x_1^{\underline{d_1}} \cdots x_n^{\underline{d_n}}$$

Here is an illustration for 2 variables. The yellow point at $(d_1, d_2)$ stands for the term that we are trying to show has the wanted form. The blue dots are the terms of the first sum, which are automatically zero when evaluating $\bar{p}(d_1, d_2)$. The red dots are the terms of the second sum, which we assume have the correct form already (because we have shown that they do previously) and are thus also zero.

We're now 80% done, even though we haven't even shown that $a_{\mathbf{d}}$ has the correct form given the form of $p$, which we are going to do now. As we just discussed, we will consider $\bar{p}(d_1, \dots, d_n)$. By our construction, the only term that is non-zero is:

$$a_{\mathbf{d}} \cdot d_1^{\underline{d_1}} \cdots d_n^{\underline{d_n}} = a_{\mathbf{d}} \cdot d_1! \cdots d_n!$$

Going through the same logic as in the univariate case, we want this to equal a multiple of the modulus $m$. But this implies that $a_{\mathbf{d}}$ is a multiple of the smallest such value, which is $c_{\mathbf{d}} = m / \gcd(m, d_1! \cdots d_n!)$, by the exact same argument as in the univariate case.

To show this is true for all multi-indices $\mathbf{d}$, we can e.g. do induction on $|\mathbf{d}| = d_1 + \dots + d_n$. For $|\mathbf{d}| = 0$, this is trivial. And the induction step we just proved can be easily used to show this for $|\mathbf{d}| = s + 1$, given that it holds for all $\mathbf{d}$ with $|\mathbf{d}| \leq s$.

For example, for 2 variables, suppose we have already shown it for all $\mathbf{d}$ with $|\mathbf{d}| \leq s$, i.e. the red points in the following diagram. The induction step then allows us to conclude it for all points on the boundary $|\mathbf{d}| = s + 1$.

On Minimality

In Part 3 of the MBA series, I already discussed that for the univariate case, when the modulus is a power of two, every $p_d$ with odd $d$ is not needed, because it is a $\mathbb{Z}[x]$-multiple of the previous $p_{d-1}$, which follows from the fact that it has the same leading coefficient, as discussed there. In general, the same idea applies: If $p_{\mathbf{d}}$ has the same leading coefficient as a $p_{\mathbf{e}}$ with $\mathbf{e} \leq \mathbf{d}$, then it is obviously a multiple of that polynomial and is not needed. At some point the leading coefficient will be 1, at which point no further polynomials are needed, because they all have leading coefficient 1. This way you can algorithmically find the minimal generating set. Of course, if you were gonna store them, you wouldn't really care about the variable names, so e.g. storing $p_{(d_1, d_2)}$ and $p_{(d_2, d_1)}$ (for $d_1 \neq d_2$) is redundant, because one is the other with the variables swapped. In practice you would restrict the list to $d_1 \leq d_2 \leq \dots \leq d_n$, but by the definition of a generating set these are still needed.
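A sketch of this procedure for the univariate case (my own implementation of the idea above; the multivariate version works the same way on multi-indices):

```python
from math import gcd, factorial

def minimal_generators(m):
    """Degrees d and constants c_d of the minimal generating set:
    keep c_d * x^(d) only if its leading coefficient c_d differs from
    the previous one, and stop once the leading coefficient is 1."""
    gens = []
    prev = None
    d = 0
    while True:
        c = m // gcd(m, factorial(d))
        if c != prev:
            gens.append((d, c))
        if c == 1:
            return gens
        prev = c
        d += 1

print(minimal_generators(16))  # → [(0, 16), (2, 8), (4, 2), (6, 1)]
```

For a power of two like 16, only the even degrees survive, matching the observation that every odd-degree generator is redundant.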