Блог пользователя Elegia

Автор Elegia, история, 4 дня назад, По-английски

Hello everyone.

Several years ago jqdai0815 released a blog on the topic of D-Finite functions. In this blog, I will introduce some more theories, and show the relation between some CP problems. Also, I found out that what I show is like opening Pandora's box: it provides some new ways to solve problems, but they also somehow prevent us from really understanding them. Therefore, I'll also talk about some of my personal critiques on this topic.

Since this blog is quite long, I'll divide it into several parts. This is the first part.

Appetizer

We start with this toy problem.

$$$n$$$-King Problem. Given $$$n$$$, count the number of permutations $$$\sigma$$$ of $$$[n]$$$ such that for any $$$i$$$, $$$|\sigma(i) - \sigma(i+1)| \neq 1$$$.

Of course, this sequence has already been studied before, it is the A002464 in OEIS. You can find its recurrence relation

$$$ a_n = (n+1)a_{n-1} - (n-2)a_{n-2} - (n-5)a_{n-3}+ (n-3)a_{n-4}. $$$

But how to prove this? A combinatorial proof is possible, but not that easy. Here I will show a different way to prove this.

We first consider the inclusion-exclusion principle. Suppose we force a subset $$$S$$$ of $$$[n-1]$$$ such that $$$|\sigma(i) - \sigma(i+1)| = 1$$$ for all $$$i \in S$$$. Then we can see that the number of permutations is determined by the following: First arbitrarily permute order the contiguous blocks of $$$[n]$$$ determined by $$$S$$$, then for each block of length $$$\geq 2$$$, it can be ascending or descending, this contributes a factor of $$$2$$$. Therefore, we could write down the generating function of the sequence $$$a_n$$$ as

$$$ \begin{align*} \sum_{n\geq 0} a_n x^n &= \left(\sum_{n\geq 0} n! T^n\right) \circ \left(x - 2 x^2 + 2x^3 - 2x^4 + \cdots\right)\\ &= \left(\sum_{n\geq 0} n! T^n\right) \circ \left(x\frac{1-x}{1+x}\right). \end{align*} $$$

Let $$$A(x)$$$ be the generating function of $$$a_n$$$, if we want to derive a P-recursive relation for $$$a_n$$$, this is equivalent to finding the D-Finite relation for $$$A(x)$$$. The idea is to first analyze the generating function of $$$n! T^n$$$: let

$$$ S(T) = \sum_{n\geq 0} n!T^n, $$$

let $$$s_n = n!$$$ be the sequence of $$$S(T)$$$, then by the recurrence relation

$$$ s_n = n s_{n-1} + [n=0], $$$

we can turn this into a D-finite relation

$$$ S(T) = (S\cdot T)' \cdot T +1 = 1 + TS + T^2S'. $$$

Therefore, since $$$A(x) = S(g(x))$$$, where $$$g(x) = x\frac{1-x}{1+x}$$$, we can derive the D-Finite relation for $$$A(x)$$$:

$$$ A(x) = S \circ g = 1 + gA + g^2 S'\circ g. $$$

recall that $$$A' = g' S'(g)$$$, we can plug in $$$S'\circ g = A' / g'$$$, and we can get the D-Finite relation for $$$A(x)$$$:

$$$ A = 1 + gA + \frac{g^2 A'}{g'}. $$$

After some simplification, we can get the desired P-recursive relation for $$$a_n$$$. I omit the computation details here.

The derivation above is a typical example of the application of the following basic property of D-Finite functions:

Let $$$f$$$ be a D-Finite function, and $$$q$$$ be a rational function, then $$$f\circ q$$$ is D-Finite.

This leads to a fundamental reason why D-Finite functions are useful: They satisfy so many nice closure properties, and these properties are almost complete for explaining the ubiquity of P-recursive sequences in combinatorics.

Multivariate D-Finite Functions

We are already more-or-less familiar with one variable D-Finite functions, whose associated sequence is P-recursive (or called holonomic). A natural question is to ask the correct generalization of D-Finite functions to multiple variables.

The correct definition is due to Richard P. Stanley.

Def 1. Let $$$K$$$ be a field of characteristic zero. A formal power series $$$f \in K[ [ X_1,\dots,X_d ] ]$$$ is called D-finite if the $$$K(X_1,\dots,X_d)$$$-vector space spanned by all derivatives of $$$f$$$ is finite-dimensional.

A caveat is that for multiple variables, the description of the property of the associated sequence of a D-Finite series, is not as nice as in the univariate case. However, we will see why the definition is still useful later.

Similar to the univariate case, multivariate D-Finite functions have nice closure properties.

Theorem. If $$$f, g \in K[ [ X_1,\dots,X_d ] ]$$$ is D-Finite, then

  1. For any $$$a,b \in K(X_1,\dots,X_d)$$$, if $$$af+bg$$$ is still a formal power series, it is D-Finite.
  2. The product $$$fg$$$ is D-Finite.
  3. If $$$u_1,\dots,u_d \in K[ [ Y_1,\dots,Y_e ] ]$$$ is algebraic, and the composition $$$f(u_1,\dots,u_d)$$$ is well-defined in some sense, then it is D-Finite.
  4. (Lipshitz, 1988) If $$$f$$$ is D-Finite, then the diagonal operator $$$\Delta_{X_{d-1},X_d}\colon K[ [ X_1,\dots,X_d ] ] \to K[ [ X_1,\dots,X_{d-2},U ] ]$$$
$$$ \left(\sum_{i_1,\dots,i_d\geq 0} f_{i_1,\dots,i_d} X_1^{i_1}\cdots X_d^{i_d}\right) \mapsto \sum_{i_1,\dots,i_{d-2},j\geq 0} f_{i_1,\dots,i_{d-2},j,j} X_1^{i_1}\cdots X_{d-2}^{i_{d-2}} U^j. $$$

maps D-Finite functions to D-Finite functions.

The first two properties shows that D-Finite functions in $$$K[ [ X_1,\dots,X_d ] ] \otimes_{K[X_1,\dots,X_d]} K(X_1,\dots,X_d)$$$ form a $$$K(X_1,\dots,X_d)$$$-algebra.

The first three properties are easy to prove and the last one needs more insights (but is also the most useful one).

I'll not present the proofs here; interested readers can refer to Manuel Kauers's recent book D-Finite Functions. However, I want to remark that all these properties can be made constructive.

Make Practical

One could prove that Def 1 is equivalent to the following definition:

Def 2. A formal power series $$$f \in K[ [ X_1,\dots,X_d ] ]$$$ is D-Finite if for any $$$1\leq \ell \leq d$$$, there exists $$$k_\ell$$$ such that $$$\partial_{X_\ell}^{k_\ell} f$$$ can be $$$K(X_1,\dots,X_d)$$$-linearly expressed by $$$\partial_{X_\ell}^{j} f$$$ for $$$0\leq j < k_\ell$$$.

Therefore, the constructiveness means that:

  1. We could use data of the linear relation in Def 2 to represent a D-Finite function. (Rigorously speaking, the solution of this system of equations didn't fully determine the D-Finite function, but it is enough for most purposes.)
  2. The above closure properties can be proved by an algorithm that takes the linear relations as input, and outputs the linear relations of the output D-Finite function.

Now we take the simplest example of the sum of two univariate D-Finite functions. Let $$$f,g$$$ be D-Finite, and suppose they satisfy the linear relations

$$$ \partial^n_X f = \sum_{j < n} p_j(X) \partial^j_X f, \quad \partial^m_X g = \sum_{j < m} q_j(X) \partial^j_X g. $$$

Then we can derive the linear relation for $$$f+g$$$: Recursively compute the representation of $$$\partial_X^i (f+g)$$$, in terms of $$$\partial_X^j f$$$ and $$$\partial_X^k g$$$ for $$$j < n$$$ and $$$k < m$$$. Since they form an $$$(n+m)$$$-dimensional vector space, we can always find a linear relation for $$$\partial_X^i (f+g)$$$ between $$$0\leq i\leq n+m$$$.

For example, in jqdai0815's blog, he showed that the power series involved in the problem Chinese Elephant Chess is D-Finite. He suspected that it would be non-practical to compute the sequence via the P-recursive relation, but in fact, I implemented the above algorithm to automatically compute the relation, and then compute the sequence in linear time. Turns out that this code is, in fact, quite efficient. The code is attached below as a reference.

Code

More Implications

Then, let me demonstrate a more interesting example, this might explain why D-Finite functions are everywhere.

We are often concerned with a summation involving binomials, and we want to simplify the expression. For example, the following sum

$$$ a_n = \sum_k (-1)^k \binom{n}{k}^3. $$$

Well, this actually can be simplified and is known as Dixon's identity, which has a lot of beautiful proofs.

But in a computational perspective, suppose I just want to compute the sequence $$$a_n$$$ efficiently,

do we need to know such a beautiful identity?

The answer is no. It's very general that such kind of sequences are P-recursive, here is a proof:

We first write down the generating function of the binomial:

$$$ Q(X,Y)= \sum_{n,k} \binom{n}{k} X^n Y^k = \sum_n X^n (1+Y)^n = \frac 1{1-X-XY}, $$$

which is clearly D-Finite. Then consider the generating function

$$$ Q(X_1,Y_1)Q(X_2,Y_2)Q(X_3,Y_3), $$$

is clearly D-Finite. Then we use the diagonal operator to glue the variables $$$X_1,X_2,X_3$$$ together, and $$$Y_1,Y_2,Y_3$$$ together, and we get a D-Finite function

$$$ R(X,Y) = \sum_{n,k}\binom{n}{k}^3 X^n Y^k. $$$

Finally, we plug in $$$Y=-1$$$, and we get the generating function of $$$a_n$$$:

$$$ R(X,-1) = \sum_n a_n X^n. $$$

Since $$$R(X,-1)$$$ is D-Finite, $$$a_n$$$ is P-recursive.

Therefore, we could compute $$$a_n$$$ in linear time by just using the recurrence, or even in $$$O(\sqrt n \log n)$$$ time by the fast evaluation algorithm of P-recursive sequences. Knowing the simplification of the expression doesn't help reducing the time complexity in this sense.

This argument has a vast generalization. In the paper Multiple Binomial Sums, the authors formalized a class of summation involving binomials that captures almost all the known identities. One can mimic the above argument to prove that all these sequences are D-Finite. (In the paper they proved some stronger characterization.)

Then here comes another question:

If we just want to solve a problem in competition, do we need to know the proof of P-Recursiveness?

The answer is also no! If a priori we know that the sequence is P-recursive, we can just compute the first several terms in some brute-force way, and then use Gaussian elimination to find the P-recursive relation. This is usually called Min25-BM since the BM algorithm tells that linear recurrence sequences can be reconstructed efficiently.

Remark: The problem of finding the P-recursive relation can be reduced to the Hermite-Padé approximation problem. While Gaussian elimination takes $$$O(n^3)$$$ time, the Hermite-Padé approximation can be solved $$$O(n^2)$$$ time or even $$$O(n\log^2 n)$$$ time. (I'm a little bit hand-waving here about what $$$n$$$ actually means), though this might not be very useful if one just wants to find a P-recursive relation of constant size.

  • Проголосовать: нравится
  • +123
  • Проголосовать: не нравится