Modulo Inversion in Constant Time, with Subpolynomial Precomputation

#	User	Rating
1	tourist	3857
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3463
9	ecnerwala	3451
10	heuristica	3431

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Qingyu	158
5	atcoder_official	157
5	Um_nik	157
7	adamant	151
7	djm03178	151
7	luogu_official	151
10	awoo	146

Hi everyone,

In this post, I'll share a method to compute modular inverses in $$$O(1)$$$ time under some $$$p^{\varepsilon}$$$-time precomputation.

Rather than simply computing the modular inverse, we aim to solve for a pair of integers $$$x, y$$$ such that $$$ax + by = \gcd(a, b)$$$. I.e., find the solution of the Bezout equation.

Observations

Observation. For two non-zero positive integers $$$a, b$$$ less than some bound $$$X$$$, there exists an invertible integer matrix $$$M \in \mathrm{GL}_2(\mathbb{Z})$$$ such that one component of the vector $$$M \begin{pmatrix}a\\b \end{pmatrix}$$$ is bounded by $$$X^{1/2}$$$, and the absolute values of the elements in $$$M$$$ are also bounded by $$$X^{1/2}$$$.

Proof. The pigeonhole principle is sufficient to prove this observation. However, there could be a more algorithmic proof by analyzing the execution of Euclid's algorithm. We omit the details of this proof here. $$$\square$$$

Corollary. Given two non-zero positive integers $$$a, b$$$ less than $$$X$$$, and a parameter $$$B \leq X$$$, there exists an invertible integer matrix $$$M \in \mathrm{GL}_2(\mathbb{Z})$$$ such that one component of the vector $$$M \begin{pmatrix}a\\b \end{pmatrix}$$$ is bounded by $$$3X/B^{1/2}$$$.

Proof. First, we define $$$a_0 = \lfloor aB/X \rfloor$$$ and $$$b_0 = \lfloor bB/X \rfloor$$$. Using the matrix $$$M$$$ from the previous observation, we can compute $$$M(a_0, b_0)$$$, where one of the components is bounded by $$$B^{1/2}$$$. The error between $$$(a,b)$$$ and $$$(a_0,b_0)$$$ is controlled by the difference $$$O(B/X)$$$. In total, the error is bounded by $$$3X/B^{1/2}$$$. $$$\square$$$

The core of the algorithm involves using this corollary to precompute relevant data and then quickly look up the results during the query phase. The following describes the two phases of the approach:

Algorithm

Processing Phase:
Choose a parameter $$$B$$$ and precompute the invertible matrices $$$M$$$ for all $$$a, b < B$$$, as well as the answers for $$$a, b < 3B$$$. This precomputation phase has a time complexity of $$$\tilde{O}(B^2)$$$, where the $$$\tilde{O}$$$ notation accounts for polylogarithmic factors.

Query Phase:
Once the preprocessing is done, we can perform the following steps to compute the result for any query:

For $$$\max(a, b) \geq 3B$$$, reduce the larger of the two numbers by a factor of $$$X \cdot 3B^{-1/2}$$$, and then apply the Euclidean algorithm to reduce the other number.
Continue this process until both $$$a, b < 3B$$$, at which point the result can be directly retrieved from the precomputed table.

This query phase runs in $$$O(\log X / \log B)$$$ time, providing an efficient way to compute the desired results.

Take $$$B = X^{O(1/k)}$$$, we could do $$$O(p^{1/k})$$$ time to do precomputation, and support the modular inversion in $$$O(k)$$$ time per query.

Take $$$B=(\log X)^{O(1)}$$$, we could compute modular inversion in $$$O(\log X / \log \log X)$$$ for a single query.

Remarks

We know that for the data structure problem there are some tradeoff between $$$O(n^\epsilon)$$$ time update and $$$O(1/\epsilon)$$$ time query. The approach described here can be thought of as the right analogy with the half-GCD algorithm. The key insight of the half-GCD method is recognizing that the results of the corollary can be computed recursively, leading to an $$$O(\ell \log^2 \ell)$$$ algorithm for Euclidean division in integer arithmetic or for polynomials over finite fields.

Rev.	By	When	Δ	Comment
en4	LorentzianExpanders	2025-02-25 19:50:25	4
en3	LorentzianExpanders	2025-02-25 19:48:17	241
en2	LorentzianExpanders	2025-02-25 19:43:28	3
en1	LorentzianExpanders	2025-02-25 19:42:28	3336	Initial revision (published)

Observations

Algorithm

Remarks

History