Aliens trick, revisited

#	User	Rating
1	jiangly	3976
2	tourist	3815
3	jqdai0815	3682
4	ksun48	3614
5	orzdevinwang	3526
6	ecnerwala	3514
7	Benq	3482
8	hos.lyric	3382
9	gamegame	3374
10	heuristica	3357

#	User	Contrib.
1	cry	169
2	-is-this-fft-	165
3	Um_nik	161
3	atcoder_official	161
5	djm03178	157
6	Dominater069	156
7	adamant	154
8	luogu_official	152
9	awoo	151
10	TheScrasse	147

Hi everyone!

This time I'd like to write about what's widely known as "Aliens trick" (as it got popularized after 2016 IOI problem called Aliens). There are already some articles about it here and there, and I'd like to summarize them, while also adding insights into the connection between this trick and generic Lagrange multipliers and Lagrangian duality which often occurs in e.g. linear programming problems.

Lagrange duality

Let $$$f : X \to \mathbb R$$$ be the objective function and $$$g : X \to \mathbb R^c$$$ be the constraint function. The constrained optimization problem

$$$\begin{gather}f(x) \to \min\\ g(x) = 0\end{gather}$$$

in some cases can be reduced to finding stationary points of the Lagrange function

$$$L(x, \lambda) = f(x) - \lambda \cdot g(x).$$$

Here $$$\lambda \cdot g(x)$$$ is the dot product of $$$g(x)$$$ and a variable vector $$$\lambda \in \mathbb R^c$$$, called the Lagrange multiplier. Mathematical optimization typically focuses on finding stationary points of $$$L(x,\lambda)$$$. However, in our particular case we're more interested in the function

$$$t(\lambda) = \inf\limits_{x \in X} L(x,\lambda),$$$

which is called the Lagrange dual function. If $$$x^*$$$ is the solution to the original problem, then $$$t(\lambda) \leq L(x^*,\lambda)=f(x^*)$$$.

This allows to introduce the Lagrangian dual problem $$$t(\lambda) \to \max$$$. Note that $$$t(\lambda)$$$, as a point-wise infimum of concave (specifically, linear) functions, is always concave, even when $$$X$$$ is, e.g., discrete. If $$$\lambda^*$$$ is the solution to the dual problem, the value $$$f(x^*) - t(\lambda^*)$$$ is called the duality gap. We're specifically interested in the case when it equals zero, which is called the strong duality.

Typical example here is Slater's condition, which says that strong duality holds if $$$f(x)$$$ is convex and there exists $$$x$$$ such that $$$g(x)=0$$$.

Change of domain

In competitive programming, the set $$$X$$$ in definitions above is often weird and very difficult to analyze directly, so Slater's condition is not applicable. As a typical example, $$$X$$$ could be the set of all possible partitions of $$$\{1,2,\dots, n\}$$$ into non-intersecting segments.

To mitigate this, we define $$$h(y)$$$ as the minimum value of $$$f(x)$$$ subject to $$$g(x)=y$$$. In this notion, the dual function is written as

$$$t(\lambda) = \inf\limits_{y \in Y} [h(y) - \lambda \cdot y],$$$

where $$$Y=\{ g(x) : x \in X\}$$$. The set $$$Y$$$ is usually much more regular than $$$X$$$, as just by definition it is already a subset of $$$\mathbb R^c$$$. The strong duality condition is also very clear in this terms: it holds if and only if $$$0 \in Y$$$ and there is a $$$\lambda$$$ for which $$$y=0$$$ delivers infimum. Geometrically it means that epigraph of $$$h(y)$$$ has a supporting plane that touches it in $$$y=0$$$.

Competitive programming problems typically assume that $$$y$$$ is a variable given in the input, so this condition should actually hold for all $$$y \in Y$$$ which is essentially equivalent to $$$h(y)$$$ being convex on $$$Y$$$, which is a sufficient and almost always necessary condition here.

Interpreting lambda

If $$$h(y)$$$ is continuously differentiable and convex, the minimum for specific $$$\lambda$$$ is obtained with $$$y=g(x)$$$ such that $$$\nabla h(y) = \lambda$$$. This property allows to find $$$\lambda$$$ that corresponds to $$$y=0$$$ with nested ternary search.

Problem examples

References

Duality (optimization) — English Wikipedia
The Trick From Aliens — Serbanology
My Take on Aliens' Trick — Mamnoon Siam's Blog
Comment on Codeforces by _h_
Part of the article was once revealed to me in a dream

Rev.	By	When	Δ	Comment
en68	adamant	2023-10-11 00:47:07	8
en67	adamant	2022-02-13 16:55:43	33
en66	adamant	2022-01-04 14:57:40	25
en65	adamant	2022-01-04 05:03:24	6363	+ honorable mention
en64	adamant	2022-01-04 04:20:18	8	articles
en63	adamant	2022-01-04 04:18:39	354	example 3
en62	adamant	2022-01-04 04:00:52	748	tldr structured
en61	adamant	2022-01-04 03:39:52	33
en60	adamant	2022-01-04 03:38:28	721	example, part 2
en59	adamant	2022-01-04 03:25:48	784
en58	adamant	2022-01-04 03:18:14	1414	example
en57	adamant	2022-01-04 00:21:04	4
en56	adamant	2022-01-04 00:20:45	570	better code for min_conv
en55	adamant	2022-01-03 15:20:41	472	clarified tldr
en54	adamant	2022-01-03 03:37:09	688	code for max-conv of concave functions
en53	adamant	2022-01-03 01:11:10	43	link
en52	adamant	2022-01-03 01:07:50	30
en51	adamant	2022-01-03 01:06:56	12
en50	adamant	2022-01-03 01:02:56	1815	+ example
en49	adamant	2022-01-02 20:14:13	160	sections in testing convexity
en48	adamant	2022-01-02 13:29:02	129
en47	adamant	2022-01-02 13:26:24	104	(published)
en46	adamant	2022-01-02 13:21:47	3709
en45	adamant	2022-01-02 12:55:16	690
en44	adamant	2022-01-02 01:54:22	24
en43	adamant	2022-01-02 01:53:53	710
en42	adamant	2022-01-02 01:01:41	210
en41	adamant	2022-01-02 00:54:43	643
en40	adamant	2022-01-02 00:49:11	1627
en39	adamant	2022-01-02 00:09:54	8779
en38	adamant	2022-01-01 22:34:20	1161
en37	adamant	2022-01-01 16:34:48	296
en36	adamant	2022-01-01 16:33:24	4
en35	adamant	2022-01-01 16:29:44	22
en34	adamant	2022-01-01 16:27:27	131
en33	adamant	2022-01-01 16:26:19	173
en32	adamant	2022-01-01 16:23:52	31
en31	adamant	2022-01-01 16:23:22	865
en30	adamant	2021-12-29 05:34:22	1201
en29	adamant	2021-12-29 03:47:08	332
en28	adamant	2021-12-29 02:45:56	2366
en27	adamant	2021-12-28 22:09:05	1383
en26	adamant	2021-12-28 19:28:41	1585
en25	adamant	2021-12-28 18:57:44	52
en24	adamant	2021-12-28 18:44:26	5
en23	adamant	2021-12-28 18:43:58	915
en22	adamant	2021-12-28 18:28:53	112
en21	adamant	2021-12-28 18:25:09	218
en20	adamant	2021-12-28 18:18:48	1224	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en19	adamant	2021-12-28 17:51:33	66
en18	adamant	2021-12-28 17:50:21	316	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en17	adamant	2021-12-28 17:35:06	409	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en16	adamant	2021-12-28 17:18:07	81	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en15	adamant	2021-12-28 17:14:07	1049	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en14	adamant	2021-12-28 16:12:52	96	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en13	adamant	2021-12-28 16:01:48	338	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en12	adamant	2021-12-28 15:54:43	903	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en11	adamant	2021-12-28 04:38:22	2	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en10	adamant	2021-12-28 04:35:18	118	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en9	adamant	2021-12-28 04:23:41	1406	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en8	adamant	2021-12-28 03:18:55	333	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en7	adamant	2021-12-28 02:59:30	322	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en6	adamant	2021-12-28 01:40:50	2845	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en5	adamant	2021-12-27 22:08:35	26	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en4	adamant	2021-12-26 19:06:11	0	Tiny change: 'blem\n\n$$f(x)→ming(x)=0f(x)→ming(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en3	adamant	2021-12-26 01:42:07	1888	Tiny change: 'blem\n\n$$f(x)→extrg(x)=0f(x)→extrg(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en2	adamant	2021-12-25 16:27:58	176	Tiny change: 'blem\n\n$$f(x)→extrg(x)=0f(x)→extrg(x)=0\begin{gat' -> 'blem\n\n$$\begin{gat'
en1	adamant	2021-12-25 16:21:30	2909	Initial revision (saved to drafts)

Lagrange duality

Change of domain

Interpreting lambda

Problem examples

References

History