Those of us who have been doing cp for a long time, have come to love the representation of numbers as an infinite prime power vector i.e.
Where $$$n$$$ is a number and $$$p_i$$$ is the $$$i$$$ th prime number. This has many useful properties for problems, such as mapping
All the above operations are done elementwise. However in some cases, the problem authors may forbid you from factorising the numbers, by perhaps increasing $$$n$$$ to around $$$10^{36}$$$ where there isn't any simple factorisation algorithm to do it in reasonable time. However using effective prime factorisation, we can use these properties freely on a set of numbers. If this set contained all integers, then it would require us to truly prime factorise the integer. However, we only need to use these operations on a subset of integers, making our task a lot simpler. We will make a finite array $$$q$$$ of size $$$t$$$ with the following properties.
$$$(1)$$$ Any element in $$$x \in S$$$ is representable by $$$q$$$, i.e. there exists $$$[f_1, f_2, \ldots f_t] = v'_n$$$ such that $$$x = \prod q_i^{f_i}$$$
$$$(2)$$$ For all the above operations, replacing $$$v_n$$$ with $$$v'_n$$$ should produce the same result.
Let us show, that if $$$q$$$ satisfies $$$(1)$$$ and $$$gcd(q_i, q_j) = 1$$$ for all $$$i < j$$$, then $$$q$$$ satisfies $$$(2)$$$.
Let us now consider how we can find $$$q$$$ given $$$S$$$. Let us proceed element wise, and assume we have a valid solution, and we want to add another element.
Now let us add the next element $$$x \in S$$$. Go through $$$q$$$ until we spot an element that has a common factor with $$$x$$$. Let that element be $$$q_i$$$.
Let $$$P(q)$$$ be the set of elements produced by some product of elements in $$$q$$$. Notice that if $$$q$$$ has 2 elements that divide each other $$$a \mid b$$$, then $$$a, b$$$ is replaceable by $$$a, \frac{b}{a}$$$, and $$$P$$$ will not lose any elements. Let us do this recursively until neither divides the other, or an element becomes 1. Let us call this procedure reduce_pair
.
(It maybe useful to look at the example implementation in parallel from here)
We run the procedure reduce_pair
on the elements $$$q_i, x$$$, if $$$x$$$ becomes 1, we're done. If $$$q_i$$$ becomes 1, we remove it from the set and continue.
We then run an iterative algorithm on a list $$$L$$$, initialised with $$$[q_i, x]$$$. Let the elements before $$$L[ptr]$$$ be all co-prime. Initialise $$$ptr$$$ with 1. Run reduce_pair
on all elements before $$$ptr$$$ with it. Then add their gcd to the end of the list if its not one, and divide both elements by it. This will make all elements upto $$$ptr + 1$$$ coprime. There can only be as many elements in this list as there are true primes in $$$q_i, x$$$. Therefore we can deduce that the size of the list will be at most $$$O(\log{\log{max(S)}})$$$.
Since this algorithm will be run $$$O(\log{x})$$$ times, the total algorithm will run in $$$O(n^2\log{A} + n\log{A} \times (\log{\log{A}})^2)$$$ Where $$$A = max(S)$$$, and we will get our array $$$q$$$.
Example implementation : 159590539
Example problem: arc122_e - Increasing LCMs
Two more: http://usaco.org/index.php?page=viewproblem2&cpid=1213 and https://codeforces.me/contest/1656/problem/H
I did indeed use something like this before, can't remember the exact problem
Wait this algo also gives the smallest set with the given properties?
I doubt so. However one strong property we have is that the this is the smallest set that works on the elements generated by $$$S$$$, i.e. all elements that can be created by applying the 5 operations to some elements in $$$S$$$.
Great blog! I believe I have used this idea before on trying to implement a way to represent a limited kind of irrational numbers. More precisely, I wanted to represent irrational numbers in the following form.
For convenience, we would also want all values under the root to be square-free, as square factors will be better represented on their multiplied values ("coefficients") instead.
For these values to be square-free, all factors should be below 2, and we can now think of their implicit prime power vector as a binary vector. For convenience, let us define these functions to convert between numbers and their prime power vectors.
Now, we know that multiplying two values gives the sum of the two vectors in its prime power vector. However, we want to keep the powers of each factor to under 2. This lets us think about operations on the F2 field ("binary operations"). We want to keep factors with power of 0 or 1, but remove powers of 2 and move them to the coefficients. We can now define multiplication between two square roots with coefficients in a new way.
Now this lets us compute multiplication of sum-of-roots in quadratic time complexity, assuming there are limited amounts of factors so their implicit vectors can be represented on indexes of arrays (or keys of a map, if you want). The two other operations, addition and subtraction, are very trivial to implement, and so I only explained how multiplication would work.
I have been thinking of a way to do this with FFT since then, but I could still not find a way for it. Please notice me if anyone finds it.
I have a slight implementation question, in the
add_element
method:When
basis[i] == 1
, you docontinue
after replacingbasis[i]
withbasis.back()
. But this essentially means you are skipping over doingreduce_pair
onval
and what was originallybasis.back()
.I think this is OK because if
basis[i] == 1
, this means the originalbasis[i]
had the newval
as a divisor and thus every other element ofbasis[i]
will be coprime with the new value ofval
. But in this case, why not make this abreak
instead of acontinue
? I tried submitting your code with abreak
in place of thiscontinue
and it got AC: 176634276There is also
in
solve_inner
, which might skip iterating some elements inbasis
as well. But I am not sure whether skipping some elements in this case is also fine.It always works here, since if
basis[pos] == 1
There was someg > 1
added to the end of the basis, which would be useless to check against again.This is true, but I didn't think of it at the time. When I wrote
solve_inner
I realised thereduce_pair
is unnecessary. But I left it there because its faster that way.The
continue
was a mistake in my thinking.break
works there and is faster. You can just havesolve_inner
inside the for loop and it would work.Can you say a little bit more about the $$$\log \log \max (S)$$$ bound? I see that the expected number of prime divisors of a number up to $$$n$$$ is $$$O(\log \log n)$$$, but why the worst case?
Super nice and well-written blog, thanks!
If all $$$n$$$ numbers are primes, won't the size be just $$$n$$$?
Based on the example implementation, when we call
add_element
, it may happen that we need to callsolve_inner
multiple times. $$$\log \log \max(S)$$$ is supposed to bound how much thebasis
vector can increase in size for eachadd_element
call.In the case of a sequence of $$$n$$$ primes,
solve_inner
won't be called, and so the size ofbasis
will be indeed $$$n$$$. But think of a case where you add a new number $$$X$$$ and each of its prime factors are elements ofbasis
. You will end up adding precisely the number of prime factors of $$$X$$$ tobasis
, which is not bounded by $$$\log \log X$$$.My question is: is this really the bound? or just an approximation for CP purposes? As I said, the expected number of primers that divide $$$n$$$ is $$$\log \log n$$$, but the only worst case bound I could find is $$$inverseFactorial(n)$$$, which is worse than claimed by the blog.
I'm very sorry, I thought you meant that size of basis will be $$$\log \log max(S)$$$
It was an approximation for CP purposes. I find it really hard to pin down the exact worst case, as it depends on really complicated factors. It seems to be something related to integer ratios between set of primes. I'm sure with a better understanding, you can optimize that loop.
Just found this lol — this reminds me a lot of finding an orthogonal basis of a given vector space!