Minimum cost subtree

Please read the new rule regarding the restriction on the use of AI tools. ×

→ Pay attention

Before contest
Codeforces Round 975 (Div. 1)
4 days

Before contest
Codeforces Round 975 (Div. 2)
4 days

→ Streams

Meta Hacker Cup Practice Round Solution Discussion

By aryanc403

Before stream 03:42:16

View all →

→ Top rated

#	User	Rating
1	tourist	4009
2	jiangly	3773
3	Radewoosh	3646
4	ecnerwala	3624
5	jqdai0815	3620
5	Benq	3620
7	orzdevinwang	3612
8	Geothermal	3569
8	cnnfls_csy	3569
10	Um_nik	3396

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	Um_nik	164
2	cry	160
2	maomao90	160
4	-is-this-fft-	159
5	atcoder_official	158
5	awoo	158
7	adamant	155
8	nor	154
9	maroonrk	152
10	Dominater069	149

View all →

→ Find user

→ Recent actions

Detailed →

olivergrant's blog

Minimum cost subtree

By olivergrant, history, 8 months ago, In English

Hi! I have the following problem regarding MSTs.

Given a complete undirected graph $$$G = (V,E)$$$ of edges only costing either 1 or 2, find a subset of the edges $$$|E'|=k$$$ for some $$$k$$$ and vertices $$$V'$$$ such that $$$(V', E')$$$ form a tree and is of minimum cost.

The bounds for this problem is that $$$1 \le n \le 10^3$$$ and $$$1 \le k \le n - 1$$$

To me, this is very similar to an MST problem, but running a classic MST algorithm would be slow and I'm trying to do this in $$$O(|E|)$$$ time by making use of the complete graph property.

graphs, mst, minimum spanning tree, graph theory

olivergrant
8 months ago
8

Comments (8)

Write comment?

olivergrant

8 months ago, # |

Auto comment: topic has been updated by olivergrant (previous revision, new revision, compare).

→ Reply

tvmpqx_8601

8 months ago, # |

+39

Since all edges either have weight one or two and number of edges in answer is fixed, let us subtract one from weights of all edges (we will add k back to answer in the end). Now we have free edges and edges of uniform cost. Instead of searching for tree let us search for connect subgraph of size at least (k + 1), answer won't change. Firstly, take all free edges. That is, compress connected components. Now from those connected components we need to build subgraph of required size. We can make it greedily: sort components by size and take prefix of some largest components.

→ Reply

olivergrant

8 months ago, # ^ |

← Rev. 2 →

Question: What is the purpose of subtracting one from weights of all edges?

And also could you elaborate on greedily taking components. What if we're using too many edges in the compressed components?

→ Reply

EzikBro

8 months ago, # ^ |

Question: What is the purpose of subtracting one from weights of all edges?

This is to better understand the solution. If you think about why the greedy algorithm would work in this case, it is very simple: each of the components is free by itself, and only the connection between them has equal cost. However, if the edges have cost 1 and 2, then now each component has a cost that is not equal to the cost of the connection or any other component. In this case, the proof of the solution will not be so obvious.

What if we're using too many edges in the compressed components?

Each connected component has a spanning tree that has the minimum number of edges to connect the subgraph. So in this solution you don't use more than you need. You can build this tree using BFS or DFS.

And I think I should make it clear that you don't need to compress components in the real solution — you just need to extract them, but as I see in your code here you've figured that out for yourself.

→ Reply

olivergrant

8 months ago, # ^ |

← Rev. 2 →

Thanks a lot! Are there any similar problems as this? I'm curious because I want to build intuition like this.

My final code is here for anyone who is curious:

Spoiler

#include <bits/stdc++.h>
using namespace std;
struct Edge {
    int u, v, w;
};
vector<int> par, sz;
void make_set(int n) {
    par[n] = n;
    sz[n] = 1;
}
int Find(int n) {
    if (par[n] == n) return n;
    return par[n] = Find(par[n]);
}
bool Union(int a, int b) {
    a = Find(a);
    b = Find(b);
    if (a != b) {
        par[b] = a;
        sz[a] += sz[b];
        return true;
    }
    return false;
}
int main() {
    int N, K, u, v, w;
    cin >> N >> K;
    par.resize(N);
    sz.resize(N);
    int M = N * (N &mdash; 1) / 2;
    for (int i = 0; i < N; ++i) {
        make_set(i);
    }
    vector<Edge> edges, zeros, ones;
    for (int i = 0; i < M; ++i) {
        cin >> u >> v >> w;
        --w; // change edge cost to {0,1}
        if (w == 0) {
            zeros.push_back({u, v, w});
        } else if (w == 1) {
            ones.push_back({u, v, w});
        }
    }
    // take all the free edges
    for (auto & [a, b, c] : zeros) {
        Union(a, b);
    }
    // find size of each component
    set<int> unique_components;
    for (int i = 0; i < N; ++i) {
        par[i] = Find(i);
        unique_components.insert(par[i]);
    }
    vector<int> component_sizes;
    for (auto & component : unique_components) {
        component_sizes.push_back(sz[component]);
    }
    int cost = 0, count = 0;
    sort(component_sizes.rbegin(), component_sizes.rend());
    for (auto & component_size : component_sizes) {
        if (count >= K + 1) break;
        count += component_size;
        ++cost;
    }
    cout << cost &mdash; 1 + K << endl;
    return 0;
}

→ Reply

EzikBro

8 months ago, # ^ |

I don't know much about similar problems, but you can have a look at this. It has almost the same graph structure with bigger constraints, so I remembered it when I first saw this thread.

And I'd like to add one thing to your solution. It's not a big deal, but you don't need to use DSU in this problem — the extraction of the connected components can be done with BFS or DFS, and then you'll have a smaller constant, I suppose.

→ Reply

olivergrant

8 months ago, # ^ |

← Rev. 3 →

Also, sorting the components by size may not make the run time of the solution $$$O(|E|)$$$. Not sure if I'm doing something wrong here.

Spoiler

int main() {
    int N, K, u, v, w;
    cin >> N >> K;
    par.resize(N);
    sz.resize(N);
    int M = N * (N - 1) / 2;
    for (int i = 0; i < N; ++i) {
        make_set(i);
    }
    vector<Edge> edges, zeros, ones;
    for (int i = 0; i < M; ++i) {
        cin >> u >> v >> w;
        --w; // change edge cost to {0,1}
        if (w == 0) {
            zeros.push_back({u, v, w});
        } else if (w == 1) {
            ones.push_back({u, v, w});
        }
    }
    // take all the free edges
    for (auto & [a, b, c] : zeros) {
        Union(a, b);
    }
    // find size of each component
    set<int> unique_components;
    for (int i = 0; i < N; ++i) {
        par[i] = Find(i);
        if (sz[i] != 1) unique_components.insert(par[i]);
    }
    vector<int> component_sizes;
    for (auto & component : unique_components) {
        component_sizes.push_back(sz[component]);
    }
    int count = 0;
    sort(component_sizes.rbegin(), component_sizes.rend());
    for (auto & component_size : component_sizes) {
        if (count >= K + 1) break;
        count += component_size;
    }
    cout << count << endl;
    return 0;
}

→ Reply

perkyfever

8 months ago, # ^ |

← Rev. 2 →

+15

Since the graph is complete you have $$$\mathcal{O}(E) = \mathcal{O}(n^2)$$$. The number of components is $$$\leq n$$$ so sorting it takes $$$\mathcal{O}(n\log n) = o(n^2)$$$.

Sort the sizes in descending order and take them greedily one by one until the sum of the sizes is at least $$$k$$$. You can restore the edges you need by running dfs on the verticies of taken components.

→ Reply