Google Hashcode 2022 Practice Round "One pizza" Discussion

kingmoshe's blog

By kingmoshe, history, 3 years ago, In English

Hello everyone,

Since the practice round doesn't seem to provide a scoreboard, I'd like to open this thread to discuss scores and possible approaches for the problem. We got the following scores after some ideas:

A: 2

B: 5

C: 5

D: 1802

E: 2083

Total: 3897

We also tried to calculate an upper bound for some of the tasks:

D upper bound: 1900

E upper bound: 2804 (probably can be significantly improved)

I will post my ideas in the comments. Did anyone manage to get a better score, or have an insight into the problem?

Comments (32)

yarden.porat

3 years ago, # |

Upper bound explanation (I worked on the problem with kingmoshe):

First we need to explain the graph of clients. We represent each client as a vertex in a graph. Two clients (vertices) have an edge between them if no pizza can satisfy both of them (for example, one client likes cheese and the other dislikes cheese). It is easy to see that if you take a group of vertices with no edges between any two of them, then there is a pizza that satisfies all of those clients. It is also easy to see that no pizza can satisfy two vertices with an edge between them.
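A minimal sketch of how such a conflict graph could be built. The representation of a client as a `(likes, dislikes)` pair of sets, the function names, and the tiny three-client instance are assumptions made for illustration, not the code we actually used:

```python
from itertools import combinations

def build_conflict_graph(clients):
    """clients: list of (likes, dislikes) pairs of sets.
    Returns adjacency sets; i and j are adjacent when no pizza satisfies both."""
    adj = [set() for _ in clients]
    for i, j in combinations(range(len(clients)), 2):
        likes_i, dislikes_i = clients[i]
        likes_j, dislikes_j = clients[j]
        # Conflict: one client likes an ingredient the other dislikes.
        if likes_i & dislikes_j or likes_j & dislikes_i:
            adj[i].add(j)
            adj[j].add(i)
    return adj

def pizza_for(clients, group):
    """For a conflict-free group, the union of all likes satisfies everyone."""
    return set().union(*(clients[i][0] for i in group))

# Illustrative instance (assumption): three clients.
clients = [
    ({"cheese", "peppers"}, set()),
    ({"basil"}, {"pineapple"}),
    ({"mushrooms", "tomatoes"}, {"basil"}),
]
adj = build_conflict_graph(clients)
print(adj)
print(pizza_for(clients, [0, 1]))
```

The pairwise loop is quadratic in the number of clients, so for the larger inputs an index from ingredient to the clients who like or dislike it would probably be needed.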

Therefore the problem reduces to the maximum anti-clique (independent set) problem, which is NP-hard.

We computed the upper bound as follows:
1. Search for the biggest clique that we can find.
2. Deduce that at most one vertex from the clique can be chosen.
3. Remove the clique from the clients and start again.

The number of cliques we find is then an upper bound on the number of clients that can be satisfied.

In D we found 1900 cliques, and in E we found 2804 cliques.

This approach gives a good upper bound, which could obviously be reduced further.
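The clique-removal bound can be sketched roughly as follows. The greedy way of growing each clique (start from the highest-degree vertex, keep adding the best-connected candidate) is an assumption for illustration; any heuristic that returns a valid clique gives a valid bound, since an anti-clique contains at most one vertex per clique:

```python
def greedy_clique(adj, alive):
    """Grow one clique inside `alive`, starting from the highest-degree vertex."""
    start = max(alive, key=lambda v: len(adj[v] & alive))
    clique = {start}
    candidates = adj[start] & alive  # vertices adjacent to every clique member
    while candidates:
        v = max(candidates, key=lambda u: len(adj[u] & candidates))
        clique.add(v)
        candidates &= adj[v]  # also drops v itself (no self-loops)
    return clique

def clique_cover_bound(adj):
    """Number of cliques removed; an upper bound on the largest anti-clique."""
    alive = set(range(len(adj)))
    count = 0
    while alive:
        alive -= greedy_clique(adj, alive)
        count += 1
    return count

# Illustrative graph (assumption): a triangle, one edge, one isolated vertex.
adj = [{1, 2}, {0, 2}, {0, 1}, set(), {5}, {4}]
print(clique_cover_bound(adj))
```

Larger cliques give a tighter bound, so spending more effort on the clique search directly improves the result.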

→ Reply

yarden.porat

3 years ago, # |

Static solution for E that reached 2025 points (to get to 2078 we used a lot of dynamic approaches to improve a given output):
1. Calculate the degree of each vertex.
2. Choose the vertex with the lowest degree.
3. Remove that vertex and its neighbors from the graph.
4. Go to step 1.
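The four steps above can be sketched like this. The adjacency-set representation and the tie-break by vertex index are assumptions for illustration:

```python
def greedy_independent_set(adj):
    """Repeatedly take the lowest-degree vertex and delete it with its neighbors."""
    alive = set(range(len(adj)))
    chosen = []
    while alive:
        # Degree is counted only among vertices still in the graph.
        v = min(alive, key=lambda u: (len(adj[u] & alive), u))
        chosen.append(v)
        alive -= adj[v] | {v}
    return chosen

# Illustrative graph (assumption): the path 0-1-2-3-4.
path = [{1}, {0, 2}, {1, 3}, {2, 4}, {3}]
print(greedy_independent_set(path))
```

This version recomputes degrees on every step, which is simple but slow; keeping a degree array and updating it on removal would be the obvious speed-up.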

→ Reply

H-H2648

3 years ago, # ^ |

How did you improve the score from 2025 to 2078?

→ Reply

kingmoshe

3 years ago, # ^ |

We had two main *dynamic improvements.

1) We already have a function for finding a large anti-clique, so let's try the following improvement:
a) Randomly take half of your anti-clique.
b) Remove from the graph all nodes in that half and all of their neighbors.
c) Calculate a large anti-clique of the remaining graph.
d) If the new anti-clique plus the old half is larger than the old anti-clique, keep it; otherwise stay with the old anti-clique.
e) Go back to step a), 1000 times in total.

2) Let's call the selected anti-clique C and the rest of the graph G.
a) Create a new list; let's call it K.
b) Pick at random a node from G that has a minimal number of neighbors in C, and add it to K.
c) Remove that node and all of its neighbors (in G) from G.
d) Remove all of the node's neighbors from C.
e) If the number of nodes in K is larger than the number of nodes removed from C, then K is better than the removed nodes of C (and we have improved the solution).
f) Repeat these steps 1000 times.

Both of these improvements helped a lot, but using both of them together is what really made the difference. After adding a little more randomness, we managed to increase our score to 2083.

*Dynamic improvement: take your solution and try a small change. If the score increases, keep the change; otherwise discard it. Try again and again until no improvement is found.
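A rough sketch of the first improvement (keep a random half, re-solve the rest). The helper `greedy_is` with random tie-breaking stands in for our anti-clique function; its details, the function names, and the star-graph example are assumptions for illustration:

```python
import random

def greedy_is(adj, alive, rng):
    """Min-degree greedy anti-clique on `alive`, ties broken at random."""
    alive = set(alive)
    chosen = set()
    while alive:
        best = min(len(adj[u] & alive) for u in alive)
        v = rng.choice(sorted(u for u in alive if len(adj[u] & alive) == best))
        chosen.add(v)
        alive -= adj[v] | {v}
    return chosen

def improve_by_half_resolve(adj, current, rounds=1000, seed=0):
    rng = random.Random(seed)
    current = set(current)
    for _ in range(rounds):
        # a) keep a random half of the current anti-clique
        kept = set(rng.sample(sorted(current), len(current) // 2))
        # b) remove the kept nodes and all of their neighbors
        blocked = set(kept)
        for v in kept:
            blocked |= adj[v]
        rest = set(range(len(adj))) - blocked
        # c) solve the remaining graph, d) keep the result only if it is larger
        candidate = kept | greedy_is(adj, rest, rng)
        if len(candidate) > len(current):
            current = candidate
    return current

# Illustrative graph (assumption): a star with center 0 and leaves 1..4.
star = [{1, 2, 3, 4}, {0}, {0}, {0}, {0}]
print(improve_by_half_resolve(star, {0}, rounds=10))
```

Starting from the poor solution `{0}`, the re-solve step finds the four leaves. On real inputs the gain per round is much smaller, which is why many rounds are needed.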

→ Reply

Conan_

3 years ago, # ^ |

How does the function that calculates a large anti-clique work?
It's already an NP problem.

→ Reply

kingmoshe

3 years ago, # ^ |

Basically, each time take the node with the fewest neighbors, and after taking it delete its neighbors. If two nodes have the same number of neighbors, look at the degrees of their neighbors: you want those degrees to be as big as possible (more precisely, we maximized the minimum degree among the neighbors; the idea is that a neighbor with a small degree has a good chance of being picked later). And of course we added some randomness along the way, so we could run the process many times and get different outcomes.
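One way to express this selection rule as a sort key. Where exactly the randomness enters is not spelled out above, so using it only as the final tie-break is an assumption, as are the names and the example graph:

```python
import random

def pick_next(adj, alive, rng):
    """Fewest neighbors first; among ties, prefer the largest minimum
    neighbor degree; remaining ties are broken at random."""
    def key(v):
        nbrs = adj[v] & alive
        min_nbr_deg = min((len(adj[u] & alive) for u in nbrs), default=0)
        return (len(nbrs), -min_nbr_deg, rng.random())
    return min(alive, key=key)

def randomized_greedy(adj, seed=0):
    rng = random.Random(seed)
    alive = set(range(len(adj)))
    chosen = set()
    while alive:
        v = pick_next(adj, alive, rng)
        chosen.add(v)
        alive -= adj[v] | {v}
    return chosen

# Illustrative graph (assumption): leaves 0, 2, 3 hang off vertex 1 (degree 3),
# leaves 4, 6 hang off vertex 5 (degree 2).
g = [{1}, {0, 2, 3}, {1}, {1}, {5}, {4, 6}, {5}]
print(randomized_greedy(g, seed=1))
```

Running `randomized_greedy` with different seeds and keeping the best result is the cheap way to exploit the randomness.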

→ Reply

halcyon_past

3 years ago, # ^ |

hey so I am a beginner and I have absolutely no idea what we have to do so if you don't mind could you explain? I am a student from first year so I don't know dsa or dynamic programming yet.

→ Reply

bcollet

3 years ago, # ^ |

+14

Hey, no dynamic programming or complicated DSA is required yet (although in the qualification round some may take advantage of it), but a little knowledge of graphs might help.

First, if that's your question and you are interested: if you haven't already done so, find a team of 2 to 4 people (including you) and register for Google Hash Code.

Second, you will be able to submit to the training problem, which I explain below in spoilers so that the post appears shorter.

subject

reading, scoring, printing

dumb solution

my strategy, ingredient-wise

best solution explained so far by kingmoshe

I wrote 500 lines while trying this problem. I hope it helps you and that I did not state the obvious too much. Don't hesitate to ask if you have any further questions that are not about a complete solution to the problem.

→ Reply

kingmoshe

3 years ago, # ^ |

what a great explanation!

→ Reply

yarden.porat

3 years ago, # |

Insights on the problem: in test case E the biggest clique has size only 3. After removing the cliques of size 3 (~60 of them), the biggest clique has size 2. Also, there are 68 clients of degree 0, which should always be chosen.

→ Reply

NVAL

3 years ago, # |

A: 2
B: 5
C: 5
D: 1804
E: 2083
Total: 3899

→ Reply

kingmoshe

3 years ago, # ^ |

cool, what was your solution?

→ Reply

NVAL

3 years ago, # ^ |

Update: D is 1805 points now.
Nothing interesting. A lot of random swaps in 10 threads for ~30 minutes

→ Reply

not_found404

3 years ago, # |

A: 2, B: 5, C: 4, D: 1697, E: 799. Total: 2507

→ Reply

Rhodoks

3 years ago, # |

+24

The optimal solution of D is 1805, which can be calculated using Binary Linear Programming.

However, E is too large. It took me about 1 hour to solve D, but I failed to solve E within a night.

→ Reply

H-H2648

3 years ago, # ^ |

How do you calculate it with binary linear programming? And can you calculate this for c and e as well?

→ Reply

Rhodoks

3 years ago, # ^ |

$$$choose[i]$$$: whether we choose ingredient $$$i$$$.

$$$satisfied[j]$$$: whether person $$$j$$$'s conditions are satisfied.

maximize $$$\sum_j satisfied[j]$$$

s.t. $$$\sum_{j~\text{like}~i} choose[i] + \sum_{j~\text{dislike}~i}~(1-choose[i]) \geq cnt_j \times satisfied[j]$$$ for $$$j = 1...n$$$, where $$$cnt_j$$$ is the number of $$$j$$$'s likes and dislikes.

When $$$j$$$ is served ($$$satisfied[j]=1$$$), the constraint becomes $$$\sum_{j~\text{like}~i} choose[i] + \sum_{j~\text{dislike}~i}~(1-choose[i]) \geq cnt_j$$$, so $$$choose[i]=1$$$ must hold for every $$$i$$$ that $$$j$$$ likes and $$$choose[i]=0$$$ for every $$$i$$$ that $$$j$$$ dislikes.

C's answer is 5. E has too many ingredients, so it would take too much time...
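To sanity-check the constraint on a toy instance, here is a small sketch that enumerates every binary $$$choose$$$ vector instead of calling a real solver, so it only works for a handful of ingredients. The instance and function names are made up for illustration:

```python
from itertools import product

def lhs(choose, likes, dislikes):
    """Left-hand side of the constraint for one client."""
    return sum(choose[i] for i in likes) + sum(1 - choose[i] for i in dislikes)

def best_by_enumeration(ingredients, clients):
    best = 0
    for bits in product((0, 1), repeat=len(ingredients)):
        choose = dict(zip(ingredients, bits))
        # For a fixed choose, the best satisfied[j] is 1 exactly when
        # lhs >= cnt_j holds (with satisfied[j] = 0 the constraint is trivial).
        score = sum(lhs(choose, l, d) >= len(l) + len(d) for l, d in clients)
        best = max(best, score)
    return best

# Illustrative instance (assumption): three clients, six ingredients.
ingredients = ["basil", "cheese", "mushrooms", "peppers", "pineapple", "tomatoes"]
clients = [
    ({"cheese", "peppers"}, set()),
    ({"basil"}, {"pineapple"}),
    ({"mushrooms", "tomatoes"}, {"basil"}),
]
print(best_by_enumeration(ingredients, clients))
```

Since the left-hand side can never exceed $$$cnt_j$$$, the check `lhs >= cnt_j` is the same as requiring every like to be on the pizza and every dislike to be off it.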

→ Reply

Amareelez

3 years ago, # ^ |

How specifically did you solve it? I mean, in BIP you cannot drop constraints, yet some of your constraints are going to be unsatisfied (we can't serve all the clients). Also, how did you express your objective in terms of the variables? Could you please give us more details about your approach?

→ Reply

Rhodoks

3 years ago, # ^ |

Sorry, there are some mistakes in my formula. I have fixed it and added some explanation. Thank you very much.

→ Reply

Eclecticity

3 years ago, # |

Our Scores: 2, 5, 5, 1687, 1849.
Our Approach: Randomized Greedy.

→ Reply

jeroenodb

3 years ago, # |

+26

Our scores: A: 2 B: 5 C: 5 D: 1805 E: 2085 Total: 3902

We first did a greedy graph algorithm for finding an approximate independent set (picking minimum degree node). Then we iteratively improved the solution with simulated annealing (basically random swapping and checking if the result becomes better). It found the solution of 2084 in E in 10 minutes, while A through D were found in a minute. During a testrun of the code, we scored 2085 in E, but we didn't implement writing the best solutions to a file, so this solution is lost forever.

Edit: Found another 2085 solution and saved the solution this time.

→ Reply

quinoa

3 years ago, # ^ |

How do you do swaps? Randomly picking a vertex you selected and one you did not select probably will give you too often a combination that is invalid?

→ Reply

jeroenodb

3 years ago, # ^ |

For the swaps we actually did two different things:

Swap a random pizza ingredient from "present on pizza" to "not present on pizza" or vice versa.
Pick a random person that is currently not satisfied and swap all ingredients which prevent him from being satisfied.

We did a 50/50 split between these two kinds of swaps, which seemed to get the best result.
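A simplified sketch of the two move types with the 50/50 split. Note that it accepts a move whenever the score does not drop, which is plain hill climbing rather than the simulated annealing we actually ran; the names and the toy instance are assumptions for illustration:

```python
import random

def is_satisfied(pizza, likes, dislikes):
    return likes <= pizza and not (dislikes & pizza)

def score(pizza, clients):
    return sum(is_satisfied(pizza, l, d) for l, d in clients)

def random_move(pizza, ingredients, clients, rng):
    new = set(pizza)
    unsatisfied = [c for c in clients if not is_satisfied(pizza, *c)]
    if rng.random() < 0.5 or not unsatisfied:
        # Move 1: flip one random ingredient on or off.
        new ^= {rng.choice(ingredients)}
    else:
        # Move 2: fix everything that blocks one random unsatisfied client.
        likes, dislikes = rng.choice(unsatisfied)
        new |= likes
        new -= dislikes
    return new

def local_search(ingredients, clients, steps=2000, seed=0):
    rng = random.Random(seed)
    pizza = set()
    best = score(pizza, clients)
    for _ in range(steps):
        cand = random_move(pizza, ingredients, clients, rng)
        s = score(cand, clients)
        if s >= best:  # accept equal scores too, to drift across plateaus
            pizza, best = cand, s
    return pizza, best

ingredients = ["basil", "cheese", "mushrooms", "peppers", "pineapple", "tomatoes"]
clients = [
    ({"cheese", "peppers"}, set()),
    ({"basil"}, {"pineapple"}),
    ({"mushrooms", "tomatoes"}, {"basil"}),
]
pizza, best = local_search(ingredients, clients)
print(best, sorted(pizza))
```

Rescoring the whole pizza on every step is far too slow for E; an incremental score that only revisits the clients touching the changed ingredients would be needed there.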

→ Reply

kinhosz

3 years ago, # |

A: 2 B: 5 C: 5 D: 1805 E: 2047

Total: 3864

→ Reply

spiralJava

3 years ago, # |

+19

My Hash Code team needs one more person. If anyone wants to participate in Google Hash Code but doesn't have a team yet, DM me!

→ Reply

AlexLorintz

3 years ago, # |

I accidentally misread the statement and thought of a variant of the problem where one person likes the pizza if it contains at least one of its favourite ingredients. Is this version easier or has an interesting solution?

→ Reply

bjy

3 years ago, # |

Looking for 1-2 last-minute replacements who can actually participate during the timeslot (apparently, timezones are hard). Python preferred but ultimately optional, visualization skills a plus! Got a score of 8,926,023 for last year's qualifying round which is definitely not a brag, but maybe that can speak to expectations (upper bound on stress level, let's say). Get in touch however you like, I will edit/update here if/when the spots are filled.

→ Reply

intrusiv

3 years ago, # ^ |

Got a score of 8,926,023 for last year's qualifying round

Care to explain how?

→ Reply

bjy

3 years ago, # ^ |

If you're familiar with the problem (archived on kaggle?), then you'll know that despite the number of digits, it's a fairly minimal score. I'd have more useful things to say about what not to do...

don't team up with random people off the competition's facebook page
don't solve an incorrect reading of the problem for a majority of the time
don't prematurely optimize out of 'slow python' fears
also don't be afraid of rewriting, especially if the thing that 'works' took you less than a majority of the time...?

Swung a bit too far the opposite way this year, reached out early to people I knew, and welp...

→ Reply

intrusiv

3 years ago, # ^ |

Thank you!

→ Reply

I don't know how to add tabulation

this code can become more efficient but I wanted it to be understandable

you can in particular try to convert ingredient in integer in order to make

the internal process faster and convert back at the end for the output

try to think to a better strategy alone, if you fail read the following for

explanation