Apparently, it's codeforces performance beats both o1-mini, o1 and deepseek r1 in codeforces rating:
It's price is on par with o1-mini, I guess they felt the heat from deepseek r1:
# | User | Rating |
---|---|---|
1 | jiangly | 3898 |
2 | tourist | 3840 |
3 | orzdevinwang | 3706 |
4 | ksun48 | 3691 |
5 | jqdai0815 | 3682 |
6 | ecnerwala | 3525 |
7 | gamegame | 3477 |
8 | Benq | 3468 |
9 | Ormlis | 3381 |
10 | maroonrk | 3379 |
# | User | Contrib. |
---|---|---|
1 | cry | 168 |
2 | -is-this-fft- | 165 |
3 | Dominater069 | 161 |
4 | atcoder_official | 159 |
4 | Um_nik | 159 |
6 | djm03178 | 157 |
7 | adamant | 153 |
8 | luogu_official | 151 |
9 | awoo | 149 |
10 | TheScrasse | 146 |
Name |
---|
o1 is claimed to be 1800 and it fails div2b, i'm suspicious of this
which round is it from may i ask?
I remember someone posting a blog about how o1-pro (the 200$ a month version) couldn't solve 2040B - Paint a Strip, but o3-mini can solve it :(
most of the rounds
I just tested it on all the problems that Deepseek R1 failed that I had tested (from https://codeforces.me/blog/entry/138735 ), it solved all of them (though it took 2 attempts on Maximum AND Queries (Easy version)). I also tested it on Paint a Strip, which o1-pro (which was 200$) wasn't able to solve.
I'm also on the free plan, meaning my o3-mini is on low compute (if it means anything, it also has way shorter wait times)
nvm
the edit...
It's nice to know that the lower rated problems aren't entirely screwed, though it's a bit nerve wracking seeing it solve problems on the free plan that it used to not be able to solve on the 200$ plan
Source
Note: o3-mini has already achieved a rating(allegedly) of 2130 (above cf master) with the setting set to high reasoning.