Блог пользователя SecondThread

Автор SecondThread, история, 15 месяцев назад, По-английски

Llama Code in CP

Today, Meta released Llama Code, a Large Language Model trained specifically for code. It's trained on Python, C++, Java, PHP, Typescript, C#, Bash. They are releasing 3 models with 7B, 13B and 34B parameters respectively. The smaller two allow for Fill-in-the-Middle.

It's free for research and commercial use, but obviously requires a solid amount of computing power if you want to run it locally.

You can read more about it here: https://about.fb.com/news/2023/08/code-llama-ai-for-coding/

I'd be interested in seeing if this becomes useful in CP. It seems like there's a lot of potential, specifically with code completion with Fill-in-the-middle. Imagine having vscode, hitting "F3", typing something like "SegTree with lazy prop, range set max, and range +=", and getting exactly the segment tree you need in two seconds.

I also think it may be useful because although LLMs are currently horrendous at the problem solving aspect of CP (similarly to how tools like MidJourney suck at hands), if you can highlight an area or chunk of code and be like "hey, this part's bad, try again for just this part", that might help improve weaknesses.

Just a thought.

  • Проголосовать: нравится
  • +129
  • Проголосовать: не нравится

»
15 месяцев назад, # |
  Проголосовать: нравится 0 Проголосовать: не нравится

This is pretty cool, sounds like codepilot but local

»
15 месяцев назад, # |
  Проголосовать: нравится +147 Проголосовать: не нравится

wen hacker cup

»
15 месяцев назад, # |
  Проголосовать: нравится 0 Проголосовать: не нравится

do you think AI will become better than human at CP eventually?

  • »
    »
    15 месяцев назад, # ^ |
      Проголосовать: нравится +13 Проголосовать: не нравится

    I think it'll be better than the average human within the decade. But (unlike chess) I don't think it'll be better than the best humans in limited-submission, 4+ hour contests until very close to full AGI (or never if we never hit AGI). I can't imagine the world having a scalable entity with that level of problem solving ability, planning, and creativity without imminent runaway risk.

»
15 месяцев назад, # |
  Проголосовать: нравится 0 Проголосовать: не нравится

tools like MidJourney suck at hands

Hands are not an archenemy of AI since inpainting and LoRAs became a thing(so like, for 8 months already), it's just that most people are using such tools to generate slop, and don't care about post production at all.

The current issue with AI generated art is that it looks like play-doh, and models don't really understand the lightning, which makes pictures look even worse. A good eye of a professional will immediately find the inconsistencies, and if you saw a lot of human art, you will see that art feels weird.

I am kinda hyped for what open source community will create, Llama models themselves are a bit mid, but as a baseline, finetuned Llama 1 was almost on par with ChatGPT 3.5

Also, I wonder about your opinion on LLMs used for substituting coders. Artists don't utilize the power of Stable Diffusion or MidJourney to create a good prototype of an image, that they can enhance by hand. Programmers are very different though, they already incorporate ChatGPT in their work a lot. It is very good at writing a lot of mid code, you basically have a power of 10+ coding monkeys. Do people use AI at your job?