Ask around and see if you can find anyone you know who's experienced the November 2025 effect. Claude Code / Codex with GPT-5.1+ or Opus 4.5+ really did make a material difference - they flipped the script from "can write code that often works" to "can write code that almost always works".
I know you'll dismiss that as the same old crap you've heard before, but it's pretty widely observed now.
I’ve been living this experience and using the latest models at work throughout this time. The failure modes of LLMs have not fundamentally changed. The makers aren’t terribly transparent about what exactly changes in each model release, the way you’d know what changed in, e.g., a new Django version. But there has been no paradigm shift. My guess (from the outside) is that the big change you think you’re experiencing is the result of many smaller things: better post-training (RLHF) so models run a predefined set of commands like always running tests, plus other marginal improvements and a tighter focus on programming tasks. To be clear, these improvements are welcome and useful, just not the groundbreaking change some claim.
the perimeter of the tasks the LLMs can handle continuously expands at a pretty steady pace
a year ago they could easily one-shot full-stack features in my hobby Next.js apps but imploded in my work codebase
as of Opus 4.6 they can now one-shot full features in a complex JS/Go data streaming & analysis tool but implode in low-latency voice synthesis systems (...for now...)
just depends on how you're using it (skill issues are a thing) and what you're working on
Yep, it may be an issue in Notepad, which doesn't have helpers like syntax highlighting, auto-indent, and line numbers. But I started with IDLE, which has all those things. So my next editors were Notepad++ and Code::Blocks.
There's no "outgroup", dude, it's just software. Stop anthropomorphizing it. We have more than enough real social problems without making up fake ones.
"Let's be nicer to the robots, winky face" is not a solution to this problem. It's just a tool, and this is a technical problem with technical solutions. All of the AI companies could change this behavior if they wanted to.
People are allowed to dislike it, ban it, boycott it. Despite what some very silly people think, the tech does not care about what people say about it.
I really only find it useful when I'm investigating or troubleshooting some system I'm not familiar with.
A stupid yet accurate analogy is I turn up the log level for my brain lol
It's basically just a log file of everything I did and the result so I can pick it back up later, plus I include timestamps which helps me realize when I'm spinning my wheels for too long.
For building stuff, scribbling diagrams and flows is more useful if I need to work out something complex.
I actually do build all of those things before standing something up in prod. Not doing that is insane. Literally every web framework has reasonable defaults baked in.
Any competent tech company will have canned ways to do all of those things that have already been reviewed and vetted
Why are you building and deploying a site critical enough to need CSP and user security & so on in one sitting lol
Anyways, yes, if I know I'm gonna need it? Because every framework has reasonable defaults or libraries for all of those things, and if you're in a corporate environment, you have vetted ways of doing them
1. import middleware.whatever
2. configure it
3. done
Like, you don't write these things unless you need custom behavior.
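The three steps above really are that small in practice. A minimal sketch, assuming a plain WSGI app (framework names like Django's SecurityMiddleware or Express's helmet are the real-world equivalents; the class and header value here are illustrative, not any specific library's API):

```python
# Hypothetical middleware sketch: "import it, configure it, done".
# Real frameworks ship a vetted version of this with sane defaults;
# you only write one yourself when you need custom behavior.

class SecurityHeadersMiddleware:
    def __init__(self, app, csp="default-src 'self'"):
        self.app = app
        self.csp = csp  # step 2: configure (or just accept the default)

    def __call__(self, environ, start_response):
        def _start(status, headers, exc_info=None):
            # Append the CSP header to every response.
            headers.append(("Content-Security-Policy", self.csp))
            return start_response(status, headers, exc_info)
        return self.app(environ, _start)

def app(environ, start_response):
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"hello"]

# steps 1 and 3: wrap the app, done.
app = SecurityHeadersMiddleware(app)
```

Every response now carries the policy header without touching any handler code, which is exactly why hand-rolling this from scratch in one sitting is rarely necessary.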
So far it's all been endless unfounded FOMO hype by people who have something to sell or podcasts to be on. I am so tired of it.