More

Philpax · 2026-03-21T17:33:04 1774114384

Apologies for the obligatory question, but what did you try to do, and with which AI did you try to do it with?

ErroneousBosh · 2026-03-21T18:32:32 1774117952

Well following advice from folk on here earlier, I thought I'd start small and try to get it to write some code in Go that would listen on a network socket, wait for a packet with a bunch of messages (in a known format) come in, and split those messages out from the packet.

I ended up having to type hundreds of lines of description to get thousands of lines of code that doesn't actually work, when the one I wrote myself is about two dozen lines of code and works perfectly.

It just seems such a slow and inefficient way to work.

timacles · 2026-03-21T20:43:06 1774125786

Hate to pull the skill issue card here, but that is a trivial problem that can be one shotted with almost any model with

smj-edison · 2026-03-21T22:46:04 1774133164

tbh that's not a helpful thing to say. I think a more productive thing would be to ask "What model are you using?" "Are you using it in chat mode or as a dedicated agent?" "Do you have an AGENTS.md or CLAUDE.md?"

I've also been underwhelmed with its ability to iterate, as it tends to pile on hacks. So another useful question is "did you try having it write again with what you/it learned?"

timacles · 2026-03-22T00:21:53 1774138913

Agreed was a bit rough. Yes they are not great at iterating and keeping long contexts, but you look at what he’s describing and you have to agree that’s exactly the type of problem llm excel at

Shouldn’t have to baby step through the basics when the author is clearly not interested in learning himself

smj-edison · 2026-03-22T00:39:38 1774139978

> Shouldn’t have to baby step through the basics when the author is clearly not interested in learning himself

I'd rather assume good faith, because when I first started using LLMs I was incredibly confused what was going on, and all the tutorials were grating on me because the people making the tutorials were clearly overhyping it.

It was precisely the measured and detailed HN comments that I read that convinced me to finally try out Claude, so I do my best to pay it forward :)

ErroneousBosh · 2026-03-21T22:55:39 1774133739

> I think a more productive thing would be to ask "What model are you using?" "Are you using it in chat mode or as a dedicated agent?" "Do you have an AGENTS.md or CLAUDE.md?"

In my case I'd have to say "Don't know, whatever VS Code's bot uses", and "no idea what those are or why I have to care".

smj-edison · 2026-03-22T00:17:49 1774138669

> Don't know, whatever VS Code's bot uses

The reason I ask about what model is I initially dismissed AI generated code because I was not impressed with the models I was trying. I decided if I was going to evaluate it fairly though, I would need to try a paid product. I ended up using Claude Sonnet 4.5, which is much better than the quick-n-cheap models. I still don't use Claude for large stuff, but it's pretty good at one-off scripts and providing advice. Chances are VS Code is using a crappy model by default.

> no idea what those are or why I have to care

For the difference between chat mode and agent mode, chat mode is the online interface where you can ask it questions, but you have to copy the code back and forth. Agent mode is where it's running an interface layer on your computer, so the LLM can view files, run commands, save files, etc. I use Claude in agent mode via Claude Code, though I still check and approve every command it runs. It also won't change any files without your permission by default.

AGENTS.md and CLAUDE.md are pretty much a file that the LLM agent reads every time it starts up. It's where you put your style guide in, and also where you have suggestions to correct things it consistently messes up on. It's not as important at the beginning, but it's helpful for me to have it be consistent about its style (well, as consistent as I can get it). Here's an example from a project I'm currently working on: https://github.com/smj-edison/zicl/blob/main/CLAUDE.md

I know there's lots of other things you can do, like create custom tools, things to run every time, subagents, plan mode, etc. I haven't ever really tried using them, because chances are a lot of them will be obsolete/not useful, and I'd rather get stuff done.

I'm still not convinced they speed up most tasks, but it's been really useful to have it track down memory leaks and silly bugs.

ErroneousBosh · 2026-03-21T22:54:42 1774133682

Okay, tell you what then. Help me learn.

The problem is that I want something that listens on a TCP connection for GD92 packets, and when they arrive send appropriate handshaking to the other end and parse them into Go structs that can be stuffed into a channel to be dealt with elsewhere.

And, of course, something to encode them and send them again.

How would I do that with whatever AI you choose?

I'm pretty certain you can't solve this with AI because there is literally no published example of code to do it that it can copy from.

timacles · 2026-03-22T00:17:56 1774138676

GD92 packets?

No idea what you’re talking about but if it has a spec then it doesn’t matter if it’s trained on it. Break the problem down into small enough chunks. Give it examples of expected input and output then any llm can reason about it. Use a planning mode and keep the context small and focused on each segment of the process.

You’re describing a basic tcp exchange, learn more about the domain and how packets are structured and the problem will become easier by itself. Llms struggle with large code bases which pollute the context not straightforward apps like this

smj-edison · 2026-03-22T00:41:00 1774140060

One other thing, it might be worthwhile having the spec fresh in the LLM's context by downloading it and pointing the agent at it. I've heard that that's a fruitful way to get it to refresh its memory.

timacles · 2026-03-22T00:49:26 1774140566

Yep you can even extract the relevant parts and put them into local files the llm can scan

Philpax · 2026-03-19T13:40:09 1773927609

Astral's tooling is excellent and almost makes up for Python being a badly-designed language. Almost.

yoyohello13 · 2026-03-19T15:36:20 1773934580

I work in Python every day and Astral's tools are really what made it bearable. This acquisition is so disappointing.

moralestapia · 2026-03-19T15:45:32 1773935132

Agree. As many others have expressed, uv and ruff have brought some sanity to the Python toolkit.

This is a massive backward step for the Python ecosystem, but it's not like a hundred-billion dollar company will care about that.

Philpax · 2026-03-17T20:12:47 1773778367

Yes, but the grandparent poster and I would agree that the parse is not that ambiguous/the meaning is easily inferred. The sentence states that the library is overlapped _and_ that overlap is available in better quality: it may seem contrived, but it reads as a rather natural collapse of an implicit conjunction to me.

Philpax · 2026-03-12T20:09:27 1773346167

There's not really an exact science to it, but manually-optimised code is usually more structured/systematic to make it easier for the human author to manage the dependencies and state across the board, while automatically-optimised code is free to arrange things however it would like.

As an example of the kinds of optimisations that the best human programmers were doing before compilers took over, see Michael Abrash's Black Book: https://www.phatcode.net/res/224/files/html/index.html - you can intuit how a human might organise their code to make the most of these while still keeping it maintainable.

Philpax · 2026-03-03T23:47:18 1772581638

If you asked a three-year-old a question that they proceeded to completely flub, would you then assume that all humans are incapable of answering questions correctly?

Nobody is arguing for the quality of the search overviews. The models that impress us are several orders of magnitude larger in scale, and are capable of doing things like assisting preeminent computer scientists (the topic of discussion) and mathematicians (https://github.com/teorth/erdosproblems/wiki/AI-contribution...).

Philpax · 2026-03-03T04:50:18 1772513418

I'm a Rust main, but this argument seems... incorrect? You would not need macros for Rust to remain a usable memory-safe language. They certainly make it easier, but they're not necessary. It would be perfectly possible to design a variant of Rust that gets you to 80-90% of Rust's usability, with the same safety, without macros.

kobebrookskC3 · 2026-03-03T16:13:35 1772554415

how would you implement https://doc.rust-lang.org/stable/std/pin/macro.pin.html without macros? a macro is used to shadow the original variable so that you can't move it (safely) after you pin it

steveklabnik · 2026-03-03T16:50:10 1772556610

Regular variable definition shadows. Macros expand to regular Rust code, they could always be replaced by the expanded body.

kobebrookskC3 · 2026-03-04T00:19:17 1772583557

yes, but the code inside is unsafe. the pin macro is like a safe function.

steveklabnik · 2026-03-04T17:44:08 1772646248

I'm not sure what that has to do with anything. The macro isn't what makes it safe. The unsafe code being properly written is.

kobebrookskC3 · 2026-03-05T00:41:03 1772671263

but without macros, how would you expose a safe interface?

  fn pin(x: T) -> Pin<&mut T> { ... }

would move the value

Philpax · 2026-03-05T05:59:34 1772690374

Your macroless variant of Rust would offer a safe builtin that does this. It doesn't need to be implemented with a macro.

jibal · 2026-03-03T20:38:05 1772570285

Since macros just expand into code, how could you imagine that a macro is ever necessary?

kobebrookskC3 · 2026-03-04T00:13:11 1772583191

the macro uses unsafe inside, so that's another instance of unsafe you'll need to check, whereas the pin macro is like a safe function

jibal · 2026-03-04T06:35:50 1772606150

Excellent goalpost moving! Congratulations!

kobebrookskC3 · 2026-03-04T13:21:36 1772630496

no, you just missed my point. expanding the implementation is not a safe abstraction. show me how you'd implement the functionality of the pin macro as a safe abstraction.

jibal · 2026-03-04T21:14:28 1772658868

I didn't miss that you totally changed the subject and now you're attacking a strawman. See Steve Klabnick's response to your other comment where you did this. Of course macros are good for encapsulation and abstraction, but that's a different subject--and note that the discussion was about Zig vs. Rust, and Zig has no macros so there's unencapsulated unsafe code all over the place.

I won't respond further.

kobebrookskC3 · 2026-03-05T00:56:53 1772672213

i was responding this claim

> It would be perfectly possible to design a variant of Rust that gets you to 80-90% of Rust's usability, with the same safety, without macros.

i then present an api that i think relies on macros to expose a safe api

> Of course macros are good for encapsulation and abstraction, but that's a different subject.

no it's not. exposing safe abstractions is pretty much rust's raison d'être

jitl · 2026-03-04T14:24:44 1772634284

everything in zig is unsafe and needs to be checked like rust unsafe, so…

Philpax · 2026-03-01T17:29:45 1772386185

It is presented as a Wikipedia article from the future describing a subculture of tomorrow. See also https://qntm.org/mmacevedo for another example of this genre.

Philpax · 2026-02-25T02:51:46 1771987906

Functionality-wise, it's great, but it's a buggy mess, and it seems to be getting worse with each release.

pudsbuds · 2026-02-25T08:16:15 1772007375

I've been using deletated Claude agents in vscode and it crashes so much it's insane... I switched to copilot Claude local agents and it works much better.

Idk about this whole vibe coding thing though... Well see what happens

pgwhalen · 2026-02-25T03:18:21 1771989501

I’m a heavy user for about four months now, and it’s definitely getting better for me. How would you say it’s getting worse?

Philpax · 2026-02-24T22:00:59 1771970459

The human operator controls what gets built. If they want to build Redis 2, they can specify it and have it built. If you can't take my word for it, take those of the creator of Redis: https://antirez.com/news/159

Philpax · 2026-02-20T19:27:32 1771615652

I wish some taste had been used in the choice of font. Courier New is rather unpleasant to look at for prose.

layer8 · 2026-02-20T21:21:12 1771622472

Or any monospace font, really.