Hacker News | amzin's comments

In progress: I’m writing a book about how the brain processes text in general, and news articles in particular.

I also created and maintain a Russian "newspeak" dictionary: https://github.com/alamzin/az/


Professional procrastinator here. All the tips in the post are common knowledge and, sadly, not very good (especially in the long run).

The only thing that works for many people is to skip the motivation part and embrace the rather uncomfortable principle of "action before motivation."

The flow state will come. I believe it arises independently of motivation. Motivation just tricks us into believing that everything we do should bring joy.

It will — but not right now; we need to dive in first.


Yeah, it's sort of the no-pain-no-gain approach, but I like it. My best hack is time logging. If I specifically label breaks, procrastination, or production failures, I at least feel like I'm more in control. I like the act of blogging about it, though; it never fails to pick me up knowing I'm not alone in suffering from procrastination, and I'm always happy to read thoughts about it.


Is there an FS that keeps only diffs in cloned files? It would be neat.


I wondered that too.

If we only have two files, A and its duplicate B with some changes as a diff, this works pretty well. Even if the user deletes A, the OS could just apply the diff to the file on disk, unlink A, and assign B to that file.

But if we have A and two different diffs B1 and B2, then try to delete A, it gets a little murkier. Either you do the above process and recalculate the diff for B2 to make it a diff of B1; or you keep the original A floating around on disk, not linked to any file.

Similarly, if you try to modify A, you'd need to recalculate the diffs for all the duplicates. Alternatively, you could do version tracking and have the duplicate's diffs be on a specific version of A. Then every file would have a chain of diffs stretching back to the original content of the file. Complex but could be useful.

It's certainly an interesting concept but might be more trouble than it's worth.
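To make the bookkeeping concrete, here's a toy sketch of the scheme described above (all names hypothetical, diffs modeled as per-line replacements rather than block-level deltas): deleting a base file promotes one clone to hold full content and rebases the sibling clones against it.

```python
# Toy model of "clones stored as diffs". A "diff" here is just
# {line_index: replacement_line}; a real FS would diff blocks/extents.

class CloneStore:
    def __init__(self):
        self.content = {}   # base file -> full list of lines
        self.parent = {}    # clone -> file it diffs against
        self.delta = {}     # clone -> {index: replacement line}

    def write_base(self, name, lines):
        self.content[name] = list(lines)

    def clone(self, base, name, delta):
        self.parent[name] = base
        self.delta[name] = dict(delta)

    def read(self, name):
        if name in self.content:
            return list(self.content[name])
        lines = self.read(self.parent[name])
        for i, line in self.delta[name].items():
            lines[i] = line
        return lines

    def delete_base(self, name):
        # Promote one clone to full content; rebase the others onto it.
        clones = [c for c, p in self.parent.items() if p == name]
        if clones:
            promoted = clones[0]
            full = self.read(promoted)
            for other in clones[1:]:
                other_full = self.read(other)
                self.delta[other] = {
                    i: line
                    for i, (line, p) in enumerate(zip(other_full, full))
                    if line != p
                }
                self.parent[other] = promoted
            self.content[promoted] = full
            del self.parent[promoted], self.delta[promoted]
        del self.content[name]
```

Note how deleting A with two clones forces exactly the recalculation mentioned above: B2's diff must be recomputed relative to the promoted clone B1.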


ZFS does this by de-duplicating at the block level, not the file level. It means you can do what you want without needing to keep track of a chain of differences between files. Note that de-duplication on ZFS has had issues in the past, so there is definitely a trade-off. A newer version of de-duplication sounds interesting, but I don't have any experience with it: https://www.truenas.com/docs/references/zfsdeduplication/
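The block-level idea is simple enough to sketch in a few lines (this illustrates the concept only, not ZFS's implementation): each block is keyed by a strong hash and stored once, so identical blocks across files cost nothing extra.

```python
# Minimal sketch of block-level dedup: identical blocks are stored
# once, keyed by their SHA-256 digest.

import hashlib

BLOCK = 4  # tiny block size for illustration; ZFS records are e.g. 128 KiB

class DedupPool:
    def __init__(self):
        self.blocks = {}   # digest -> block bytes (stored once)
        self.files = {}    # name -> ordered list of digests

    def write(self, name, data: bytes):
        digests = []
        for i in range(0, len(data), BLOCK):
            chunk = data[i:i + BLOCK]
            d = hashlib.sha256(chunk).hexdigest()
            self.blocks.setdefault(d, chunk)   # no-op if already stored
            digests.append(d)
        self.files[name] = digests

    def read(self, name) -> bytes:
        return b"".join(self.blocks[d] for d in self.files[name])
```

Because files are just lists of digests, no chain of diffs between files is needed: modifying one file simply points it at different blocks.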


VAST storage does something like this. Unlike most storage arrays, which identify identical blocks by hash and store them only once, VAST uses a content-aware hash, so hashes of similar blocks are also similar. It stores a reference block for each unique hash; when new data comes in and is hashed, the most similar existing block is used as the base for byte-level deltas. In practice this works extremely well.

https://www.vastdata.com/blog/breaking-data-reduction-trade-...
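A crude illustration of the idea (not VAST's actual algorithm): sketch each block as its set of byte 4-grams, find the stored reference block with the highest sketch overlap, and delta-encode against it. This toy version assumes equal-sized blocks so the delta can be byte-positional.

```python
# Similarity-based reduction sketch: similar blocks share a reference
# block plus a small byte-level delta.

def sketch(block: bytes, k: int = 4) -> set:
    """Set of k-grams as a cheap stand-in for a content-aware hash."""
    return {block[i:i + k] for i in range(len(block) - k + 1)}

class SimilarityStore:
    def __init__(self):
        self.refs = []     # [(sketch, reference block bytes)]
        self.items = []    # [(ref_index, {offset: byte})]

    def add(self, block: bytes, threshold: float = 0.5):
        s = sketch(block)
        best, best_sim = None, 0.0
        for i, (rs, _) in enumerate(self.refs):
            sim = len(s & rs) / len(s | rs)   # Jaccard similarity
            if sim > best_sim:
                best, best_sim = i, sim
        if best is not None and best_sim >= threshold:
            ref = self.refs[best][1]
            delta = {o: b for o, b in enumerate(block) if ref[o] != b}
            self.items.append((best, delta))
        else:
            self.refs.append((s, block))
            self.items.append((len(self.refs) - 1, {}))

    def get(self, idx: int) -> bytes:
        ref_i, delta = self.items[idx]
        out = bytearray(self.refs[ref_i][1])
        for o, b in delta.items():
            out[o] = b
        return bytes(out)
```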


That’s very interesting. Typically a Rabin fingerprint is used to identify identical chunks of data.

Identifying similar blocks, and maybe re-chunking within them, isn't something I've ever considered.
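For reference, the standard Rabin-style approach finds identical chunks by declaring a chunk boundary wherever a rolling hash over a sliding window matches a bit mask. Here's a toy version using a plain polynomial rolling hash as a stand-in for a true Rabin fingerprint; the key property is that boundaries depend only on local content, so they survive insertions elsewhere in the stream.

```python
# Toy content-defined chunking. A boundary is cut where the low bits
# of the rolling window hash are zero (MASK controls avg chunk size).

WINDOW, MASK, BASE, MOD = 16, 0x3F, 257, (1 << 31) - 1

def chunks(data: bytes):
    pow_w = pow(BASE, WINDOW - 1, MOD)
    h, start = 0, 0
    out = []
    for i, b in enumerate(data):
        if i >= WINDOW:
            # slide the window: drop the byte leaving it
            h = (h - data[i - WINDOW] * pow_w) % MOD
        h = (h * BASE + b) % MOD
        if i + 1 >= WINDOW and (h & MASK) == 0:
            out.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        out.append(data[start:])
    return out
```

Identical chunks found this way can then be deduplicated by strong hash, exactly as in fixed-block dedup, but without being defeated by shifted data.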


ZFS: "The main benefit of deduplication is that, where appropriate, it can greatly reduce the size of a pool and the disk count and cost. For example, if a server stores files with identical blocks, it could store thousands or even millions of copies for almost no extra disk space." (emphasis added)

https://www.truenas.com/docs/references/zfsdeduplication/


Not an FS, but SharePoint, attempting to mimic NTFS, does this within its content database(s).

https://www.microsoft.com/en-us/download/details.aspx?id=397...


That’s how APFS works; it uses delta extents for tracking differences in clones: https://en.wikipedia.org/wiki/Delta_encoding?wprov=sfti1#Var...


APFS shares blocks, so only the blocks that changed become unshared. Since a block is the smallest atomic unit (except maybe an inode) in an FS, that's the best granularity to expect.


With extent-based filesystems you can clone extents and then overwrite one extent and only that becomes unshared.
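The copy-on-write sharing described in these comments can be sketched as follows (a toy model of the general idea, not APFS internals): a clone shares all of its parent's refcounted blocks, and a write unshares only the block it touches.

```python
# Copy-on-write block sharing: clones share refcounted blocks;
# writing to a shared block copies just that block.

class CowFS:
    def __init__(self):
        self.store = {}     # block_id -> bytearray
        self.refs = {}      # block_id -> refcount
        self.files = {}     # name -> [block_id, ...]
        self.next_id = 0

    def create(self, name, blocks):
        ids = []
        for b in blocks:
            self.store[self.next_id] = bytearray(b)
            self.refs[self.next_id] = 1
            ids.append(self.next_id)
            self.next_id += 1
        self.files[name] = ids

    def clone(self, src, dst):
        # O(1) per block: just bump refcounts, copy nothing.
        self.files[dst] = list(self.files[src])
        for bid in self.files[dst]:
            self.refs[bid] += 1

    def write_block(self, name, idx, data):
        bid = self.files[name][idx]
        if self.refs[bid] > 1:              # shared -> copy on write
            self.refs[bid] -= 1
            new = self.next_id
            self.next_id += 1
            self.store[new] = bytearray(data)
            self.refs[new] = 1
            self.files[name][idx] = new
        else:                               # exclusive -> write in place
            self.store[bid][:] = data

    def read(self, name) -> bytes:
        return b"".join(bytes(self.store[b]) for b in self.files[name])
```

After one write to the clone, the two files still share every untouched block, which is exactly the "only that becomes unshared" behavior described above.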


Sent the link to my son who is an avid gamer.

Got a one-word response:

"Winnienet".

Couldn't have phrased it better.


And to clarify, the beginning is here: http://scp-wiki.wikidot.com/antimemetics-division-hub


> Gitlab has no Chinese or Russian employee so far -> so nobody got pressure or intimidate, ffs.

Actually, they do have employees there. There are at least five Russians and one Chinese employee listed here: https://about.gitlab.com/jobs/

There are no Russians in the role that was being debated, though.


Came to mention Notion.so too.

My email is a mess because I have to switch between several work and home mailboxes, and from time to time I just have to wait for an email too.

This is one of the reasons I avoid launching Notion.so.


Yandex.Factory is not a fund, actually. It is an investment program.

