> The practical upshot is a git commit hash is not enough to know you are dist...

sgjohnson · on Dec 31, 2024

> If you can give me a second git repo with such a commit containing different contents, I'll happily send you $10k USD, or donate it to a charity of your choice.

Calculating that SHA1 collision is going to be a bit more expensive than $10k, by a couple of orders of magnitude.

Finding it in the wild is improbable, but calculating it is definitely possible, and has been done before. http://shattered.io/

chippiewill · on Dec 31, 2024

Shattered didn't produce a collision for an arbitrary hash, it produced two documents with the same hash (which is a slightly easier problem, about 100,000x faster).

SHA1 is certainly insecure at this point, but not even close to trivially so.

bawolff · on Dec 31, 2024

Indeed. We can't even do this for md5, let alone sha1.

Preimage attacks are very different from collision attacks.
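
To make the difference concrete, here is a toy sketch (not an attack on real SHA-1). It assumes a hypothetical 16-bit hash made by truncating SHA-1, so both searches finish quickly. `BITS`, `toy_hash`, `find_collision`, and `find_preimage` are illustrative names, not from any library. A collision search may pick both inputs freely, while a preimage search has to hit one fixed value.

```python
import hashlib
from itertools import count

BITS = 16  # toy digest size, an assumption chosen so the demo runs fast


def toy_hash(data: bytes) -> int:
    """Return the first BITS bits of SHA-1(data) as an integer."""
    digest = int.from_bytes(hashlib.sha1(data).digest(), "big")
    return digest >> (160 - BITS)


def find_collision():
    """Birthday search: any two distinct inputs with equal toy hashes."""
    seen = {}
    for i in count():
        msg = b"msg-%d" % i
        h = toy_hash(msg)
        if h in seen:
            return seen[h], msg, i + 1  # (first input, second input, hashes computed)
        seen[h] = msg


def find_preimage(goal: int):
    """Preimage search: an input whose toy hash equals one fixed target."""
    for i in count():
        msg = b"try-%d" % i
        if toy_hash(msg) == goal:
            return msg, i + 1  # (input found, hashes computed)


if __name__ == "__main__":
    a, b, n_collision = find_collision()
    print("collision after", n_collision, "hashes:", a, b)
    target = toy_hash(b"some fixed document")
    m, n_preimage = find_preimage(target)
    print("preimage after", n_preimage, "hashes:", m)
```

For an n-bit hash, the generic collision search needs on the order of 2^(n/2) hashes and the generic preimage search on the order of 2^n, so at 16 bits you would expect a few hundred tries versus tens of thousands.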

codeflo · on Dec 31, 2024

That is enough to distribute malicious code though, at least in certain scenarios. Someone might create a setup where reviewers check/sign one version of the source code, and what gets distributed is another version with the same hash.

usr1106 · on Dec 31, 2024

Code review in the Linux kernel still happens by email to a large degree.

Further up in the contribution tree there is additional signing. Would that further complicate the insertion of a false commit? I am not convinced that signing is used all the way down to every contribution.

codeflo · on Dec 31, 2024

Linux probably has enough eyeballs on its source to make attacks like that unlikely anyway, but Git isn't just used by Linux.

ExoticPearTree · on Dec 31, 2024

Can you create a proof of concept and show it here?

codeflo · on Dec 31, 2024

What's your point?

ExoticPearTree · on Dec 31, 2024

My point is that you need to put the money where your mouth is.

sgjohnson · on Dec 31, 2024

You're talking about hundreds of millions of dollars to calculate that. "Put your money where your mouth is".

We know it's theoretically possible. And we also know that this theoretical possibility is within the reach of a couple of countries.

Terr_ · on Jan 1, 2025

If your goal was to prove that SHA1 collisions are unimportant, far too hard for any group to exploit within the next X years of processing improvements... That means math.

In contrast, this "challenge" stuff is just chasing outrage endorphins and internet points.

Think it through, and it's pointless. Any refusal or negative result is utterly compromised and confounded by things like: How trustworthy you appear; whether the amount is reasonable; whether the random commenter has the skillset, free time, and financial assets to try; whether they're part of a larger group they can recruit; etc.

ExoticPearTree · on Jan 1, 2025

My goal is to see the actual proof of concept that whatever the person I replied to claimed is feasible. Not the daily BS from security wannabes that start with "In certain scenarios it is possible to X and Y" and then never show proof.

"In certain scenarios I could be a ninja": it means absolutely nothing without proving that I actually have the skills and I could actually use them.

It is not pointless, but if you claim something show the proof.

Dylan16807 · on Jan 1, 2025

The math is the proof of concept when an attack costs that much money to pull off. Or the various papers that show successful attacks on reduced-round versions of the hash.

Do you not accept those? What would you accept as a proof of concept?

ExoticPearTree · on Jan 2, 2025

I expected a proof of concept for this statement:

_That is enough to distribute malicious code though, at least in certain scenarios. Someone might create a setup where reviewers check/sign one version of the source code, and what gets distributed is another version with the same hash._

Dylan16807 · on Jan 2, 2025

Well the proof of concept without actually having two colliding files is really simple, so I thought it was generally understood.

Here's the easiest to explain way: Upload the malicious version of the file to github. Send an innocuous patch to the kernel devs that creates a file with the same hash. It gets accepted, and anyone that downloads the kernel from github gets the malicious version. Done. That's a small fraction of linux downloaders, but this is just the proof of concept.
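
For context on why that works, here is a minimal sketch of how Git derives a blob's object ID in a default SHA-1 repository: the ID depends only on a short header plus the file bytes. `git_blob_id` is a hypothetical helper name; the header layout (`blob <size>\0`) is the standard loose-object format.

```python
import hashlib


def git_blob_id(content: bytes) -> str:
    """Object ID Git assigns to a blob in a SHA-1 repository."""
    # Git hashes "blob <decimal size>\0" followed by the raw file bytes.
    header = b"blob %d\x00" % len(content)
    return hashlib.sha1(header + content).hexdigest()


if __name__ == "__main__":
    # Should match `git hash-object` run on an empty file.
    print(git_blob_id(b""))
```

Two files whose header-prefixed bytes collide under SHA-1 would get the same ID, so a server that already holds one of them has no reason to store or serve the other. Note that the collision has to be computed over the prefixed bytes, not over the bare file contents.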

rurban · on Jan 3, 2025

A proof of concept became much easier with C11 unicode identifiers, and email patch review. You can trivially hide Cyrillic chars eg. between whitespace changes or other trivial "optimizations". Even without collisions.
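
A reviewer-side check for this is cheap. The sketch below assumes the patch arrives as unified-diff text and simply flags every non-ASCII character on added lines. `suspicious_chars` is a hypothetical helper, and a real project would probably want an allowlist for legitimate non-ASCII text in comments or strings.

```python
import unicodedata


def suspicious_chars(patch_text: str):
    """List non-ASCII characters found on added lines of a unified diff."""
    findings = []
    for lineno, line in enumerate(patch_text.splitlines(), start=1):
        # Only added lines matter; skip the "+++ b/file" header line.
        if not line.startswith("+") or line.startswith("+++"):
            continue
        # Columns are counted within the added text, after the leading "+".
        for col, ch in enumerate(line[1:], start=1):
            if ord(ch) > 127:
                name = unicodedata.name(ch, "UNKNOWN")
                findings.append((lineno, col, f"U+{ord(ch):04X}", name))
    return findings


if __name__ == "__main__":
    # The "a" in "value" below is U+0430, a Cyrillic lookalike.
    patch = "+++ b/x.c\n+int v\u0430lue = 1;\n+int ok = 2;\n"
    for finding in suspicious_chars(patch):
        print(finding)
```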

And with the current surge of GPUs even collisions are realistic now. The H100s are not doing much when not in training.

poincaredisk · on Jan 1, 2025

>which is a slightly easier problem, about 100,000x faster

Where did you get this number from? I was under the impression that this is completely infeasible (just like we can generate a collision for md5 in seconds, but we still can't do a preimage attack).
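
One possible source for the figure (a guess, not something the commenter confirmed): shattered.io describes the attack as more than 100,000 times faster than a brute-force birthday search. The arithmetic below assumes the roughly 2^63.1 SHA-1 computations reported by the SHAttered authors and the generic bounds for a 160-bit hash. That ratio compares two collision searches; the gap to a generic preimage search is far larger.

```python
# Work factors as log2 of the number of SHA-1 computations.
BIRTHDAY_LOG2 = 80.0    # generic collision bound for a 160-bit hash
SHATTERED_LOG2 = 63.1   # figure reported by the SHAttered authors
PREIMAGE_LOG2 = 160.0   # generic preimage bound for a 160-bit hash

# Speedup of SHAttered over a brute-force birthday collision search.
speedup_vs_birthday = 2 ** (BIRTHDAY_LOG2 - SHATTERED_LOG2)

# Remaining gap, in bits, between SHAttered and a generic preimage search.
gap_to_preimage_bits = PREIMAGE_LOG2 - SHATTERED_LOG2

if __name__ == "__main__":
    print(f"speedup vs birthday search: about {speedup_vs_birthday:,.0f}x")
    print(f"gap to generic preimage: about 2^{gap_to_preimage_bits:.1f}")
```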