social.kernel.org

☃️karolherbst☃️

"Fun bug of the month, mesa edition, episode may"

so if you do "uint64_t some_var = 1 << 31;" in C you get "0xffffffff80000000" as the value, because that's super obvious and not confusing at all.

It's pretty funny getting reminded how non-intuitive and broken C is from time to time.
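A minimal sketch of the widening step behind that value. It assumes the compiler evaluated `1 << 31` to `INT32_MIN`, as GCC does, and only reproduces the well-defined conversion that follows. The names `widen_int32` and `check_widening` are made up for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: converts an already-computed 32-bit signed value
 * to uint64_t, as the assignment to some_var does. */
static uint64_t widen_int32(int32_t value)
{
    /* Signed-to-unsigned conversion reduces the value modulo 2^64,
     * which for negative inputs looks like sign extension. */
    return (uint64_t)value;
}

/* Small self-check of the values discussed in the post. */
static void check_widening(void)
{
    /* INT32_MIN stands in for the result GCC gives for 1 << 31. */
    assert(widen_int32(INT32_MIN) == UINT64_C(0xffffffff80000000));
    assert(widen_int32(-1) == UINT64_MAX);
    assert(widen_int32(1) == UINT64_C(1));
}
```

The shift itself is deliberately left out of the sketch, since its result is not guaranteed by the standard.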

trilader

trilader@chaos.social

1 month ago

Reply to @karolherbst@chaos.social

@karolherbst For my understanding: that's default int promotion + sign extension when widening to 64 bits? Would 1L << 31L fix this, or are there other pitfalls with that?
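One pitfall with `1L`: `long` is only 64 bits wide on LP64 targets, and it is 32 bits on 64-bit Windows, so the suffix does not help there. The suffix on the shift count has no effect either, because the result type comes from the left operand. A hedged sketch of a portable alternative, using a hypothetical helper `bit64`:

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: returns a mask with only bit `index` set. */
static uint64_t bit64(unsigned index)
{
    /* Shifting by the type's width or more is undefined, so reject it. */
    assert(index < 64);
    /* UINT64_C(1) is unsigned and at least 64 bits wide, so the shift
     * never touches a sign bit and nothing gets sign-extended. */
    return UINT64_C(1) << index;
}
```

With this, `uint64_t some_var = bit64(31);` yields 0x80000000. Writing `(uint64_t)1 << 31` or `1ULL << 31` directly works the same way.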

☃️karolherbst☃️

karolherbst@chaos.social

1 month ago

Reply to @trilader@chaos.social

Edited 1 month ago

@trilader yeah sure, but any competent and modern language would type the constant to what's expected, not make it int32 by default, because that's just broken imho.

Like any new language doing that today would be considered broken on arrival.

trilader

trilader@chaos.social

1 month ago

Reply to @karolherbst@chaos.social

@karolherbst Yeah. Things like this make me think someone needs to invent -fbackwards-compatible-bs=off

Jann Horn

jann@infosec.exchange

1 month ago

Reply to @karolherbst@chaos.social

@karolherbst I think that's UB? see C99 6.5.7 "Bitwise shift operators" - the LHS is signed and the result of the computation is not representable in the result type

Jann Horn

jann@infosec.exchange

1 month ago

Reply to @jann@infosec.exchange

@karolherbst but apparently gcc has decided to not treat it as UB, except when using UBSAN: https://gcc.gnu.org/onlinedocs/gcc/Integers-implementation.html

☃️karolherbst☃️

karolherbst@chaos.social

1 month ago

Reply to @jann@infosec.exchange

Edited 1 month ago

@jann yeah technically it's UB, but there is only so much you can optimize with a 1-2 instruction pattern that it doesn't really matter in practice, because most impls will do the same (more or less).

Like there is UB and then there is UB.

Jann Horn

jann@infosec.exchange

1 month ago

Reply to @karolherbst@chaos.social

@karolherbst yeah, I guess my point is that, for the code you showed, a C compiler would be well within its rights to refuse to build that code or complain about it, so this is not entirely the language's fault

Andrea (Drea) Tamar Pinski

pinskia@hachyderm.io

1 month ago

Reply to @jann@infosec.exchange

@jann @karolherbst

It was not UB in C90. That is why it is not treated as UB without ubsan ...

☃️karolherbst☃️

karolherbst@chaos.social

1 month ago

Reply to @jann@infosec.exchange

@jann ohh it's totally the language's fault even if it weren't UB, because that's just the worst way to specify this.

Like it's just a design bug really. And whether this is UB or not won't change that.

David Chisnall (Now with 50% more sarcasm!)

david_chisnall@infosec.exchange

1 month ago

Reply to @karolherbst@chaos.social

@karolherbst @jann

It’s UB in the general case because, if the operand is not a constant, you want to lower it to a shift instruction, but C works with targets that have different number representations. Ones’ complement, two’s complement, and an explicit sign bit are all permitted, but all of these will give different behaviours if you flip the top bit.

For wider shifts, different ISAs had different semantics for shifts wider than the register, so C made that fully undefined.

This combination lets you lower source-level shifts to a shift instruction.

C also doesn’t mandate that this be constant evaluated unless the result is used as a constant, so there’s no way to force implementations to diagnose the UB at compile time for this case. But, as a QoI issue, it is permitted and compilers should.
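For shifts with a non-constant count, one way to stay clear of the undefined cases is to use an unsigned operand and check the count explicitly. A minimal sketch, where the helper names and the choice to return 0 for oversized counts are assumptions made for illustration:

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: left shift with a defined result for every count. */
static uint64_t shl64_or_zero(uint64_t value, unsigned count)
{
    /* An unsigned operand has no sign bit to overflow into, and the
     * explicit range check avoids shifting by the full width or more. */
    return count < 64 ? value << count : 0;
}

/* Small self-check of the edge cases. */
static void check_shl64(void)
{
    assert(shl64_or_zero(1, 31) == UINT64_C(0x80000000));
    assert(shl64_or_zero(3, 63) == UINT64_C(0x8000000000000000));
    assert(shl64_or_zero(1, 64) == 0);
}
```

The cost is one comparison per shift, which is the price of not relying on whatever the target's shift instruction happens to do.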
