How a 20 year old bug in GTA San Andreas surfaced in Windows 11 24H2

(cookieplmonster.github.io)

1371 points by yett 8 months ago

314 comments

View on Hacker News

bombcar 8 months ago

This is the kind of thing I'd expect from Raymond Chen - which is extremely high praise!

I'm glad they tracked it down even further to figure out exactly why.

Reply View 16 replies

aneutron 8 months ago

Or randomascii. A freaking legend (although he had a heart braking streak of bad events ... I wish him the best)

Reply View | 6 replies
- Izikiel43 8 months ago
  
  What happened to him?
  
  Reply View | 5 replies
  
  wiseowise 8 months ago
  
  https://randomascii.wordpress.com/2024/10/01/life-death-and-...
  https://randomascii.wordpress.com/2016/10/17/vestibular-dysf...
  
  Reply View | 4 replies
martinsnow 8 months ago

Raymond is a wizard. Read his blogs for many years and love his style and knowledge.

Reply View | 8 replies
- Discordian93 8 months ago
  
  He's a total legend, yet apparently he's never met Bill Gates in person from what he said in an interview in the Dave's Garage YouTube channel a few years ago. You'd think that someone who's been that prominent for so long in the company would have been invited to a company dinner where he was present or something.
  
  Reply View | 2 replies
  
  bombcar 8 months ago
  
  Microsoft's a big company, and billg "stepped down" in 2000. Raymond is still working, so they overlap less than may appear.
  
  Reply View | 1 reply
  
  iforgotpassword 8 months ago
  
  He has stories on his blog about windows 2 iirc, so there was an overlap from a time where they were still relatively small. So I think it's a bit odd they never talked or met.
  
  Reply View | 0 replies
- MattSayar 8 months ago
  
  Small thing but I love the effort he puts into actually coding up his examples instead of screenshots. For example: https://devblogs.microsoft.com/oldnewthing/20250414-00/?p=11...
  He has many better ones but that's the latest one I've seen
  
  Reply View | 0 replies
- RcouF1uZ4gsC 8 months ago
  
  Raymond knows everything. From microcode bugs on Alpha AXP to template meta programming to UI.
  
  Reply View | 3 replies
  
  transcriptase 8 months ago
  
  I wonder how many times a Deloitte, PwC, KPMG, Bain, EY, McKinsey, or BCG consultant naively tried putting him on a shortlist for being “impacted” over the years because he was in the Top X of a spreadsheet sorted on Y.
  
  Reply View | 1 reply
  
  billforsternz 8 months ago
  
  "Look this guy's job seems to be mainly writing blog posts. We could replace that with AI and get it to regularly pitch the new Visual Enshitify 2.0 product launch as a bonus. Win win win!"
  
  Reply View | 0 replies
  
  gosub100 8 months ago
  
  [flagged]
  
  Reply View | 0 replies

amenghra 8 months ago

IMHO, if something isn’t part of the contract, it should be randomized. Eg if iteration order of maps isn’t guaranteed in your language, then your language should go out of its way to randomize it. Otherwise, you end up with brittle code: code that works fine until it doesn’t.

Reply View 50 replies

bri3d 8 months ago

There are various compiler options like -ftrivial-auto-var-init to initialize uninitialized variables to specific (or random) values in some situations, but overall, randomizing (or zeroing) the full content of the stack in each function call would be a horrendous performance regression and isn't done for this reason.

Reply View | 10 replies
- neuroelectron 8 months ago
  
  There are fast instructions (e.g., REP STOSx, AVX zero stores, dc zva) and tricks (MTE, zero pages), but no magic CPU instruction exists that transparently and efficiently randomizes or zeros the stack on function calls. You think there would be one and I bet there are on some specialized high-security systems, but I'm not sure even where you would find such a product. Telecom certainly isn't it.
  
  Reply View | 7 replies
  
  db48x 8 months ago
  
  There are proposed cpu architectures that work that way, like the Mill <https://millcomputing.com/>. Where most cpus support multiple calling conventions the Mill enforces a single calling convention in hardware. There is a hardware `call` instruction that does all the work directly, along with a corresponding `ret` instruction for returning from a function call. It also uses its equivalent of the TLB to ensure that each function is only granted permission to read from that portion of the stack which contains its arguments; any attempt to read outside that region would result in a permission error that causes the read to return a NaR (Not a Result, akin to a floating point NaN).
  As an additional protection, new stack frames are implicitly zeroed as they are created. I assume this is done by filling the CPU cache with zeros for those addresses before continuing to execute the called function. No need to wait for actual zeros to be written to main memory.
  https://millcomputing.com/wiki/Protection#Protecting_Stacks
  
  Reply View | 2 replies
  
  mjevans 8 months ago
  
  You couldn't do random, but with a predictable performance hit to memory, cache and write-line use stack addresses COULD be isolated for a program, for a library, etc.
  It'd be expensive though; every context switch would require it's own stack and pushing / restoring one more register. There's GOOD reason programs don't work that way and are supposed to not rely on values outside of properly initialized (and not later clobbered) memory.
  
  Reply View | 2 replies
  
  dwattttt 8 months ago
  
  CPUs already special case xor reg,reg as zeroing out the register, breaking any data dependency on it. If zeroing bits of the stack were common enough, I'd believe CPUs could be made that handled it efficiently (they already special case the stack; push/pop)
  
  Reply View | 0 replies
- smarks 8 months ago
  
  I'm a bit distant from this stuff, but it looks like C++26 will have something like -ftrivial-auto-var-init enabled by default. See the "safe by default" section of [1].
  For reference, the actual proposal that was accepted into C++26 is [2]. It discusses performance only in general, and it refers to an earlier analysis [3] for more details. This last reference describes regressions of around 0.5% in time and in code size. Earlier prototypes suggested larger regressions (perhaps even "horrendous") but more emphasis on compiler optimizations has brought the regression down considerably.
  Of course one's mileage may vary, and one might also consider a 0.5% regression unacceptable. However, the C++ committee seems to have considered this to be an acceptable tradeoff to remove a frequent cause of undefined behavior from C++.
  [1]: https://herbsutter.com/2024/08/07/reader-qa-what-does-it-mea...
  [2]: https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2024/p27...
  [3]: https://open-std.org/jtc1/sc22/wg21/docs/papers/2023/p2723r1...
  
  Reply View | 0 replies
- canucker2016 8 months ago
  
  Microsoft's Visual C++ compiler has the /Ge compiler option ( see https://learn.microsoft.com/en-us/cpp/build/reference/ge-ena... ) Deprecated since VC2005.
  This compiler option causes the compiler to emit a call to a stack probe function to ensure that a sufficient amount of stack space is available.
  Rather than just probe once for each stack page used, you can substitute a function that *FILLS* the stack frame with a particular value - something like 0xBAADF00D - one could set the value to anything you wanted at runtime.
  This would get you similar behaviour to gcc/clang's -ftrivial-auto-var-init
  Windows has started to auto-initialize most stack variables in the Windows kernel and several other areas.
  The following types are automatically initialized: Scalars (arrays, pointers, floats) Arrays of pointers Structures (plain-old-data structures) The following are not automatically initialized: Volatile variables Arrays of anything other than pointers (i.e. array of int, array of structures, etc.) Classes that are not plain-old-data During initial testing where we forcibly initialized all types of data on the stack we saw performance regressions of over 10% in several key scenarios. With POD structures only, performance was more reasonable. Compiler optimizations to eliminate redundant stores (both inside basic blocks and between basic blocks) were able to further drop the regression caused by POD structures from observable to noise-level for most tests. We plan on revisiting zero initializing all types (especially now that our optimizer has more powerful optimizations), we just haven’t gotten to it yet.
  see https://web.archive.org/web/20200518153645/https://msrc-blog...
  
  Reply View | 0 replies
frollogaston 8 months ago

Randomization at this level would be too expensive. There are tools that do this for debug purposes, and your stuff runs a lot slower in that mode.

Reply View | 5 replies
- throwaway2037 8 months ago
  
  I had to Google to find the tid bit that I read about Perl years ago. I think this will affect iteration order of dicts.
  > Nov 22, 2012 — Perl 5.18 will introduce per process hash randomization and almost certainly will feature a new hash function.
  
  Reply View | 0 replies
- foxhill 8 months ago
  
  it probably shouldn’t be a “release” thing. actually, certainly. i do wonder how many bugs would never have seen the light of day, if someone’s “set” actually turned out to be a sequence (i.e. allowed duplicate values) resulting in a debug build raising an assert.
  
  Reply View | 3 replies
  
  Arainach 8 months ago
  
  Debug builds are worthless for catching issues. How many people actually run them? Perhaps developers run debug builds of individual binaries they're working on when they're trying to repro a bug, but my experience at every company of every size and position in the stack (including the Windows team) is that no one does their general purpose use on a debug build.
  
  Reply View | 2 replies
abnercoimbre 8 months ago

Regarding contracts, there's an additional lesson here, quoting from the source:
> This is an interesting lesson in compatibility: even changes to the stack layout of the internal implementations can have compatibility implications if an application is bugged and unintentionally relies on a specific behavior.
I suppose this is why Linux kernel maintainers insist on never breaking user space.

Reply View | 1 reply
- cylemons 8 months ago
  
  But the linux equivalent here would be glibc, not the kernel
  
  Reply View | 0 replies
tantalor 8 months ago
Nope. You have to remember https://www.hyrumslaw.com/
With a sufficient number of users of an API, it does not matter what you promise in the contract: all observable behaviors of your system will be depended on by somebody.
If you promise randomization, then somebody will depend on that :)
And then you can never remove it!
Reply View | 6 replies
- scott_w 8 months ago
  
  Semi-related: this type of thing is actually covered in the Site Reliability Engineering book by Google. They highlighted a case of a system that outperformed its SLO, so people depended on it having 100% uptime. They "fixed" this by injecting errors to go closer to their SLA, forcing downstream engineers to deal with the fact that the dependent services would sometimes fail for no reason.
  I know it's easier said than done everywhere, just found it to be an interesting parallel.
  
  Reply View | 0 replies
- timewizard 8 months ago
  
  > If you promise randomization
  You don't. You say the order is undefined.
  
  Reply View | 4 replies
  
  __float 8 months ago
  
  That isn't the point. In practice, if you provide randomness, it will be depended upon.
  
  Reply View | 2 replies
  
  dwattttt 8 months ago
  
  You can randomly not randomise it :)
  
  Reply View | 0 replies
ormax3 8 months ago

one might argue that one of the advantages of languages like C is that you only pay for the features you choose to use, no unnecessary overhead like initializing unused variables

Reply View | 4 replies
- nayuki 8 months ago
  
  You can pay for those features in debug mode or in chaos monkey mode. It's okay to continue to not pay for them in release mode. Heck, Rust has this approach when it comes to handling integer overflow - fully checked in debug mode, silent wraparound in release mode.
  
  Reply View | 2 replies
  
  irundebian 8 months ago
  
  In Ada you can pay for integer overflow checks (runtime) if you want to. With Ada SPARK you can prove that your code does not contain integer overflows so that you don't need runtime checks.
  
  Reply View | 1 reply
  
  johnisgood 8 months ago
  
  And you can disable these checks with a flag when it comes to Ada, and yeah, with SPARK, none of it happens at runtime.
  Check the table at https://docs.adacore.com/spark2014-docs/html/ug/en/usage_sce..., look for "SPARK builds on the strengths of Ada to provide even more guarantees statically rather than dynamically.".
  More reading:
  https://docs.adacore.com/spark2014-docs/html/ug/en/tutorial....
  https://learn.adacore.com (many books for learning Ada and SPARK) available in PDF, EPUB, and HTML format.
  
  Reply View | 0 replies
- pjc50 8 months ago
  
  However, the compiler does not tell you this. We're back to the problem that it's possible to have a "working" C program that relies on UB and will therefore break at some point, but the tools will not yell at you for doing this. Whereas in Java or C# you get warnings or errors for using maybe-uninitialized variables.
  Also, scanf should be deprecated. Terrible API. Never use scanf or sscanf etc. We managed to get "gets()" deprecated, time to spread that to other parts of the API.
  atoi() or atof() etc. work OK, but really you need a parser.
  
  Reply View | 0 replies
willcipriano 8 months ago

Then you are wasting runtime clock cycles randomizing lists.

Reply View | 6 replies
- Cthulhu_ 8 months ago
  
  Not necessarily; you can do a thing where it's randomized during development, testing and fuzzing but not in production builds or benchmarks so that the obvious "I rely on internal map order" bugs are spotted right away.
  
  Reply View | 0 replies
- wat10000 8 months ago
  
  You can get it pretty much for free by using a random salt with your hash function. This is also useful for avoiding DOS attacks using deliberate hash collisions to trigger quadratic behavior in your hash tables.
  
  Reply View | 0 replies
- nayuki 8 months ago
  
  Any sane language would design a list iterator to follow the order of the list. No, the difference is when you're iterating over orderless hash-based sets or maps/dictionaries. Many languages choose to leave the iteration order undefined. I think Python did that up to a point, but afterward they defined dictionaries (but not sets) to be iterated over in the order that keys were added. Also, some languages intentionally randomize the order per program run, to avoid things like users intentionally stuffing hash tables with colliding keys.
  
  Reply View | 3 replies
  
  masklinn 8 months ago
  
  > Also, some languages intentionally randomize the order per program run, to avoid things like users intentionally stuffing hash tables with colliding keys.
  Most modern langages do that as part of hashdos mitigation, Python did that until it switched to a naturally ordered hashmap, then made insertion order part of the spec. Importantly iteration order remains consistent with a process (possibly on a per-hashmap basis).
  Notably, Go will randomise the starting point of hashmap iteration on each iteration.
  
  Reply View | 0 replies
  
  mabster 8 months ago
  
  Best change ever, that. Now it would also be nice if sets were ordered too.
  
  Reply View | 1 reply
  
  throwaway2037 8 months ago
  
  I am pretty sure there are trivial impls of sets with guaranteed iteration order in Python that use an underlying ordered map and a dummy value in each entry.
  
  Reply View | 0 replies
gzalo 8 months ago

I agree, this can also detect brittle tests (e.g, test methods/classes that only pass if executed in a particular order). But applying it for all data could be expensive computation-wise

Reply View | 0 replies
mras0 8 months ago

Not really the ethos of C(++), though of course this particular bug would be easily caught by running a debug build (even 20 years ago). However, this being a game "true" debug builds were probably too slow to be usable. That was at least my experience doing gamedev in that timeframe. Then again code holding up for 20 years in that line of biz is more than sufficient anyway :)

Reply View | 1 reply
- mabster 8 months ago
  
  When I was doing gamedev about 5 years ago, we were still debugging with optimisation on. You get a class of bugs just from running in lower frame rates that don't happen in release.
  
  Reply View | 0 replies
plutaniano 8 months ago

Aren't you just creating another contract? Users might write code that depends on it being random.

Reply View | 2 replies
- Artoooooor 8 months ago
  
  Maybe it would be good to change all non promised things between releases. So that such unwritten rules never become something users rely upon.
  
  Reply View | 0 replies
- tantalor 8 months ago
  
  For those users, do this instead: https://xkcd.com/221/
  
  Reply View | 0 replies
codebje 8 months ago

I once updated a little shy of 1mloc of Perl 5.8 code to run on Perl 5.32 (ish). There were, overall, remarkably few issues that cropped up. One of these issues (that showed itself a few times) was more or less exactly this: the iteration order through a hash is not defined. It has never been defined, but in Perl 5.8 it was consistent: for the same insertion order of the same set of keys, a hash would always iterate in the same way. In a later Perl it was deliberately randomised, not just once, but on every iteration through the hash.
It turned out there a few places that had assumed a predictable - not just stable, but deterministic - hash key iteration order. Mostly this showed up as tests that failed 50% of the time, which suggested to me a rough measure of how annoying an error is to track down is inversely correlated with how often the error appears in tests.
(Other issues were mostly due to the fact that Perl 5 is all but abandoned by its former community: a few CPAN modules are just gone, some are so far out of date that they can't be coerced to still work with other modules that have been updated over time. )

Reply View | 2 replies
- zerr 8 months ago
  
  At booking.com? :)
  
  Reply View | 1 reply
  
  codebje 8 months ago
  
  No, but they do (did?) have a vast ocean of Perl, and I did know a hacker or two who got hired to work there on it.
  
  Reply View | 0 replies
roseway4 8 months ago

iirc, Go intentionally randomizes map ordering for just this reason.

Reply View | 2 replies
- withinboredom 8 months ago
  
  Yep, and then you get crash reports you can’t reproduce.
  
  Reply View | 1 reply
  
  cristaloleg 8 months ago
  
  Same can be said about pointer addresses (random for each run). But ASLR exists for a specific reason.
  
  Reply View | 0 replies

jandrese 8 months ago

> Not ignore the compilation warnings – this code most likely threw a warning in the original code that was either ignored or disabled!

What compiler error would you expect here? Maybe not checking the return value from scanf to make sure it matches the number of parameters? Otherwise this seems like a data file error that the compiler would have no clue about.

Reply View 17 replies

kristianp 8 months ago

Trying g++ version 11.4, there's no warning by default if you don't check the return value of sscanf. Even `g++ -Wall -Wextra -Wunused-result` produces no warnings for a small example.

Reply View | 0 replies
burch45 8 months ago

Undefined behavior to access the uninitialized memory. A sanitizer would have flagged that.

Reply View | 14 replies
- jandrese 8 months ago
  
  The compiler has no way of knowing that the memory would be undefined, not unless it somehow can verify the data file. The most I think it can do is flag the program for not checking the return value of scanf, but even that is unlikely to be true since the program probably was checking for end of file which is also in the return value. It was failing to check the number of matched parameters. This is the kind of error that is easy to miss given the semantics of scanf.
  
  Reply View | 12 replies
  
  nayuki 8 months ago
  
  > The compiler has no way of knowing that the memory would be undefined
  Yes it would. -fsanitize=address does a bunch of instrumentation - it allocates shadow memory to keep track of what main memory is defined, and it checks every read and write address against the shadow memory. It is a combination of compile-time instrumentation and run-time checking. And yes, it is expensive, so it should be used for debugging and not the final release.
  https://clang.llvm.org/docs/AddressSanitizer.html , https://learn.microsoft.com/en-us/cpp/sanitizers/asan?view=m...
  
  Reply View | 9 replies
  
  andrewmcwatters 8 months ago
  
  Uninitialized variables are a really common case.
  
  Reply View | 1 reply
  
  gmueckl 8 months ago
  
  The pointer to the uninitialized variable is passed to scanf, which writes a value there unless it encounters an error. The compiler cannot understand this contract from the scanf declaration alone.
  
  Reply View | 0 replies
- andrewmcwatters 8 months ago
  
  Yeah, the debugging here is great, but the actual cause is super mild.
  
  Reply View | 0 replies
phire 8 months ago

Good point. When reading, I kind of just assumed the "use of initialised memory" warning would pick this up.
But because the whole line is parsed in a single sscanf call, the compiler's static analysis is forced to assume they have now initialised. There doesn't seem to be any generic static analysis approach that can catch this bug.
Though... you could make a specialised warning just for scanf that forced you to either pass in pre-initilized values or check the return result.

Reply View | 0 replies

maz1b 8 months ago

I always enjoy reading deeply technical writeups like these. I only wonder how much more rare they may or may not get in the AI era.

Reply View 9 replies

Cthulhu_ 8 months ago

I don't think they will get more rare; there will always be a top % of engineers that do deep dives. I hope anyway.
But AI won't replace them, nor did the past 50+ years of software development innovation. There's millions (tens of millions?) of higher programming language developers that don't know the difference between stack or heap besides maybe some theory they half remember from school but they don't care because they don't have to think about it for their day job.

Reply View | 2 replies
- throwaway2037 8 months ago
  
  If your whole career will be using higher order languages with very little data stored on stack (vs heap), why should those programmers care? It seems like normal progression of more abstraction in the tools that we use. Similarly, I have programmed a lot of C and C++ in my career and I never once need assembly language. (I am expecting someone to pop in the convo here and tell me about how I am a terrible C/C++ programmer because I don't know any assembly.)
  
  Reply View | 1 reply
  
  chrz 8 months ago
  
  Why should I care is a awful catchohrase.
  
  Reply View | 0 replies
senda 8 months ago

i think the shift will be from craftmens to trademens in regards to general software engineers, but these are type of writes up stem of a artisan style all to its own.

Reply View | 5 replies
- eduardofcgo 8 months ago
  
  We have been seeing this shift for a while, where "software engineers" graduate from 3 month bootcamps. Except now most likely they will not be earning 500k making crud apps.
  
  Reply View | 2 replies
  
  sitzkrieg 8 months ago
  
  and thats a good thing
  
  Reply View | 0 replies
  
  throwaway2037 8 months ago
  
  I call bullshit. What 3mo bootcamp grads were earning 500k writing CRUD apps? Zero.
  
  Reply View | 0 replies
- throwaway2037 8 months ago
  
  What about the incredible front end Devs that only know JS/CSS/HTML? They can still be true craftspeople in their art, be it cross-browser/platform issues or performance tweaking.
  
  Reply View | 0 replies
- nonethewiser 8 months ago
  
  Compare python devs of today to fortran devs of the 60s. Something like that distance. Maybe more. But the trend isnt new.
  
  Reply View | 0 replies

adzm 8 months ago

I'm more curious in what changed with the critical section locking/unlocking implementation in this version of Windows!

Reply View 6 replies

mjevans 8 months ago

It looks like the utilized stack, or a stack protection area, increased.

Reply View | 5 replies
- asveikau 8 months ago
  
  When I worked at Microsoft and I had downtime I would sometimes read the code for app compatibility shims out of pure curiosity.
  Win9x video games that made bad assumptions about the stack were a theme I saw. One of the differences between win9x and NT based windows is that kernel32 (later kernelbase) is a now user mode wrapper atop ntdll, whereas in the olden days kernel32 would trap directly into the kernel. This means that kernel32 uses more user mode stack space in NT. A badly behaving app that stored data to the left of the stack pointer and called into kernel32 might see its data structures clobbered in NT and not in 9x. So there were compatibility hacks that temporarily moved the stack pointer for certain apps.
  
  Reply View | 4 replies
  
  tom_ 8 months ago
  
  I wonder how many people think of the call stack as running left to right, most recent return first, rather than top to bottom, likewise? If you stare at enough hex dumps, it makes perfect sense.
  
  Reply View | 0 replies
  
  hoten 8 months ago
  
  What was the testing like for such bugs? Is it somehow automated, or is there a lengthy doc describing the manual testing steps, or are there no tests at all?
  
  Reply View | 2 replies

rossant 8 months ago

Am I the only one to be annoyed by this...?

while (this->m_fBladeAngle > 6.2831855) { this->m_fBladeAngle = this->m_fBladeAngle - 6.2831855; }

Like, "let's just write a while loop that could turn into an infinite loop coz I'm too lazy to do a division"

Reply View 16 replies

nemothekid 8 months ago

I want to assume that the GTA developers did this hack because it was faster than floating point division on the Playstation 2 or something.
But knowing they were able to they were able to blow up loading GTA5 by 5 minutes by just parsing json with sscanf, I don't have much hope.

Reply View | 6 replies
- badsectoracula 8 months ago
  
  IIRC the whole parsing performance issue was because the original code was written for the SP campaign of GTA5 that only had a handful of objects to parse data for. That was barely a blip in terms of performance impact and AFAIK was written years before GTAOnline was made (where it became an issue - and even then only became an issue much after GTAOnline was first made).
  Writing some simple code that works with the data you expect to have without bothering with optimizations is fine, if anything it is one of the actual cases of "premature optimization": even with profiling no real time is spent on that code, your data wont make it spend any time and you should avoid wild guesses since chances are you'll be wrong (even if in this case it could be a correct guess, it'd be like a broken clock guessing the time is always 13:37).
  The actual issue with that code was that, after they reused it for GTAOnline and started becoming a performance issue after some time as they added more objects, nobody thought to try and see what is wrong.
  
  Reply View | 2 replies
  
  vultour 8 months ago
  
  Are you actually arguing that using a JSON parser for JSON-formatted data is a premature optimization? The solution here was to use a different format, not a somewhat-JSON-compatible hacked together parser.
  
  Reply View | 1 reply
  
  badsectoracula 8 months ago
  
  No, i'm arguing that it wasn't a performance issue for the original purpose of the code and it only became one at much later, in a different project and only after some time long after that code was pushed way beyond what it was originally meant to do.
  The premature optimization would be trying to optimize that piece of code without that being necessary given what the code was meant to do.
  
  Reply View | 0 replies
- masklinn 8 months ago
  
  They were not the only one to make that mistake e.g. rapidjson had to fix the same error, few people expect parsing one token out of sscanf to strlen the entire input (not only that but there are c++ APIs which call sscanf under the hood).
  The second error of deduplicating values by linear scanning an array was way more egregious.
  
  Reply View | 2 replies
  
  hoten 8 months ago
  
  The real, systemic error is that dozens(?) of engineers worked on that product, supposedly often testing the online component and experiencing that wait time first hand; and none thought "wait, parsing JSON doesn't take that long, computers are fast! what's going on?"
  I think someone estimated that error cost them millions in revenue? I'm pretty sure a fraction of that could afford an engineer who knows how fast computers ought to be.
  
  Reply View | 1 reply
  
  masklinn 8 months ago
  
  GTA was never my wheelhouse, but from what I gathered GTA Online didn't have that much support, and since it was only the initial loading time, and it would have increased over time as the shop content increased, and a very fast machine (e.g. a dev machine) would have had less of an issue, the engineers working on it were probably not that incentivised to dig into it.
  Like, even though it's pretty critical to initial user experience initial loading time is generally what gets disregarded the most.
  > I'm pretty sure a fraction of that could afford an engineer who knows how fast computers ought to be.
  It can, if someone cares enough or realises it's an issue, and then someone is motivated enough to dig into it, or has the time to.
  
  Reply View | 0 replies
GeoAtreides 8 months ago

I'm willing to bet it was was done for performance reasons, subtraction is cheaper than float point division. Probably the compiler also has some tricks to optimize this further.
There is absolutely no way this could turn into an infinite loop. It could underflow, but for that to happen angle would have to be less than the 2*pi, therefore exiting the loop.

Reply View | 3 replies
- auxiliarymoose 8 months ago
  
  The article discusses how that turns into an infinite loop and causes a hang.
  When you subtract a small float from a very large float, the value doesn't change. This is because the "steps" between float values increase with the size of the value (i.e. floats have coarser resolution for larger magnitudes)
  To see this in action, try running the following in a JavaScript interpreter:
  console.log(1_000_000_000_000_000_000 - 1);
  
  Reply View | 1 reply
  
  MBCook 8 months ago
  
  But that’s “impossible”. It’s an angle between 0 and 2pi. When transformed it might go over a bit so they added the check.
  It will “never” become big.
  So why check? It’s unnecessary.
  Thus the bug.
  
  Reply View | 0 replies
- mabster 8 months ago
  
  If m_fBladeAngle is really large (>2.2e8 back of the envelope), the subtraction will have no effect, and that would be an infinite loop.
  
  Reply View | 0 replies
anal_reactor 8 months ago

Long shot, but maybe if the value is small, then this loop could be faster than division.

Reply View | 1 reply
- matsemann 8 months ago
  
  If the code runs every frame, it's probably always small and does just one iteration once in a while when it wraps over the value.
  
  Reply View | 0 replies
hoten 8 months ago

for real. The author clearly never heard of fmod

Reply View | 2 replies
- zerd 8 months ago
  
  fmod takes in the order of 30+ cycles, probably more in year 2003 CPUs, vs 1 for cmp, 1 for sub, 1 for jmp.
  
  Reply View | 1 reply
  
  hoten 8 months ago
  
  Sure the lower bound is nicer here. But when the tradeoff includes an unlimited upper bound it's not a very attractive option.
  I guess the most robust code handling both performance and unexpected input would be one iteration of this (leveraging the assumption that angles are either always within the bounds, or had one frame of going out of bounds by a small amount); followed by a fmod if that assumption is just totally off.
  
  Reply View | 0 replies

mjevans 8 months ago

For anyone with access issues

https://web.archive.org/web/20250423144746/https://cookieplm...