Comment by tptacek

> ordinary code is memory-safe by default

What does that mean? What constitutes "ordinary"? I'm not sure there is any official definition of memory safety, but I would consider it to mean that aside from code that is explicitly marked as unsafe it is impossible to write code that has undefined behavior.

Reply View 40 replies

kccqzy a year ago

Good definition. I've seen Go beginners trying to append to a slice from multiple goroutines. It works as well as calling push_back on the same vector from multiple threads in C++. It can easily corrupt GC state and lead to segfaults. The beginner didn't use any advanced trickery or the unsafe package. Therefore Go is not a memory safe language.

Reply View | 35 replies
- sshine a year ago
  
  > Therefore Go is not a memory safe language.
  Interesting.
  To quote the NSA [1], "Some examples of memory safe languages are Python, Java, C#, Go, Delphi/Object Pascal, Swift, Ruby, Rust, and Ada. Memory safe languages provide differing degrees of memory usage protections, so available code hardening defenses, such as compiler options, tool analysis, and operating system configurations, should be used for their protections as well."
  The narrow definition of memory safety here is:
  Go has garbage collection, so you won't have memory leaks or use-after-free.
  Go is powerful enough that beginners can cause segfaults by accidentally abusing internals, okay.
  I'm not sure this is a very redeeming property of Go: Being able to crash the GC, without the flexibility of manual memory management.
  But I'm not sure I'd categorize it as "not memory safe" for the same reason C/C++ aren't (a trade-off).
  Because I don't believe that you can generally leverage this for the kinds of memory exploits made in C/C++.
  I recall that some ML dialects (Standard ML and OCaml) have a library function Obj.magic : 'a -> 'b which escapes the type system. Using this can easily cause segfaults. Does that mean Standard ML and OCaml are not memory safe? Generally, no, they're extremely safe if you avoid that feature, which is most likely. This is arguably less safe than Go, since you most likely won't accidentally run that function.
  [1]: https://media.defense.gov/2022/Nov/10/2003112742/-1/-1/0/CSI...
  
  Reply View | 15 replies
  
  kccqzy a year ago
  
  I'm trying to provide some commentary to OP's original term of "ordinary code" three comments above. While this term is inherently ambiguous and subjective, my personal opinion is that appending to slices simultaneously from multiple goroutines count as "ordinary code" but Obj.magic does not.
  
  Reply View | 12 replies
  
  Cthulhu_ a year ago
  
  To add to Go being memory safe, it automatically blanks/zeroes memory, unlike C.
  
  Reply View | 0 replies
  
  capitol_ a year ago
  
  Could you share more of your thoughts on why that kind of memory corruption wouldn't be exploitable? Do go have something in place that prevents it?
  
  Reply View | 0 replies
- Thaxll a year ago
  
  Appending from multiple goroutine to an in un-synchronized slice is "memory safe", it's completely different from c/c++.
  It behave exactly like Java or C# which are also memory safe.
  
  Reply View | 10 replies
  
  tsimionescu a year ago
  
  I'm not sure of C#, but Java has stronger memory guarantees than Go, even in the presence of a data race.
  In Java, all primitive types (including Object pointers) are atomically modified. And since all Java writes are primitives (Java doesn't have structs), you can never corrupt a data structure at the Java level. Of course, you can still corrupt it at a logical level (break an invariant established in the constructor), but not at the language level.
  Go has a guarantee that word-sized reads/writes are atomic, but Go has plenty of larger objects than that. In particular, interface values are "fat pointers" and exceed the word-size on all platforms, so interface writes are not atomic. Which means another thread can observe an interface value having a vtable from one object but data from another, and can then execute a method from one object on data from another object, potentially re-interpreting fields as values of other types.
  
  Reply View | 7 replies
  
  kaba0 a year ago
  
  Not at all. Java or C# can end up in a logical bug from that, but they will never corrupt their runtime. So in java you can just try-catch whatever bad stuff happens there, and go on afterwards.
  Go programs can literally segfault from a data race. That's no memory safety.
  
  Reply View | 0 replies
  
  kccqzy a year ago
  
  No. Go is worse than Java in this regard. I'll just give you a random example from my five seconds of search: https://github.com/golang/go/issues/37484
  
  Reply View | 0 replies
- everybodyknows a year ago
  
  > corrupt GC state
  I understand this to mean the runtime's internal state, not visible to user code. If so, in general we should expect almost any sort of crash mode to be possible. Seems fair enough to call this "memory-unsafe".
  
  Reply View | 6 replies
  
  tptacek a year ago
  
  You'll be using an idiosyncratic definition the rest of the industry does not use, but you do you.
  What I think is happening here is another instance of a pattern that recurs all the time in communities like this: a term of art was created, "memory safety", to address the concept of languages that don't have buffer overflows, integer overflows, use-after-frees, double frees, controllable uninitialized pointers, and all the other memory lifecycle vulnerabilities. People unfamiliar with the state of the art heard the term, liked it, and have axiomatically derived their own definition for it. They like their definition better, and are not open to the idea that the term exists to serve a purpose orthogonal to their arguments.
  Another recent instance of the same phenomenon: "zero trust".
  Just as happened in the Zero Trust Wars of 2022, people, hearing the industry definition and intent of the term, scramble to reconcile their axiomatic definition with the state of the art, convincing themselves they were right all along.
  The problem they have in this particular argument is: where are the vulnerabilities? Go is not a niche language. It is a high-profile target and has been for over a decade. I saw Go security talks at OWASP Chicago(!) in 2012(!). People have all sorts of hypotheses about how a memory corruption vulnerability --- not "memory corruption", but a vulnerability stemming from it, implying valuable attacker control over the result of whatever bad thing happened --- might sneak into a Go program. Practitioners hear those axiomatic arguments, try to reconcile them with empirical reality, and: it just doesn't hold up.
  Just for whatever it's worth to hear this, if at Black Hat 2025 someone does to Go what James Kettle does to web frameworks ever year and introduces a widespread repeatable pattern of memory exploitability in Go race conditions, about half of my message board psyche will be really irritated (I'll have been wrong!), but the other half of my message board psyche will be fucking thrilled (there will be so much to talk about!) and all of my vulnerability researcher psyche will be doing somersaults (there will be so many new targets to hit!). On net, I'm rooting for myself being wrong. But if I had to bet: we're not going to see that talk, not at BH 2025, or 2026, or 2027. I'm probably not wrong about this.
  
  Reply View | 5 replies
- [removed] a year ago
  
  [deleted]
  
  Reply View | 0 replies
danielheath a year ago

Go lets you use `unsafe.Pointer` (or indeed, assembly intrinsics) if you really want to, but those are certainly not used "ordinarily".

Reply View | 1 reply
- tsimionescu a year ago
  
  It's not just about that. Data races can expose an object in a state that was never written from any thread in Go, potentially corrupting even internal details not exposed. Simply writing a struct value from two different threads can expose this.
  
  Reply View | 0 replies
tptacek a year ago

An example of extraordinary code would be code that interfaces with and/or pulls in non-memory-safe legacy C code.
Another example would be code specifically contrived to highlight a soundness problem in the language.
I used the term "extraordinary" to avoid exactly this kind of bickering over corner cases that aren't relevant to day-to-day software development (or at least, not in ways that aren't immediately evident when they come up.)

Reply View | 1 reply
- thinkharderdev a year ago
  
  > An example of extraordinary code would be code that interfaces with and/or pulls in non-memory-safe legacy C code.
  That's my point though. Of course calling non-memory safe native code over FFI can lead to memory-safety problems in any language. Likewise using the "unsafe" subset that basically every language has. But none of that is required in Go. It is only required that you mutate shared state from different threads, which is something that I would imagine happens in a lot of Go code codebases since it is an extremely easy mistake to make.
  To be clear I think:
  1. Go is mostly a memory safe language because it does in fact prevent the most common memory safety issues in C/C++ (UAF, buffer overflows, etc)
  2. It is LESS memory safe than other modern memory-sage languages (Rust, Java, C#, Python, etc....)
  3. The memory safety issues in Go are very difficult to exploit in code that is not specifically crafted to surface them
  
  Reply View | 0 replies

kaba0 a year ago

But ordinary go code is not memory safe. Data racing can trivially happen just by using the language's primitives. It requires no special keyword like unsafe, or native FFI like in other, actually memory safe languages (rust, or of the GCd kind, java, c#, Js)

Reply View 1 reply

kiitos a year ago

You're using a definition of 'memory safety' which is not common.

Reply View | 0 replies

remram a year ago

How safe is it? It has pointers and they are widely used (more than Rust where pointers are unsafe, but there are other reference types). Are those safe?

Reply View 5 replies

dfawcus a year ago

Generally.
That is as it does not have pointer arithmetic, unlike C, and arrays / slices are bounds checked. So one will get a crash from a null pointer deref.
The other risk with null pointer access is struct member access via such a pointer, but again due to lack of pointer arithmetic, that can't be easily triggered. The one way would be to have a massive struct, say much greater than the page size, and deref through that - fairly unlikely.
The other reference types (slices, maps, interface values, channels) are safe unless subject to data race issues (multi goroutine update). However channels are safe there, as their role is to be used from multiple goroutines.
So the path to lack of memory safety would be a data race, leading to type misinterpretation, hence type unsafety, then incorrect access and/or spatial and temporal unsafety as a consequence.
Apart from poor design / implementation of explicit multi threaded apps, the most likely data race strikes me as accidental lexical capture by a goroutine, hence movement to the heap, and a resultant race. The sort of thing which was mentioned in a paper (by Uber?). Those should be amiable to detection by linters.
The other case of races from poor threading design would be harder to automatically detect, but also harder to trigger. Probably avoidable by correct use of mutexes around access to the shared types (slices and maps), or simply by following an Actor or CSP design model.

Reply View | 3 replies
- remram a year ago
  
  Thanks for the summary, that is very helpful.
  At a language level though, it is either safe or unsafe. If it is "generally safe" provided you use it correctly, I would say it is not safe, in the strict sense.
  I don't think data races on pointers are allowed (looking at the memory model: https://go.dev/ref/mem) but I am not sure I have understood your scenario fully. Maybe I should read that paper you mention.
  Thanks again for the detailed response!
  
  Reply View | 2 replies
  
  dfawcus a year ago
  
  That is an absolutist position, which some of us don't agree with. Taking the view that in practice there are degrees of "memory safety". That is generally my position, and that the largest benefits come from spatial safety, then temporal safety, in that order.
  On that absolute position, there possibly are no "memory safe" languages, not even Rust as until it's borrow checker "bug" is fixed, it fails the absolutist position. If such a bug is left unfixed for long enough, one can deem it as de-facto "won't fix".
  The Go example code provided elsewhere in the thread included a memory race on an "interface value", that being a form of "fat pointer". It was that I was referring to, updating only half of value, so making it internally inconsistent.
  
  Reply View | 1 reply
  
  [removed] a year ago
  
  [deleted]
  
  Reply View | 0 replies
tptacek a year ago

Yes.

Reply View | 0 replies