Comment by haberman

Comment by haberman 10 months ago

> Non-goals: Drop-in replacement for CPython: Codon is not a drop-in replacement for CPython. There are some aspects of Python that are not suitable for static compilation — we don't support these in Codon.

This is targeting a Python subset, not Python itself.

For example, something as simple as this will not compile, because lists cannot mix types in Codon (https://docs.exaloop.io/codon/language/collections#strong-ty...):

    l = [1, 's']

It's confusing to call this a "Python compiler" when the constraints it imposes pretty fundamentally change the nature of the language.

quotemstr 10 months ago

It's not even a subset. They break foundational contracts of the Python language without technical necessity. For example,

> Dictionaries: Codon's dictionary type does not preserve insertion order, unlike Python's as of 3.6.

That's a gratuitous break. Nothing about preserving insertion order interferes with compilation, AOT or otherwise. The authors of Codon broke dict ordering because they felt like it, not because they had to.

At least Mojo merely claims to be Python-like. Unlike Codon, it doesn't claim to be Python then note in the fine print that it doesn't uphold Python contractual language semantics.

Reply View 14 replies

orf 10 months ago

Try not to throw around statements like “they broke duct ordering because they felt like it”.
Obviously they didn’t do that. There are trade-offs when preserving dictionary ordering.

Reply View | 7 replies
- baq 10 months ago
  
  dicts ordering keys in insertion order isn't an implementation detail anymore and hasn't been for years.
  
  Reply View | 2 replies
  
  nick238 10 months ago
  
  I get that all dicts are now effectively an `collections.OrderedDict`, but I've never seen practical code that uses the insertion order. You can't do much with that info (no `.pop()`, can't sort a dict without recreating it) beyond maybe helping readability when you print or serialize it.
  
  Reply View | 1 reply
  
  funny_falcon 10 months ago
  
  Readability, reproducibility, and simple LRU.
  Reproducibility matters for tests, they become simpler. Some other algorithms become simpler as well.
  LRU is just a dict with preserving order: on access just delete and insert again.
  
  Reply View | 0 replies
- dathinab 10 months ago
  
  if you claim
  > high-performance Python implementation
  then no this aren't trade-offs but breaking the standard without it truly being necessary
  most important this will break code in a subtle and potentially very surprising way
  they could just claim they are python like and then no one would hold them for not keeping to the standard
  but if you are misleading about your product people will find offense even if it isn't intentionally
  
  Reply View | 0 replies
- actionfromafar 10 months ago
  
  The trade-off is a bit of speed.
  
  Reply View | 2 replies
  
  cjbillington 10 months ago
  
  This might be what you meant, but the ordered dicts are faster, no? I believe ordering was initially an implementation detail that arose as part of performance optimisations, and only later declared officially part of the spec.
  
  Reply View | 1 reply
  
  Someone 10 months ago
  
  > but the ordered dicts are faster, no?
  They may be in the current implementations, but removing an implementation constraint can only increase the solution space, so it cannot make the best implementation slower.
  As a trivial example, the current implementation that guarantees iteration happens in insertion order also is a valid implementation for a spec that does not require that guarantee.
  
  Reply View | 0 replies
adammarples 10 months ago

Well would you claim that Python 3.5 isn't python?

Reply View | 5 replies
- stoperaticless 10 months ago
  
  All versions of python are python.
  If lang is not compatible with any of python versions, then the lang isn’t python.
  False advertising is not nice. (even if the fineprint clarifies)
  
  Reply View | 4 replies
  
  thesz 10 months ago
  
  > If lang is not compatible with any of python versions, then the lang isn’t python.
  Python versions are not compatible between themselves, as python does not preserve backward compatibility, ergo python is not python.
  
  Reply View | 3 replies

wpietri 10 months ago

Yeah, this right here would kill it for me:

> Strings: Codon currently uses ASCII strings unlike Python's unicode strings.

That rules out almost anything web-ish for me.

The use case I could imagine is places where you have a bunch of python programmers who don't really want to learn another language but you have modest amounts of very speed-sensitive work.

E.g., you're a financial trading company who has hired a lot of PhDs with data science experience. In that context, I could imagine saying, "Ok, quants, all of your production code has to work in Codon". It's not like they're programming masters anyhow, and having it be pretty Python-ish will be good enough for them.

Reply View 2 replies

Retr0id 10 months ago

>> Strings: Codon currently uses ASCII strings unlike Python's unicode strings.
Yikes. These days I wouldn't even call those strings, just bytes. I can live with static/strong typing (I prefer it, even), but not having support for actual strings is a huge blow.

Reply View | 0 replies
wpietri 10 months ago

Ah, looking further, I find this about the company: "Their focus lies in bridging the gap between these two aspects across various domains, with a particular emphasis on life sciences and bioinformatics."
That makes sense as a sales pitch. "Hey, company with a lot of money! Want your nerds to go faster and need less expensive hardware? Pay us for magic speed-ups!" So it's less a product for programmers than it is for executives.

Reply View | 0 replies

bpshaver 10 months ago

Who is out here mixing types in a list anyway?

Reply View 31 replies

dathinab 10 months ago

parsing json is roughly of the type:
type Json = None | bool | float | str | dict[str, Json] | list[Json]
you might have similar situations for configs e.g. float | str for time in seconds or a human readable time string like "30s" etc.
given how fundamental such things are I'm not sure if there will be any larger projects (especially wrt. web servers and similar) which are compatible with this
also many commonly used features for libraries/classes etc. are not very likely to work (but idk. for sure, they just are very dynamic in nature)
so IMHO this seems to be more like a python-like language you can use for idk. some since computations and similar then a general purpose faster python

Reply View | 3 replies
- bpshaver 10 months ago
  
  Agreed, I was just joking. I understand heterogenous lists are possible with Python, but with the use of static type checking I feel like its pretty rare for me to have heterogenous lists unless its duck typing.
  
  Reply View | 2 replies
  
  JonChesterfield 10 months ago
  
  If your language obstructs heterogeneous lists your programs will tend to lack them. Look for classes containing multiple hashtables from the same strings to different object types as a hint that they're missed.
  Whether that's a feature is hard to say. Your language stopped you thinking in those terms, and stopped your colleagues from doing so. Did it force clarity of thought or awkward contortions in the implementation? Tends to depend on the domain.
  
  Reply View | 1 reply
  
  bobbylarrybobby 10 months ago
  
  Heterogeneity is easily achieved in statically typed languages via sum types.
  
  Reply View | 0 replies
orf 10 months ago

It’s common to have a list of objects with different types, but which implement the same interface. Duck typing of this kind is core to Python.

Reply View | 1 reply
- bpshaver 10 months ago
  
  Good point.
  
  Reply View | 0 replies
CaptainNegative 10 months ago

I often find myself mixing Nones into lists containing built-in types when the former would indicate some kind of error. I could wrap them all into a nullable-style type, but why shouldn't the interpreter implicitly handle that for me?

Reply View | 1 reply
- bpshaver 10 months ago
  
  Yeah, that seems fair.
  
  Reply View | 0 replies
itishappy 10 months ago

The json module returns heterogenous dicts.
https://docs.python.org/3/library/json.html

Reply View | 6 replies
- bpshaver 10 months ago
  
  Yeah, just because it can do that doesn't mean that it is good design.
  
  Reply View | 5 replies
  
  gwking 10 months ago
  
  It is the design of JSON! Which is a reflection of the same dynamic typing choice made in the original design of Javascript.
  
  Reply View | 1 reply
  
  mrguyorama 10 months ago
  
  They, uh, still aren't wrong hah.
  Tell me again why we somehow standardized on sending the equivalent of JSObject.toString() for everything? Especially when "standardized" isn't
  
  Reply View | 0 replies
  
  gpderetta 10 months ago
  
  how would you represent an arbitrary JSON array in python then? A potentially heterogeneous list seems the obvious solution.
  
  Reply View | 2 replies
dekhn 10 months ago

I've been mixing types in Python lists for several decades now. Why wouldn't you? it's a list of PyObjects.

Reply View | 0 replies
gwking 10 months ago

An example related to JSON content is HTML content. I have a Python library that represents all of the standard HTML tags as a family of classes. It is like a lightweight DOM on the server side, and has resulted in a web server that does not use string based templating at all. It lets me construct trees of HTML completely in Python and then render them out with everything correctly escaped. I can also parse HTML into trees and manipulate them as I please (for e.g. scraping tasks and document transforms). It is all strongly typed using mypy and I adhere to the strictest generic typing I can manage.
Each node has a list of children, and the element type is `str|HtmlNode`. I find this vastly easier to use than the LXML ETree api, where nodes have `text` and `tail` attributes to represent interleaved text.
Interestingly, the LXML docs promote their design as follows: > he two properties .text and .tail are enough to represent any text content in an XML document. This way, the ElementTree API does not require any special text nodes in addition to the Element class, that tend to get in the way fairly often (as you might know from classic DOM APIs). https://lxml.de/tutorial.html#elements-contain-text
It could be a simple matter of taste! But I suspect that the difference between what they are describing as "classic DOM" vs what I am doing is that they are referring to experience with C/C++/Java libraries circa 2009 that had much less convenient dynamic type introspection. The "get in the way fairly often" reminds me of how verbose it is to deal with heterogenous data in C/C++/ObjC. In ObjC for example, you could have an array mixing NSString with other NSObject subclasses, but you had to do work to type it correctly. If you wanted numbers in there you had to use NSNumber which is an annoying box type that you never otherwise use. And ObjC was considered very dynamic in its day!
I have long felt that the root of much evil was the overbearing distinction between primitive and object types in C++/Java/Objective-C.
All of this is a long way of saying, I think "how to deal with heterogenous lists of stuff" is a huge question in language design, library design, and the daily work of programming. Modern languages have by no means converged on a single way to represent varying types of elements. If you want to create trees of stuff, at some level that is "mixing types in a list" no matter how you might try to encode it. Just food for thought!

Reply View | 0 replies
nicce 10 months ago

Everyone who chooses the Python in the first hand.

Reply View | 1 reply
- bpshaver 10 months ago
  
  Well, I'm one of those people, and I feel that I rarely do this. Except if I have a list of different objects that implement the same interface, as another commenter mentioned.
  
  Reply View | 0 replies
RogerL 10 months ago

return [key, value]

Reply View | 10 replies
- Myrmornis 10 months ago
  
  You should use a tuple there: it's a collection of fixed size where each slot has an identity. (There's a common confusion in Python circles that the main point of tuples is immutability; that's not so).
  
  Reply View | 0 replies
- ghxst 10 months ago
  
  Why would you do this over `return key, value` which produces a tuple? Just curious.
  
  Reply View | 8 replies
  
  dgan 10 months ago
  
  Not the parent, but i return heterogeneous lists of the same length to the excel to be used by xlwings. The first row being the headers, but every row below is obviously heterogeneous
  
  Reply View | 0 replies
  
  slightwinder 10 months ago
  
  To quote the Zen of Python:
  Explicit is better than implicit. Readability counts.
  
  Reply View | 5 replies
  
  0xDEADFED5 10 months ago
  
  javascript refugee?
  
  Reply View | 0 replies
__mharrison__ 10 months ago

Someone who is using Python the wrong way.

Reply View | 0 replies

BiteCode_dev 10 months ago

For a real compiler try nuitka.

Reply View 0 replies

odo1242 10 months ago

Yeah, it feels closer to something like Cython without the python part.

Reply View 0 replies

jjk7 10 months ago

The differences seem relatively minor. Your specific example can be worked around by using a tuple; which in most cases does what you want.

Reply View 4 replies

itishappy 10 months ago
Altering python's core datatypes is not what I'd call minor.
They don't even mention the changes to `list`.
> Integers: Codon's int is a 64-bit signed integer, whereas Python's (after version 3) can be arbitrarily large. However Codon does support larger integers via Int[N] where N is the bit width.
> Strings: Codon currently uses ASCII strings unlike Python's unicode strings.
> Dictionaries: Codon's dictionary type does not preserve insertion order, unlike Python's as of 3.6.
> Tuples: Since tuples compile down to structs, tuple lengths must be known at compile time, meaning you can't convert an arbitrarily-sized list to a tuple, for instance.
https://docs.exaloop.io/codon/general/differences
Pretty sure this means the following doesn't work either:
config = { "name": "John Doe", "age": 32 }
Note: It looks like you can get around this via Python interop, but that further supports the point that this isn't really Python.
Reply View | 3 replies
- dathinab 10 months ago
  
  > Strings: Codon currently uses ASCII strings unlike Python's unicode strings.
  wtf this is a supper big issue making this basically unusable for anything handling text (and potentially even just fixed indents, if you aren't limited to EU+US having non us-ascii idents in code or text is common, i.e. while EU companies most times code in english this is much less likely in Asia, especially China and Japan.
  it isn't even really a performance benefit compared to utf-8 as utf-8 only using us-ascii letters _is_ us-ascii and you don't have to use unicode aware string operations
  
  Reply View | 2 replies
  
  gpderetta 10 months ago
  
  In fact most EU languages are not representable in ASCII.
  
  Reply View | 0 replies
  
  robjwells 10 months ago
  
  Yeah this is a baffling decision. I’d like to know what the motivation is. ASCII doesn’t even contain € or £.
  
  Reply View | 0 replies