Comment by tiborsaas

Comment by tiborsaas 16 hours ago

> We see that that’s a quite a long line. Mail servers don’t like that

Why do mail server care about how long a line is? Why don't they just let the client reading the mail worry about wrapping the lines?

direwolf20 15 hours ago

SMTP is a line–based protocol, including the part that transfers the message body

The server needs to parse the message headers, so it can't be an opaque blob. If the client uses IMAP, the server needs to fully parse the message. The only alternative is POP3, where the client downloads all messages as blobs and you can only read your email from one location, which made sense in the year 2000 but not now when everyone has several devices.

Reply View 20 replies

Joker_vD 2 hours ago

But everything after headers can (almost) be a blob. Just copy buffers while taking care to track CRLF and look if what follows is a space. In fact, you have to do it anyhow, because line-folding is allowed in headers as well! And this "chunking long lines" technique has been around since the 70s, when people started writing parsers on top of hand-crafted buffered I/O.

Reply View | 0 replies
fluoridation 14 hours ago

Hey, POP3 still makes sense. Having a local copy of your emails is useful.

Reply View | 18 replies
- direwolf20 14 hours ago
  
  If you want it to be the only copy and not sync with anything
  POP3 is line–based too, anyway. Maybe you can rsync your maildir?
  
  Reply View | 5 replies
  
  kevin_thibedeau 3 hours ago
  
  Any decade now and we'll be ready for JMAP.
  
  Reply View | 0 replies
  
  fluoridation 13 hours ago
  
  I just read it mainly in one place and through the web interface when I have to.
  
  Reply View | 3 replies
- Jaxan 9 hours ago
  
  Isn’t the only difference between pop and imap that pop removes the mail from the server? I only use imap, and all my email is available offline.
  
  Reply View | 5 replies
  
  direwolf20 7 hours ago
  
  POP is a simple mail transfer protocol (hehe...). It supports three things: get number of mails, download mail by number, delete mail by number. This is what you need to move mails in bulk from one point to another. POP3 mail clients are local maildir clients that use POP3 to get new mail from the server. It's like SMTP if it were based on polling.
  IMAP is an interactive protocol that is closer to the interaction between Gmail frontend and backend. It does many things. The client implements a local view of a central source of truth.
  
  Reply View | 0 replies
  
  fluoridation 9 hours ago
  
  No, the difference is that IMAP doesn't store anything other than headers on the client (at least, not until the user tries to read a message), while POP3 eagerly downloads messages whenever they're available. A POP3 client can be configured with various remote retention policies, or even to never delete downloaded messages.
  I don't have an IMAP account available to check, but AFAIK, you should not have locally the content of any message you've never read before. The whole point of IMAP is that it doesn't download messages, but instead acts like a window into the server.
  
  Reply View | 1 reply
  
  mmooss 8 hours ago
  
  Also, IMAP syncs the other way. If you locally tag a message locally or move it to another folder, it also happens on the server.
  
  Reply View | 0 replies
  
  ars 3 hours ago
  
  Not at all. IMAP can do a lot of complex operations on the email while leaving it on the server, for example you can have the server search the email, flag it (mark it important, or read, or unread).
  POP can download the email, and that's about it.
  
  Reply View | 0 replies
  
  gsich 5 hours ago
  
  Depending on what you configured. It can also keep the mail on the server.
  
  Reply View | 0 replies
- ahoka 11 hours ago
  
  But it's more akin to consuming a message queue. You have fetched it, it's gone.
  
  Reply View | 1 reply
  
  foresto 8 hours ago
  
  This is incorrect. POP3 does not require fetched messages to be deleted from the server.
  
  Reply View | 0 replies
- encom 9 hours ago
  
  Nothing stops you from locally archiving your email with IMAP.
  
  Reply View | 3 replies
  
  mmooss 8 hours ago
  
  How do you do that, by default? Can you tell an IMAP client to work like POP3 and download everything?
  
  Reply View | 2 replies

layer8 16 hours ago

Mails are (or used to be) processed line-by-line, typically using fixed-length buffers. This avoids dynamic memory allocation and having to write a streaming parser. RFC 821 finally limited the line length to at most 1000 bytes.

Given a mechanism for soft line breaks, breaking already at below 80 characters would increase compatibility with older mail software and be more convenient when listing the raw email in a terminal.

This is also why MIME Base64 typically inserts line breaks after 76 characters.

Reply View 2 replies

SoftTalker 13 hours ago

In early days, many/most people also read their email on terminals (or printers) with 80-column lines, so breaking lines at 72-ish was considered good email etiquette (to allow for later quoting prefix ">" without exceeding 80 characters).

Reply View | 1 reply
- bjourne 10 hours ago
  
  One of the technical marvels of the day were mail and usenet clients that could properly render quoted text from infinite, never ending flame wars!
  
  Reply View | 0 replies

GMoromisato 10 hours ago

I don't think kids today realize how little memory we had when SMTP was designed.

For example, the PDP-11 (early 1970s), which was shared among dozens of concurrent users, had 512 kilobytes of RAM. The VAX-11 (late 1970s) might have as much as 2 megabytes.

Programmers were literally counting bytes to write programs.

Reply View 1 reply

NetMageSCW 6 hours ago

I assure you we were not, at least it wasn’t really necessary. Virtual Memory is a powerful drug.

Reply View | 0 replies

liveoneggs 15 hours ago

This is how email work(ed) over smtp. When each command was sent it would get a '200'-class message (success) or 400/500-class message (failure). Sound familiar?

telnet smtp.mailserver.com 25

HELO

MAIL FROM: me@foo.com

RCPT TO: you@bar.com

DATA

blah blah blah

how's it going?

talk to you later!

QUIT

Reply View 7 replies

1718627440 13 hours ago

For anyone who wants to try this against a modern server:

    openssl s_client -connect smtp.mailserver.com:smtps -crlf
    220 smtp.mailserver.com ESMTP Postfix (Debian/GNU)
    EHLO example.com
    250-smtp.mailserver.com
    250-PIPELINING
    250-SIZE 10240000
    250-VRFY
    250-ETRN
    250-AUTH PLAIN LOGIN
    250-ENHANCEDSTATUSCODES
    250-8BITMIME
    250-DSN
    250-SMTPUTF8
    250 CHUNKING

    MAIL FROM:me@example.com
    250 2.1.0 Ok

    RCPT TO:postmaster
    250 2.1.5 Ok

    DATA
    354 End data with <CR><LF>.<CR><LF>

    Hi
    .
    250 2.0.0 Ok: queued as BADA579CCB

    QUIT
    221 2.0.0 Bye

Reply View 0 replies

Telemakhos 14 hours ago

This brings back some fun memories from the 1990s when this was exactly how we would send prank emails.

Reply View | 2 replies
- kstrauser 11 hours ago
  
  Yep! And also, if you included a blank line and then the headers for a new email in the bottom of your message, you could tell the server, hey, here comes another email for you to process!
  If you were typing into a feedback form powered by something from Matt’s Script Archive, there was about a 95% chance you could trivially get it to send out multiple emails to other parties for every one email sent to the site’s owner.
  
  Reply View | 0 replies
- fix4fun 12 hours ago
  
  That was nice part of 1990s - many systems allow for funny things ;)
  
  Reply View | 0 replies
xg15 13 hours ago

I like how SMTP was at least honest in calling it the "receipt to" address and not the "sender" address.
Edit: wrong.

Reply View | 2 replies
- 1718627440 13 hours ago
  
  RCPT TO specifies the destination (recipient) address, the "sender" is what is written in MAIL FROM.
  However what most mail programs show as sender and recipient is neither, they rather show the headers contained in the message.
  
  Reply View | 1 reply
  
  xg15 13 hours ago
  
  Ah, sorry. You're right.
  
  Reply View | 0 replies

jcynix 13 hours ago

"BITNET was a co-operative university computer network in the United States founded in 1981 by Ira Fuchs at the City University of New York (CUNY) and Greydon Freeman at Yale University."

https://en.wikipedia.org/wiki/BITNET

BITNET connected mainframes, had gateways to the Unix world and was still active in the 90s. And limited line lengths … some may remember SYSIN DD DATA … oh my goodness …

https://www.ibm.com/docs/en/zos/2.1.0?topic=execution-systsi...

Reply View 0 replies

citrin_ru 16 hours ago

Back in 80s-90s it was common to use static buffers to simplify implementation - you allocate a fixed size buffer and reject a message if it has a line longer than the buffer size. SMTP RFC specifies 1000 symbols limit (including \r\n) but it's common to wrap around 87 symbols so it is easy to examine source (on a small screen).

Reply View 0 replies

thephyber 16 hours ago

The simplest reason: Mail servers have long had features which will send the mail client a substring of the text content without transferring the entire thing. Like the GMail inbox view, before you open any one message.

I suspect this is relevant because Quoted Printable was only a useful encoding for MIME types like text and HTML (the human readable email body), not binary (eg. Attachments, images, videos). Mail servers (if they want) can effectively treat the binary types as an opaque blob, while the text types can be read for more efficient transfer of message listings to the client.

Reply View 0 replies

Pinus 16 hours ago

As far as I can remember, most mail servers were fairly sane about that sort of thing, even back in the 90’s when this stuff was introduced. However, there were always these more or less motivated fears about some server somewhere running on some ancient IBM hardware using EBCDIC encoding and truncating everything to 72 characters because its model of the world was based on punched cards. So standards were written to handle all those bizarre systems. And I am sure that there is someone on HN who actually used one of those servers...

Reply View 6 replies

jcynix 13 hours ago

EBCDIC wasn't the problem, this was (part of) the problem:
https://www.ibm.com/docs/en/zos/2.1.0?topic=execution-systsi...
And BITNET …

Reply View | 1 reply
- kstrauser 11 hours ago
  
  > EBCDIC wasn't the problem
  Wake up, everyone! Brand new sentence just dropped!
  
  Reply View | 0 replies
tiborsaas 16 hours ago

Thanks, I really expected a tale from the 70's, but did not see punch cards coming :)

Reply View | 3 replies
- jibal 15 hours ago
  
  The influence of 80 column punch cards remains pervasive.
  
  Reply View | 2 replies
  
  [removed] 14 hours ago
  
  [deleted]
  
  Reply View | 0 replies
  
  NetMageSCW 6 hours ago
  
  IBM has a lot to make up for.
  
  Reply View | 0 replies

josefx 16 hours ago

RFC822 explicitly says it is for readability on systems with simple display software. Given that the protocol is from 1982 and systems back then had between 4 and 16kb RAM in total it might have made sense to give the lower end thin client systems of the day something preprocessed.

Reply View 3 replies

sumtechguy 14 hours ago

Also it is an easy way to stop a denial of service attack. If you let an infinite amount in that field. I can remotely overflow your system memory. The mail system can just error out and hang up on the person trying the attack instead of crashing out.

Reply View | 1 reply
- fluoridation 14 hours ago
  
  Surely you don't need the message to be broken up into lines just for that. Just read until a threshold is reached and then close the connection.
  
  Reply View | 0 replies
badc0ffee 10 hours ago

You could expect a lot more (512kB, 1MB, 2MB) in an internet-connected machine running Unix or VMS.

Reply View | 0 replies

codingdave 16 hours ago

Keep in mind that in ye olden days, email was not a worldwide communication method. It was more typical for it to be an internal-only mail system, running on whatever legacy mainframe your org had, and working within whatever constraints that forced. So in the 90s when the internet began to expand, and email to external organizations became a bigger thing, you were just as concerned with compatibility with all those legacy terminal-based mail programs, which led to different choices when engineering the systems.

Reply View 4 replies

liveoneggs 15 hours ago

This is incorrect

Reply View | 3 replies
- kstrauser 11 hours ago
  
  Are you certain? Not OP, but a huge chunk of early RFCs was about how to let giant IBM systems talk to everyone else, specifying everything from character sets (nearly universally “7-bit ASCII”) to end of line/message characters. Otherwise, IBM would’ve tried to make EBCDIC the default for everything.
  For instance, consider FTP’s text mode, which was primarily a way to accidentally corrupt your download when you forgot to type “bin” first, but was also handy for getting human readable files from one incompatible system to another.
  
  Reply View | 2 replies
  
  liveoneggs 9 hours ago
  
  I had a pre-'@' email address and it was able to communicate all over the world.
  
  Reply View | 1 reply
  
  kstrauser 8 hours ago
  
  My first reading was that you were disagreeing with the bits about email worrying about compatibility, and that part seemed reasonably true to me.
  As to the other bits, I think even in the uucp era, email was mostly internal, by volume of mail sent, even though you could clearly talk to remote sites if everything was set up correctly. It was capable of being a worldwide communication system. I bet the local admins responsible for monitoring the telephone bill preferred to keep that in check, though.
  
  Reply View | 0 replies