Comment by neilv 13 hours ago

Can demonstrable ignoring of robots.txt help the cases of copyright infringement lawsuits against the "AI" companies, their partners, and customers?

thayne 12 hours ago

Probably not copyright infringement. But it is probably (hopefully?) a violation of the CFAA, both because it is effectively DDoSing you and because they are ignoring robots.txt.

Maybe worth contacting law enforcement?

Although it might not actually be Amazon.

to11mtm 11 hours ago

Big thing worth asking here. Depending on what 'amazon' means here (i.e. known to be Amazon specific IPs vs Cloud IPs) it could just be someone running a crawler on AWS.
Or, folks failing their side of AWS's 'shared responsibility model', so their stuff is compromised and botnets are running on AWS.
Or, folks that are quasi-spoofing 'AmazonBot' because they think it will have a better not-block rate than anonymous or other requests...
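
One rough way to tell "Amazon-specific IPs" from "someone renting a VM on AWS" is to check the client IP against the ranges AWS publishes at https://ip-ranges.amazonaws.com/ip-ranges.json. The sketch below is a minimal illustration, not a definitive check: `SAMPLE_RANGES` is made-up data in the shape of that file (using documentation-only prefixes), the helper names are hypothetical, and treating an `EC2` tag as "customer compute" is an assumption.

```python
import ipaddress

# Illustrative sample in the shape of AWS's published ip-ranges.json.
# The prefixes are RFC 5737 documentation ranges, NOT real AWS allocations.
SAMPLE_RANGES = {
    "prefixes": [
        {"ip_prefix": "192.0.2.0/24", "region": "us-east-1", "service": "AMAZON"},
        {"ip_prefix": "192.0.2.0/25", "region": "us-east-1", "service": "EC2"},
    ]
}


def services_for_ip(ip, ranges):
    """Return the set of service labels whose prefixes contain `ip`."""
    addr = ipaddress.ip_address(ip)
    return {
        entry["service"]
        for entry in ranges["prefixes"]
        if addr in ipaddress.ip_network(entry["ip_prefix"])
    }


def classify(ip, ranges):
    """Rough triage of a client IP (assumption: EC2 tag means rented compute)."""
    services = services_for_ip(ip, ranges)
    if not services:
        return "not in the listed AWS ranges"
    if "EC2" in services:
        return "AWS customer compute (could be anyone renting a VM)"
    return "AWS address space, not tagged as EC2"
```

In practice you would load the real JSON file instead of the sample and run `classify` over the IPs in your access log. Even then, an IP match says nothing about whether the user agent string is honest.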

- thayne 10 hours ago
  
  From the information in the post, it sounds like the last one to me. That is, someone else spoofing an Amazonbot user agent. But it could potentially be all three.
  
adastra22 13 hours ago

On what legal basis?

flir 12 hours ago

In the UK, the Computer Misuse Act applies if:
* There is knowledge that the intended access was unauthorised
* There is an intention to secure access to any program or data held in a computer
I imagine US law has similar definitions of unauthorized access?
`robots.txt` is the universal standard for defining what is unauthorised access for bots. No programmer could argue they aren't aware of this, and ignoring it, for me personally, is enough to show knowledge that the intended access was unauthorised. Is that enough for a court? Not a goddamn clue. Maybe we need to find out.
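
For reference, checking `robots.txt` before fetching is only a few lines with Python's standard `urllib.robotparser`, which is part of why "we didn't know" is a hard argument for a crawler author to make. This is a minimal sketch: the `ROBOTS_TXT` content and the `is_allowed` helper are hypothetical and not taken from the site in the post.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration; not the file from the post.
ROBOTS_TXT = """\
User-agent: Amazonbot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

A crawler that respects the file would call something like `is_allowed("Amazonbot", url)` before each request and skip the URL when it returns `False`.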

- adastra22 5 hours ago
  
  robots.txt isn't a standard. It is a suggestion, and not legally binding AFAIK. In US law, at least, a bot scraping a site doesn't involve a human being, and therefore the TOS do not constitute a contract. According to the Robotstxt organization itself: “There is no law stating that /robots.txt must be obeyed, nor does it constitute a binding contract between site owner and user, but having a /robots.txt can be relevant in legal cases.”
  The last part basically means the robots.txt file can be circumstantial evidence of intent, but there need to be other factors at the heart of the case.
  
- pests 12 hours ago
  
  > `robots.txt` is the universal standard
  Quite the assumption, you just upset a bunch of alien species.
  
  flir 12 hours ago
  
  Dammit. Unchecked geocentric model privilege, sorry about that.
  
  thayne 12 hours ago
  
  Universal within the scope of the Internet.
  
tepidsaucer 3 hours ago

I wind up in jail for ten years if I download an episode of iCarly; Sam Altman inhales every last byte on the internet and gets a ticker tape parade. Make it make sense.

readyplayernull 13 hours ago

Terms of use contract violation?

- hipadev23 12 hours ago
  
  Robots.txt is completely irrelevant. TOU/TOS are also irrelevant unless you restrict access to only those who have agreed to terms.
  
- bdangubic 13 hours ago
  
  Good thought, but zero chance this holds up in court.
  