Comment by bayindirh

Comment by bayindirh 8 hours ago

10 replies

My view on this is simple:

If you're a bot which will ignore all the licenses I put on that content, then I don't want to you to be able to reach that content.

No, any amount of monetary compensation is not welcome either. I use these licenses as a matter of principle, and my principles are not for sale.

That's all, thanks.

beeflet 7 hours ago

I think the problem is that despite the effort, you will still end up in the dataset. So it's futile

warkdarrior 8 hours ago

How can you tell a bot will ignore all your content licenses?

  • bayindirh 8 hours ago

    Currently all AI companies argue that the content they use falls under fair use, and disregard all licenses. This means any future ones respecting these licenses needs to be whitelisted.

    • diggan 8 hours ago

      How do you know that that bot is part of those AI companies? Maybe it's my personal bot you're blocking, should I also not have (indirectly) access to the content?

      • simianparrot 8 hours ago

        No. Access to my content is a privilege I grant you. I decide how you get to access it, and via a bot that my setup confuses for an AI crawler belonging to an anti-human AI corporation is not a valid way to access it. Get off my virtual lawn.

      • bayindirh 8 hours ago

        You can use a honest user string denoting that it's your bot. Some AI companies label their bots transparently, they show up on the logs I keep.

        While I understand that you may need a personal bot to crawl or mirror a site, I can't guarantee that I'll grant you access.

        I don't like to be that heavy-handed in the first place, but capitalism is making it harder to trust entities which you can't see and talk face to face.