Comment by armchairhacker

Comment by armchairhacker 3 days ago

2 replies

Today, an external camera can record your computer screen and audio, AI can extract the data and metadata, and a 2D contraption can physically move your mouse to interact. In the future, these will probably become more effective and cheaper (eventually the AI becoming possible to run locally, though even today it’s possible with a good GPU on simple UIs).

Lots of other comments argue for regulation mandating open APIs. I disagree, instead we should remove and prevent regulations that block scraping. We should also create alternative monetization paths for companies who charge for access or use ads, since they’ll lose those paths, and they’re already suffering from piracy and illegal scraping.

pixl97 3 days ago

The biggest problem here is preventing said scraping from shutting down the sites with cost.

Over the years most of the problems I had with sites getting overloaded were from valid 'scrapers' like Google. Quite often providers were adding more memory/cpu just to ensure Google could index the site.

While hosting costs are cheaper than ever, being on the internet can still get very expensive very fast.

  • armchairhacker 2 days ago

    In theory it could be solved by websites charging a very small fee (maybe crypto) for incoming requests, to pay operating costs. The fees from human browsing (even excessive like 1 site / second) would be negligible. APIs that use scraping would forward the fee to their users. Training or search (index) data would cost a lot to generate, but probably still insignificant compared to training the ML model or operating the search.

    It already costs a small amount of electricity for clients to send requests, so maybe paying for the server to handle them wouldn’t be a big difference, but I don’t know much about networking.

    Although in practice, similar things have been tried and those haven’t worked out, e.g. Coil. It would require adoption by most browsers or ISPs, otherwise participating sites would be mostly ignored (most clients don’t pay the fee so they don’t get access, they don’t cares because it’s probably a small site and/or there are alternatives).