Obligatory
His eyes are creepy as fuck in this image. Doesn’t feel human. No soul. It’s like a demon wearing a human suit.
Too bad the website is still openly accessible and still capable of being scraped
Well that’s part of the thing. Web scraping doesn’t get covered by policies. Like, they could ban your ip or any accounts you have, but web scraping itself will always be acceptable. It’s why projects like NewPipe and Invidious don’t care about YouTube cease and desist letters.
Oops look like this community hasn’t been reviewed. Login if you still want to see the content.
Is it any different for an “API”? I don’t think there’s a very big difference between an HTTP endpoint that returns HTML and an HTTP endpoint that returns JSON.
We use Akamai where I work for security, CDN, etc. Their services make it largely trivial to identify traffic from bots. They can classify requests in real time as coming from known bots like Googlebot to programming frameworks like python & java to bots that impersonate Googlebot, to virtually any other automated traffic from unknown bots.
If Reddit was smart they’d leverage something like that to allow Google, Bing, etc. to crawl their data and block all others, or poison others with bogus data. But we’re talking about Reddit here…
Sure, why not. People gave you all the information on Reddit for free, you might as well sell it to the highest bidder without compensating them. I call it the “Veasey maneuver.”
They have a ton of useful and valuable comments.
They also have some of mine but that’s more of a liability.
For awhile, it was a really special place,
for feeding procrastination.