Sujet : Re: Google Cache finally gone
De : candycanearter07 (at) *nospam* candycanearter07.nomail.afraid (candycanearter07)
Groupes : comp.miscDate : 25. Sep 2024, 14:40:03
Autres entêtes
Organisation : the-candyden-of-code
Message-ID : <slrnvf8433.19b1p.candycanearter07@candydeb.host.invalid>
References : 1 2 3 4
User-Agent : slrn/1.0.3 (Linux)
Rich <
rich@example.invalid> wrote at 13:10 this Wednesday (GMT):
Computer Nerd Kev <not@telling.you.invalid> wrote:
Rich <rich@example.invalid> wrote:
Computer Nerd Kev <not@telling.you.invalid> wrote:
Damn, I've been using it more and more as JS-walls have become more
frequent and prevent me reading pages in lightweight web browsers
without JS support. In fact it's about the only thing I use Google
for!
This one works pretty well for /most/ paywalls: https://archive.is/
Thanks, yes that has the article. It wasn't actually behind a
paywall, but using one of these cache services like Cloudflare that
can block access from browsers without Javascript saying something
like "Enable Javascript and cookies to continue". I thought
"JS-wall" was a good term for it since the effect is like a paywal,
only they demand you run their JS rather than demand payment.
>
Ah, those. The term you are searching for is "capatcha" [1], at least
in the 'cloudfare' case. They are, supposedly, to prevent bots from
scraping/DDOSing the site. However, an awful lot of sites add them
either because they decide to "go cloudfare" (in a belief they are big
and popular enough to justify such) or simply because the web devs are
idiots that just "follow the herd" and because they see caapatcha's
else where, they add one here.
>
>
[1] https://en.wikipedia.org/wiki/CAPTCHA
Cloudflare captchas are very annoying, it completely broke a webscraping
script I used :(
Also I want to use NoScript
-- user <candycane> is generated from /dev/urandom