Sujet : Re: Emigration from Usenet [was: Re: PTD was the most-respected of the AUE regulars ...]
De : ldo (at) *nospam* nz.invalid (Lawrence D'Oliveiro)
Groupes : comp.miscDate : 28. Jul 2024, 02:55:16
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <v848e4$3kh8o$3@dont-email.me>
References : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
User-Agent : Pan/0.159 (Vovchansk; )
On 26 Jul 2024 22:18:48 +1000, Computer Nerd Kev wrote:
I'm not really sure whether a HTML parser
library would be helpful or just a pointless extra layer of complexity.
So far I've just used regular expressions for scraping webpages.
I learned about BeautifulSoup early on, and never looked back. I use it
for all my web-scraping projects nowadays.
By the way, this is the kind of discussion you could not have on a
platform like Discord. The last time I was on there, the server Ts&Cs had
prohibitions against talking about web-scraping, since so many websites
didn’t like it.