Scraping URLs with BeautifulSoup
https://hackingandslacking.com/scraping-urls-with-beautifulsoup-e794a555bb0f
There are plenty of reliable and open sources of data on the web. Datasets are freely released to the public domain by the likes of Kaggle, Google Cloud, and of course local & federal government. Like most things free and open, however, following the rules to obtain public data can be a bit… boring. I’m not suggesting we go and blatantly break some grey-area laws by stealing data, but this blog isn’t exactly called People Who Play It Safe And Slackers, either.
My personal Python roots can actually be traced back to an ambitious side-project: to aggregate all new music from across the web and deliver it the masses. While that project may have been abandoned (after realizing it already existed), BeautifulSoup was more-or-less my first ever experience with Python.