Week 08
Scraping
Lectures Notes:
- Parsing Unstructured Data: Scraping Static Websites: html – Jupyter Notebook
Slides
Readings
On HTMLs
CSS Selector Reference - w3schools.com
Web Scraping
- Tutorial: A Practical Introduction to Web Scraping in Python
- Tutorial: Python Web Scraping Using BeautifulSoup - dataquest.io
- Ultimate Guide to Web Scraping with Python Part 1: Requests and BeautifulSoup - LearnDataSci - Martin, Brendan
- On the legality of webscraping
- robots.txt
Changes in the APIs
https://inews.co.uk/news/twitter-researchers-delete-data-unless-pay-2364535
https://www.reddit.com/r/modnews/comments/134tjpe/reddit_data_api_update_changes_to_pushshift_access/
https://www.tiktok.com/legal/page/global/terms-of-service-research-api/en
https://www.bloomberg.com/news/articles/2022-06-23/meta-pulls-support-for-tool-used-to-keep-misinformation-in-check?leadSource=uverify%20wall