The Biggest Mistake Beginners Make When Web Scraping

Published: 04 May 2022
on channel: John Watson Rooney
129,140
3.4k

The first 1,000 people to use the link or my code johnwatsonrooney will get a 1 month free trial of Skillshare: https://skl.sh/johnwatsonrooney05221

The biggest mistake I see beginners make is trying to scrape from the frontend system, and not the backend.

The most common thing I see when I am asking about how to scrape a certain site is people struggling to get data from the front end of a modern website. These sites are using SPA (single page applications) that rely heavily on JavaScript or JS framework to load up the data. This way, whilst sometimes possible is just not option for extracting the data. We need to understand a little bit about how these sites work so we can use the right approach or method to accessing the data.

https://github.com/jhnwr/billionaires...

Scraper API I use: https://www.scrapingbee.com/?fpr=jhnwr
DISCORD (NEW):   / discord  
Patreon:   / johnwatsonrooney  
Proxies: https://proxyscrape.com/?ref=jhnwr
Hosting: Digital Ocean: https://m.do.co/c/c7c90f161ff6

-------------------------------------
Disclaimer: These are affiliate links and as an Amazon Associate I earn from qualifying purchases
-------------------------------------

Video

Camera: https://amzn.to/3b5N1ej
Lens: https://amzn.to/3QAm7LT
Lighting: https://amzn.to/3Oivn64

Audio

Interface: https://amzn.to/3OqEqRQ
Microphone: https://amzn.to/3xISfEC
Headphones: https://amzn.to/3zQrwZv

PC

Case: https://amzn.to/3dEz6Jw
PSU: https://amzn.to/3kc7SfB
CPU: https://amzn.to/2ILxGSh
Mobo: https://amzn.to/3lWmxw4
Ram: https://amzn.to/31muxPc
GFX card: https://amzn.to/2SKYraW
32" monitor: https://amzn.to/3sqgq98 (main)
27" monitor: https://amzn.to/2GAH4r9 (vertical)
24" monitor: https://amzn.to/3jIFamt (spare)
Dual monitor arm: https://amzn.to/3wpBvlg
Mouse: https://amzn.to/2SH1ssK
Keyboard: https://amzn.to/3A1B8jU