๐Ÿ‘ฉโ€๐Ÿ’ป What is Web Scraping?


It really is as cool as it seems.

๐Ÿ”ง Technical Terms

Web scraping is a term used for scraping data off websites using code. Think of a list of products and prices - then putting that data into a spreadsheet, CSV, or database.

๐Ÿ”จ Tools

There is a ton of tools out there, especially in coding. But here is a list of things you will need to start web scraping.

๐Ÿ“š Libraries: Pandas (database) and Beautiful Soup (web scraping)

๐Ÿ‘ฉโ€๐Ÿ’ป Code Editor: VS Code

๐Ÿ“ Code Storage: Git and GitHub

๐Ÿ’ก Skills

You actually need to know more than just Python. Here is a list of skills you may need to brush up on.

๐Ÿ‘ฉโ€๐Ÿ’ป Python: Importing libraries

๐Ÿผ Pandas: Converting to a CSV

๐Ÿฒ Beautiful Soup: Finding elements

๐ŸŒ HTML: Types of elements (div, span, p)

๐ŸŽจ CSS: Classes

๐ŸŒŽ Real World Application

There are websites filled with cost information like Platt, an electrical parts website. You might want to stick all these items and their prices into an excel spreadsheet.

๐Ÿ‘ฉโ€๐Ÿ’ป My Code

Here is my base code in order to start web scraping. You can edit the baseURL variable as needed for other websites. Then you need to inspect the code on their website to see how they tagged(through CSS classes) the HTML element you want to grab.

Platt-Web-Scraping/ConduitDuctRaceway.py at main ยท lizzyrowe1107/Platt-Web-Scraping
Contribute to lizzyrowe1107/Platt-Web-Scraping development by creating an account on GitHub.

๐ŸŽฌ Video

I just finished filming a video on web scraping. It will come out next Monday. I walk through the exact code I wrote!