Category

Data Extraction

Category

Data scraping data, also known as web scraping is the process of extracting data from a website programmatically. The destination of the extracted data can vary, in some cases to channel that data to another website but is commonly saved to a spreadsheet or local file saved on your computer. It’s one of the most efficient ways to get data from the web aside from directly querying a REST API. We are going to be extracting Premier League data such as: All-time Top Scorers2020-21 – League Table2020/21 Top Scorers Table of Contents ? Importing LibrariesMethod 1: HTML Table ScrapingMethod 2: Beautiful Soup ScrapingMethod 3: Using an API and JSONs Importing Libraries ? We are going to be using the official Premier League website to extract the data we need. The libraries we are using are as follows: Pandas: This will be used to generate our dataframesNumpy: Numpy will be used to calculate our numeric…