Class 3

EDGAR Explorer I

Wednesday, October 2, 2024

Class Overview

We will aim to complete our EDGAR Explorer case in both python and Alteryx. Combined with our AlphaVantage API keys, we will have built a tool that extracts real-time data sourced from the U.S. Securities and Exchange Commission's (SEC) EDGAR database and stock return data sourced from APIs. This will allow us to examine the data and plan how to transform this valuable data into a form we can use in our subsequent analysis.

Why is this important?
This sequence of classes where we Extract, Transform, and Load data for subsequent analysis is crucial for graduate accounting students because it provides them with a solid foundation in core analytical processes essential for modern financial analysis. Mastery of techniques such as filtering, aggregation, and merging enables students to efficiently work with large, complex datasets and derive meaningful insights. In a world where data-driven decision-making is increasingly important, these skills will prepare students to handle real-world financial data with confidence, enabling them to analyze corporate filings, assess financial performance, and make informed, strategic decisions in their future roles as accounting professionals.

Class Materials and Details

Materials:

Case: EDGAR Explorer
Slides: will be available for download by the beginning of class in either powerpoint or pdf formats.
Data: A data update may be required for this class. To ensure your files are the most up-to-date, navigate to ACCTG522_Labs folder and run the command git pull.
Analytics Tools: Git and GitHub

Review and Extension:
We will continue to work with extracting real time data using APIs. We will finish off our data extraction exercise for the SEC_Submissions.py and begin to replicate this process using Alteryx.

Preparation:
  1. There is no required preparation for this class

Class Plan:
Teams: during this class, please sit in your discussion teams.
  1. After a very brief review, this class will be primarily in the labs, continuing the large data extraction task of obtaining real time data from the SEC.