Software
We will use Jupyter notebook to run python. If you have not installed Jupyter notebook, please follow the instructions to install. For both windows and Mac, the easiest way is to install ANACONDA (recommend download and install the python 3.7 version). ANACONDA will automatically install python, Jupyter and few packages that are frequently used in data analytics to your computer. You can also choose to use
python -m pip install Jupyter
command to install if you already have python in your computer.After installing anaconda,
Windows
- Create a new folder on your desktop called “python_workshop”
- Start Anaconda Navigator.
- Launch Jupyter Notebook from Anaconda Navigator. It will automatically open your browser.
- Click into python_workshop folder under Files tab.
Macs
- Create a new folder “python_workshop” on your user home folder.
- Start Anaconda Navigator. Search “Anaconda” in Launchpad to find Anaconda Navigator icon.
- Launch Jupyter Notebook from Anaconda Navigator. It will automatically open your browser.
- Click into python_workshop folder under Files tab.
Now you successfully have the notebook open!
Data
Download this data to your computer:
Unzip the data file and copy all files to your python_workshop folder.About the data
The data for this lesson is a part of the Data Carpentry Business workshop. The soda data is collected from the State of Iowa. The titanic data is collected from Kaggle. Both data are open for public use, we have modified the data for the workshop
The data sets are stored in comma-separated values (CSV) format. We will learn:
- load that data into memory
- perform arithmetic calculation, aggregation and analytics with the data
- visualize the dataset
- construct linear regression.