Skip to content

Commit 51588dd

Browse files
committed
in the middle of editing
1 parent 49fbc61 commit 51588dd

File tree

4 files changed

+78
-54
lines changed

4 files changed

+78
-54
lines changed

config.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,9 +73,11 @@ episodes:
7373

7474
# Information for Learners
7575
learners:
76+
- setup.md
7677

7778
# Information for Instructors
7879
instructors:
80+
- instructor-notes.md
7981

8082
# Learner Profiles
8183
profiles:

episodes/01_introduction.md

Lines changed: 35 additions & 28 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,9 @@
11
---
22
title: "Introduction"
33
teaching: 10
4-
exercises: 2
4+
exercises: 0
55
---
66

7-
<span style="color: red;">Update teaching time and number of exercises.</span>
8-
97
:::::::::::::::::::::::::::::::::::::: questions
108

119
- How can the humanities benefit from data visualization?
@@ -17,45 +15,54 @@ exercises: 2
1715

1816
::::::::::::::::::::::::::::::::::::: objectives
1917

20-
- Introduce the use cases of data visualization for the humanities
21-
- Introduce the concept of statistical inference to humanities researchers
22-
- Teach humanities researchers to visualize data with python to infer information from it
23-
- Teach humanities researchers to use data visualization and statistical inference for data storytelling
18+
- Introduce the use cases of data visualization for the humanities.
19+
- Introduce the concept of statistical inference to humanities researchers.
20+
- Teach humanities researchers to visualize data with python to infer information from it.
21+
- Teach humanities researchers to use data visualization and statistical inference for data storytelling.
2422

2523
::::::::::::::::::::::::::::::::::::::::::::::::
2624

27-
### Who can benefit from this lesson?
28-
29-
<u>The main goal of this lesson is to demonstrate the importance of data visualization and how it can unlock a variety of learning and research pathways—ranging from exploratory data analysis and statistical inference to understanding machine learning processes and data storytelling.</u>
25+
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: instructor
26+
To warm up, conduct a brief brainstorming session to elicit potential answers to the
27+
questions above. Write the answers on the board and engage learners in a discussion about their background knowledge
28+
in each area.
3029

31-
If you're looking for ways to approximately predict specific values based on a given dataset for data storytelling, or if you've ever wondered how machine learning models that predict values (rather than categories) work, this lesson is for you. It will introduce you to the concept of statistical inference—a mathematical calculation used in predictive machine learning algorithms—through various data visualization techniques. These visualization methods will also enhance your data storytelling skills, not only in describing existing data but also in predicting values based on the available data.
30+
Depending on the backgrounds and interest areas of workshop attendees, you can focus on one or the other point
31+
from the text below.
32+
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
3233

33-
Data visualization is central to this lesson, serving as both the means and the goal. You’ll not only learn to write Python code and engage in hands-on data visualization, but also discover how to explore, understand, and predict dataset values through visualization techniques.
34+
### Who can benefit from this lesson?
3435

35-
### How is this lesson structured?
36+
The main goal of this lesson is to demonstrate the importance of data visualization and how it can unlock a
37+
variety of learning and research pathways—ranging from exploratory data analysis and statistical inference to
38+
understanding machine learning processes and data storytelling.
3639

37-
* The lesson begins with a brief overview of various graph types and their applications.
38-
* Next, you'll explore statistical inference and linear regression, which will help you understand correlations and make predictions based on datasets. These concepts also provide foundational insights into how machine learning models work.
39-
* Finally, you'll learn how to use visualization techniques to identify patterns within a dataset and extract statistical insights, bringing together the concepts from the previous sections and engaging in hands-on data visualization in Python.
40+
If you're looking for ways to approximately predict specific values based on a given dataset for data storytelling,
41+
or if you've ever wondered how machine learning models that predict values (rather than categories) work,
42+
this lesson is for you. It will introduce you to the concept of statistical inference—a mathematical
43+
calculation used in predictive machine learning algorithms—through various data visualization techniques.
44+
These visualization methods will also enhance your data storytelling skills, not only in describing existing
45+
data but also in predicting values based on the available data.
4046

41-
### What background knowledge do you need for this lesson?
47+
Data visualization is central to this lesson, serving as both the means and the goal. You’ll not only learn
48+
to write Python code and engage in hands-on data visualization, but also discover how to explore, understand,
49+
and predict dataset values through visualization techniques.
4250

43-
1. Basic acquaintance with Python
44-
2. Basic mathematical background
45-
3. Curiosity to learn more about Python programming, statistics and data storytelling
4651

47-
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: instructor
52+
### How is this lesson structured?
4853

49-
Use the text above as a warm-up activity. Conduct a brief brainstorming session to elicit potential answers to the
50-
questions above. Write the answers on the board and engage learners in a discussion about their background knowledge
51-
in each area.
54+
* The lesson begins with a brief overview of various graph types and their applications.
55+
* Next, you'll explore statistical inference and linear regression, which will help you understand correlations
56+
and make predictions based on datasets. These concepts also provide foundational insights into how machine
57+
learning models work.
58+
* Finally, you'll learn how to use visualization techniques to identify patterns within a dataset and extract
59+
statistical insights, bringing together the concepts from the previous sections and engaging in hands-on
60+
data visualization in Python.
5261

53-
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
5462

5563
::::::::::::::::::::::::::::::::::::: keypoints
56-
57-
- An overview of the lesson structure and objectives
58-
64+
- Getting to know each other.
65+
- An overview of the lesson structure and objectives.
5966
::::::::::::::::::::::::::::::::::::::::::::::::
6067

6168

episodes/02_graph_categories.md

Lines changed: 11 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,9 @@
11
---
22
title: 'Graph Categories'
3-
teaching: 10
4-
exercises: 2
3+
teaching: 15
4+
exercises: 0
55
---
66

7-
<span style="color: red;">Update teaching time and number of exercises.</span>
87

98
:::::::::::::::::::::::::::::::::::::: questions
109

@@ -20,6 +19,14 @@ exercises: 2
2019

2120
::::::::::::::::::::::::::::::::::::::::::::::::
2221

22+
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: instructor
23+
This episode is more meant for self study. You don't need to go into extensive detail about the content of this
24+
episode. Instead, focus on reviewing the graphs with the learners and ask if they are already familiar with
25+
them and their use cases. The most important graphs to highlight—those that will also be featured in the
26+
visualization section of this lesson—are scatter plots, bubble charts, and correlograms. Place greater emphasis
27+
on these and prepare the learners to create them in the visualization section.
28+
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
29+
2330
::::::::::::::::::::::::::::::::::::: challenge
2431

2532
## Question
@@ -34,23 +41,14 @@ Data visualization has multiple purposes. It can help you understand trends and
3441
It can also help you introduce a dataset to others in scientific texts or in data storytelling.
3542

3643
:::::::::::::::::::::::::::::::::
44+
3745
::::::::::::::::::::::::::::::::::::::::::::::::
3846

3947
Most graphs used for data visualization fall into one of the following four general categories, based on their function.
4048
In this lesson, we won’t cover how to create all of these graphs in Python, but will focus on a few that are
4149
useful for statistical inference and data storytelling with our specific dataset. However,
4250
it's helpful to know the names of these graphs and understand the contexts in which they can be applied.
4351

44-
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: instructor
45-
46-
This episode is more meant for self study. You don't need to go into extensive detail about the content of this episode. Instead, focus on reviewing the graphs
47-
with the learners and ask if they are already familiar with them and their use cases. The most important graphs to
48-
highlight—those that will also be featured in the visualization section of this lesson—are scatter plots,
49-
bubble charts, and correlograms. Place greater emphasis on these and prepare the learners to create them in the
50-
visualization section.
51-
52-
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
53-
5452
## 2.1. Explore Relationships between two or more Features
5553

5654
**Scatter Plot:** "A scatter plot (aka scatter chart, scatter graph) uses dots to represent values for two different

learners/setup.md

Lines changed: 30 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -2,32 +2,49 @@
22
title: Setup
33
---
44

5-
This lesson teaches you how to visualize data using Python.
5+
In this lesson, you'll explore the different types of graphs and
6+
their use cases. You'll then dive into the concept of statistical
7+
inference. Next, you'll get hands-on with Python coding to analyze
8+
the happiness and income dataset provided below. Finally, you'll
9+
use the graphs you've created to make informed estimates about
10+
countries not included in the dataset.
11+
12+
:::::::::::::::: callout
13+
### What background knowledge do you need for this lesson?
14+
15+
1. Basic acquaintance with Python
16+
2. Basic mathematical background
17+
3. Curiosity to learn more about Python programming, statistics and data storytelling
18+
::::::::::::::::::
619

720
## Dataset
821

22+
The dataset we are working with in this lesson originates from [Kaggle](https://www.kaggle.com/datasets/levyedgar44/income-and-happiness-correction).
923
If you wish to save the the dataset on your computer, go ahead and download the
1024
[Income and Happiness Correlation dataset](data/income_happiness_correlation.csv) and save it to your working directory.
11-
Otherwise, you can directly load it into your code using the following link:
25+
Otherwise, you can directly load it into your code later using the following link:
1226

1327
```
1428
https://raw.githubusercontent.com/Goli-SF/stat_inf_data_vis/tree/main/episodes/data/income_happiness_correlation.csv
1529
```
1630

17-
<span style="color: red;">The above link should be updated later.</span>
31+
<span style="color: red;">The above link should be updated later, when the lesson is pushed to the incubator.</span>
1832

19-
The dataset originates from [Kaggle](https://www.kaggle.com/datasets/levyedgar44/income-and-happiness-correction).
2033

2134
## Software Setup
2235

2336
::::::::::::::::::::::::::::::::::::::: discussion
2437

2538
### Python and Jupyter Notebook/Google Colab
2639

27-
To do the exercises in this lesson, you need an IDE (Integrated Development Environment). We recommend you use Jupyter Notebook or an equivalent
28-
(such as [Google Colab](https://colab.google/)).
40+
To do the exercises in this lesson, you need an IDE (Integrated Development Environment). We recommend you use
41+
Jupyter Notebook or cloud-based equivalent such as [Google Colab](https://colab.google/).
42+
43+
If you're using Google Colab, you don't need any installation. Just create a Google account - if you don't have one
44+
already -, create a new Colab notebook and start coding.
2945

30-
To install Jupyter Notebook and Python on your computer together, we recommend using [Anaconda](https://www.anaconda.com/download/success)
46+
Otherwise, to install Jupyter Notebook and Python on your computer together, we recommend using
47+
[Anaconda](https://www.anaconda.com/download/success).
3148
To do so, click on your operating system from the list below and follow the instructions.
3249

3350
:::::::::::::::::::::::::::::::::::::::::::::::::::
@@ -68,12 +85,12 @@ This [video tutorial](https://www.youtube.com/watch?v=TcSAln46u9U) can help you
6885
2. Download the Anaconda Installer with Python 3 for Linux.
6986
(The installation requires using the shell. If you aren't comfortable doing the installation yourself stop here and request help at the workshop.)
7087
3. Open a terminal window and navigate to the directory where the executable is downloaded (e.g., `cd ~/Downloads`).
71-
4. Type
72-
```bash Anaconda3-```
73-
and then press `Tab` to autocomplete the full file name. The name of file you just downloaded should appear.
74-
5. Press <kbd>Enter</kbd> (or <kbd>Return</kbd> depending on your keyboard). You will follow the text-only prompts. To move through the text,
75-
press <kbd>Spacebar</kbd>. Type `yes` and press enter to approve the license. Press <kbd>Enter</kbd> (or <kbd>Return</kbd>) to approve the default location
76-
for the files. Type `yes` and press <kbd>Enter</kbd> (or <kbd>Return</kbd>) to prepend Anaconda to your `PATH` (this makes the Anaconda distribution the default Python).
88+
4. Type `bash Anaconda3-` and then press <kbd>Tab</kbd> to autocomplete the full file name. The name of file you just
89+
downloaded should appear.
90+
5. Press <kbd>Enter</kbd> (or <kbd>Return</kbd> depending on your keyboard). You will follow the text-only prompts.
91+
To move through the text, press <kbd>Spacebar</kbd>. Type `yes` and press <kbd>Enter</kbd> (or <kbd>Return</kbd>)
92+
to approve the license. Press <kbd>Enter</kbd> (or <kbd>Return</kbd>) to approve the default location for the files.
93+
Type `yes` and press <kbd>Enter</kbd> (or <kbd>Return</kbd>) to prepend Anaconda to your `PATH` (this makes the Anaconda distribution the default Python).
7794
6. Close the terminal window.
7895

7996
::::::::::::::::::::::::

0 commit comments

Comments
 (0)