Introduction

# Introduction
### Applied Machine Learning with R <a href='https://therbootcamp.github.io'> The R Bootcamp </a> <a href='https://therbootcamp.github.io/AML_2021AMLD/'> </a>  <a href='https://therbootcamp.github.io'> </a>  <a href='mailto:therbootcamp@gmail.com'> </a>  <a href='https://www.linkedin.com/company/basel-r-bootcamp/'> </a>
### November 2021

---

<div class="my-footer">
 
 
 <img src="https://raw.githubusercontent.com/therbootcamp/therbootcamp.github.io/master/_sessions/_image/by-sa.png" height=14 style="vertical-align: middle"/>
 
 <a href="https://therbootcamp.github.io/">
 
 
 www.therbootcamp.com
 
 
 </a>
 <a href="https://therbootcamp.github.io/">
 
 Applied Machine Learning with R | November 2021
 
 </a>
 
 </div>

---

# What is machine learning?

]

<img src="image/ml_robot.jpg" height=380px> 
from <a href="https://medium.com/@dkwok94/machine-learning-for-my-grandma-ca242e97ef62">medium.com</a>

]

---

# What is machine learning?

Machine learning is...

...a <high>field of artificial intelligence</high>...

...that uses <high>statistical techniques</high>...

...to allow computer systems to <high>"learn"</high>,...

...i.e., to progressively <high>improve performance</high> on a specific task...

...from small or large amounts of <high>data</high>,...

....<high>without being explicitly programmed</high>....

....with the goal to <high>discover structure</high> or </high>improve decision making and predictions</high>.

]

]

---

# Origin of ML

---

# Types of machine learning tasks

<ul>
 <li class="m1">There are many types of machine learning tasks, each of which call for different models.</li>
 <li class="m2"><high>We will focus on supervised machine learning</high>.</li>
</ul>

]

<img src="image/mltypes.png" height=500px> 
from <a href="image/mltypes.png">amazonaws.com</a>

]

---

# Unsupervised learning

<ul>
 <li class="m1">Analyzes the relationships to <high>discover structures</high> such as groups or meta-features.</li>
 <ul class="level">
 <li><high>Clustering</high> - similarity between cases.</li>
 <li><high>Dimensionality reduction</high> - similarity between features.</li>
 </ul>
</ul>

<tr>
 <td bgcolor="white">
 Approach
 </td>
 <td bgcolor="white">
 Description
 </td> 
 <td bgcolor="white">
 Example
 </td> 
</tr>
<tr>
 <td bgcolor="white">
 Clustering
 </td>
 <td bgcolor="white">
 Analyze distances between cases to identify <high>clusters of homogeneous cases</high>.
 </td> 
 <td bgcolor="white">
 Types of customers or patients.
 </td> 
</tr>
<tr>
 <td bgcolor="white">
 Dimension- ality reduction
 </td>
 <td bgcolor="white">
 Analyze correlations between features to identify <high>higher order features</high>. 
 </td> 
 <td bgcolor="white">
 Dimensions of personality or user experience.
 </td> 
</tr>
</table>

]

]

---

# Reinforcement learning

<ul>
 <li class="m1"><high>Learns iteratively</high> from minimal supervision provided by <high>performance feedback</high>.</li>
 <li class="m2">RL is closely related to <high>psychological theories of learning</high>.</li>
</ul>

Examples

<table style="cellspacing:0; cellpadding:0; border:none;">
 <col width="30%">
 <col width="70%">
<tr>
 <td bgcolor="white">
 Application
 </td>
 <td bgcolor="white">
 Description
 </td> 
</tr>
<tr>
 <td bgcolor="white">
 Model fitting
 </td>
 <td bgcolor="white">
 Iteratively <high>change model parameters</high> to improve prediction. 
</tr>
<tr>
 <td bgcolor="white">
 Robot movements
 </td>
 <td bgcolor="white">
 Iteratively <high>change movement</high> patterns to increase pancake-catch probability. 
</tr>
<tr>
 <td bgcolor="white">
 Games
 </td>
 <td bgcolor="white">
 Iteratively <high>change controller input</high> patterns to improve Mario Kart racing time. 
</tr>
</table>

]

<img src="image/roboarm.gif" width=320px> 
from <a href="https://giphy.com/explore/reinforcement-learning">giphy.com</a>

<img src="image/mariokart.gif" width=320px> 
from <a href="https://blogs.nvidia.com/blog/2017/04/14/tensorkart-ai-mario-kart/">nvidia.com</a>

]

---

# Supervised learning

<ul>
 <li class="m1"><high>The <high>dominant type</high> of machine learning.</li>
 <li class="m2">Supervised learning uses <high>labeled data</high> to learn <high>a model</high> that relates the criterion to the features.</li>
</ul>

]

<img src="image/supervised.png"> 

]

---

# 2 types of supervised problems

There are two types of supervised learning problems typically can be approached <high>using the same model</high>.

Regression

Regression problems involve the <high>prediction of a quantitative feature</high>.

E.g., predicting the cholesterol level as a function of age.

Classification

Classification problems involve the <high>prediction of a categorical feature</high>.

E.g., predicting the origin of chest pain as a function of age and heart attack risk.

]

]

---

# Three supervised models

---

# ML in R

<ul>
 <li class="m1">R has advanced tremendously with respect to ML.</li>
 <li class="m2">There exist <high>powerful and user-friendly</high> tools for all ML steps and algorithms.</li>
</ul>

]

]

---

# tidymodels

<ul>
 <li class="m1"><mono>tidymodels</mono> is a new meta-package for tidy ML in R.</li>
 <li class="m2">Multiple packages span every important step of ML.</li>
</ul>

<img src="https://www.tidymodels.org/images/tidymodels.png" width=180px> 
from <a href="https://www.tidymodels.org/packages/">tidymodels.org</a>

]

]

---

<h1><a href=https://therbootcamp.github.io/AML_2021AMLD/index.html>Schedule</a></h1>