class: center, middle, inverse, title-slide

# Why R?
### Introduction to Modern Data Analysis with R
Basel R Bootcamp
### November 2019

---

layout: true

<div class="my-footer">
<span style="text-align:center">
<span> <img src="https://raw.githubusercontent.com/therbootcamp/therbootcamp.github.io/master/_sessions/_image/by-sa.png" height=14 style="vertical-align: middle"/> </span>
<a href="https://therbootcamp.github.io/"> <span style="padding-left:82px"> <font color="#7E7E7E"> www.therbootcamp.com </font> </span> </a>
<a href="https://therbootcamp.github.io/"> <font color="#7E7E7E"> Introduction to Modern Data Analysis with R | November 2019 </font> </a>
</span>
</div>

---

# The data revolution

.pull-left4[

<i>"Fuel of the future - Data is giving rise to a new economy."</i><br>
The Economist, May 2017
<br><br>
<i>"Wie Big Data die Finanzmärkte verändern könnte"</i> (How big data could change the financial markets)<br>
NZZ, August 2018
<br><br>
<i>"Machine learning will be the engine of global growth."</i><br>
Financial Times, July 2018

]

.pull-right55[

<p align = "center">
<img src="image/dataworld_sm.png" height = 375px><br>
<font style="font-size:10px">from <a href="https://www.shutterstock.com/video/clip-19206613-world-map-shining-flying-stars-particles-create">shutterstock.com</a></font>
</p>

]

---

.pull-left2[

# Data analysts wanted

]

.pull-right7[

<p align = "center">
<br><br>
<img src="image/jobs2022.png" height = 520px><br>
<font style="font-size:10px">adapted from <a href="https://www.weforum.org/agenda/2018/09/future-of-jobs-2018-things-to-know/">weforum.org</a></font>
</p>

]

---

# Data are revolutionizing medicine

<p align = "center" style="padding-top:30px">
<img src="image/cancer.png" height = 400px><br>
<font style="font-size:10px">from <a href="https://www.researchgate.net/publication/333228756_End-to-end_lung_cancer_screening_with_three-dimensional_deep_learning_on_low-dose_chest_computed_tomography">Ardila et al.
(2019)</a></font>
</p>

---

# Data drive sales

<p align = "center" style="padding-top:30px">
<img src="image/skin.png" height = 400px><br>
<font style="font-size:10px">from <a href="https://venturebeat.com/2018/07/19/how-olay-used-ai-to-double-its-conversion-rate/">venturebeat.com</a></font>
</p>

---

# Data intrude on our privacy

<p align = "center" style="padding-top:30px">
<img src="image/personality.png" height = 400px><br>
<font style="font-size:10px">from <a href="https://www.pnas.org/content/112/4/1036">Youyou, Kosinski, & Stillwell (2015)</a></font>
</p>

---

# Data have clear advantages

<br>
<table class="tg" style="cellspacing:0; cellpadding:0; border:none;">
<tr valign="top">
<td style="padding:10px">
<p align = "center">
<font style="font-size:28px"><i>Precise</i></font><br><br>
<img src="image/clockwork.png" height = 225px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://stock.adobe.com/ee/search/images?k=clockwork">stock.adobe.com</a></font>
</p>
</td>
<td style="padding:10px">
<p align = "center">
<font style="font-size:28px"><i>Reproducible</i></font><br><br>
<img src="image/bottles.png" height = 225px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://www.dreamstime.com/beer-filling-brewery-conveyor-belt-glass-bottles-machine-image106996530">dreamstime.com</a></font>
</p>
</td>
<td style="padding:10px">
<p align = "center">
<font style="font-size:28px"><i>Objective</i></font><br><br>
<img src="image/datastartrek.png" height = 225px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://memory-alpha.fandom.com/wiki/Data">memory-alpha.fandom.com</a></font>
</p>
</td>
</tr>
</table>

---

# The 3 pillars of the data revolution

<table class="tg" style="cellspacing:0; cellpadding:0; border:none;">
<tr valign="top">
<td style="padding:10px">
<p align = "center">
<font
style="font-size:28px"><i>Data</i></font><br><br>
<img src="image/data.png" height = 260px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://www.rathenau.nl/en/digital-society/data-driven-cities">rathenau.nl</a></font>
</p>
</td>
<td style="padding:10px">
<p align = "center">
<font style="font-size:28px"><i>Computing</i></font><br><br>
<img src="image/server.png" height = 260px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://cei.org/file/internet-server-farm">cei.org</a></font>
</p>
</td>
<td style="padding:10px">
<p align = "center">
<font style="font-size:28px"><i>Tools</i></font><br><br>
<img src="image/code.png" height = 260px width=300px style="border-radius:20%"><br>
<font style="font-size:10px">adapted from <a href="https://www.ionos.de/digitalguide/websites/web-entwicklung/code-editoren/">ionos.de</a></font>
</p>
</td>
</tr>
</table>

---

.pull-left3[

# The volume of data is growing rapidly

]

.pull-right65[

<br><br><br>
<p align = "center">
<img src="image/bigdatagrowth.png" height = 480px><br>
<font style="font-size:10px">from <a href="https://blog.siib.ac.in/changing-world-development-of-artificial-intelligence/">blog.siib.ac.in</a></font>
</p>

]

---

# Computing is rapidly getting faster and cheaper

<table class="tg" style="cellspacing:0; cellpadding:0; border:none;">
<tr valign="top">
<td style="padding-bottom:60px;padding-right:50px;vertical-align:bottom">
<p align = "center">
<img src="image/kurzweil.png" height = 360px ><br>
<font style="font-size:10px">Ray Kurzweil, adapted from <a href="https://www.wsj.com/articles/ray-kurzweil-looks-into-the-future-1401490952">wsj.com</a></font>
</p>
</td>
<td style="padding-bottom:60px;padding-left:50px;vertical-align:bottom">
<p align = "center">
<img src="image/kurzweilcurve.png" height = 420px ><br>
<font style="font-size:10px">from <a
href="http://www.americanprof.net/apn-ai/index.php/tool-box/ray-kurzweil">americanprof.net</a></font>
</p>
</td>
</tr>
</table>

---

# More and more (good) tools

<p align = "center">
<img src="image/bigdatalandscape.png" height = 460px><br>
<font style="font-size:10px">adapted from <a href="https://mattturck.com/data2019/">mattturck.com</a></font>
</p>

---

# Point-and-click tools

<br>
<table style="cellspacing:0; cellpadding:0; border:none; width:70%">
<col width="20%">
<col width="20%">
<col width="20%">
<tr style="padding:20px;background-color:white">
<td style="padding:20px;text-align:center"> <img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/72/Microsoft_Excel_Logo.svg/2000px-Microsoft_Excel_Logo.svg.png" height = 150px> </td>
<td style="padding:20px;text-align:center"> <img src="https://upload.wikimedia.org/wikipedia/commons/7/78/SPSS_An_IBM_Company_logo.svg" height = 150px> </td>
<td style="padding:20px;text-align:center"> <img src="https://www.alignbi.com/wp-content/uploads/2019/02/tableau-logo-copy.png" height = 150px> </td>
</tr>
<tr style="padding:20px;background-color:white">
<td style="padding:20px;text-align:center"> <img src="https://pbs.twimg.com/profile_images/876198719397474305/DKUPgGWz_400x400.jpg" height = 150px> </td>
<td style="padding:20px;text-align:center"> <img src="https://upload.wikimedia.org/wikipedia/commons/thumb/0/0d/JASP_logo.svg/1200px-JASP_logo.svg.png" height = 150px> </td>
<td style="padding:20px;text-align:center"> <img src="https://www.excelofficeservices.com/wp-content/uploads/2018/11/IBM-Watson-Logo.jpeg" height = 150px> </td>
</tr>
</table>

---

.pull-left3[

# The data science process

]

.pull-right7[

<br><br>
<p align = "center">
<img src="image/datasciencewheel.png" height = 480px><br>
<font style="font-size:10px">from <a href="https://www.bytelion.com/services/datascience/">bytelion.com</a></font>
</p>

]

---

# Syntax-based tools

<br><br>
<p align = "center">
<img src="image/RvsPython.png" height = 300px><br>
<font
style="font-size:10px">from <a href="https://www.sharpsightlabs.com/blog/r-vs-python/">sharpsightlabs.com</a></font>
</p>

---

# Demand

.pull-left45[

<p align = "center">
<img src="image/indeed2019.png" height = 400px><br>
<font style="font-size:10px">from <a href="http://r4stats.com/articles/popularity/">r4stats.com</a></font>
</p>

]

.pull-right45[

<p align = "center">
<img src="image/indeed2019change.png" height = 400px><br>
<font style="font-size:10px">from <a href="http://r4stats.com/articles/popularity/">r4stats.com</a></font>
</p>

]

---

# Characteristics

<font style="font-size:16px">see also this <a href="image/rversuspython.jpeg">infographic</a></font>
<br><br>
<table style="cellspacing:0; cellpadding:0; border:none; width:90%">
<col width="35%">
<tr style="padding:20px;background-color:white">
<td> </td>
<td style="padding:10px;text-align:center"> <font style="font-size:20px"><b>Used by</b></font> </td>
<td style="padding:10px;text-align: center"> <font style="font-size:20px"><b>Designed for</b></font> </td>
<td style="padding:10px;text-align: center"> <font style="font-size:20px"><b>Better at</b></font> </td>
<td style="padding:10px;text-align: center"> <font style="font-size:20px"><b>Environment</b></font> </td>
<td style="padding:10px;text-align: center"> <font style="font-size:20px"><b>For whom?</b></font> </td>
</tr>
<tr style="padding:20px;background-color:white">
<td style="padding:10px;text-align: left"> <img src="image/R.png" height = 100px width = 120px> </td>
<td style="padding:10px;text-align: center"> Scientists, statisticians, and analysts </td>
<td style="padding:10px;text-align: center"> Data analysis </td>
<td style="padding:10px;text-align: center"> Productivity and reporting </td>
<td style="padding:10px;text-align: center"> RStudio </td>
<td style="padding:10px;text-align: center"> Programming beginners </td>
</tr>
<tr style="padding:20px;background-color:white">
<td style="padding:10px;text-align: left"> <img
src="image/Python.png" height = 100px width = 120px> </td>
<td style="padding:10px;text-align: center"> Software developers and programmers </td>
<td style="padding:10px;text-align: center"> Systems programming </td>
<td style="padding:10px;text-align: center"> Embedding in systems and apps </td>
<td style="padding:10px;text-align: center"> Jupyter Notebooks </td>
<td style="padding:10px;text-align: center"> Experienced programmers </td>
</tr>
</table>

---

# R will remain relevant

<br>

.pull-left45[

### Pro

1. **Open source** and free of charge.
2. **Community** (e.g., [stackoverflow](https://stackoverflow.com/))
3. **Extensibility** ([CRAN](https://cran.r-project.org/))
4. [**Tidyverse**](https://www.tidyverse.org/)
5. [**RStudio**](https://www.rstudio.com/)
6. **Productivity**: [LaTeX](https://www.latex-project.org/), [Markdown](https://daringfireball.net/projects/markdown/), [GitHub](https://github.com/)

]

.pull-right45[

### Former cons

1. **Inelegant** language is being reworked ([Tidyverse](https://www.tidyverse.org/))
2. **Slow** components are being replaced ([Rcpp](http://www.rcpp.org/))
3. **Bridges** to external tools/languages ([rPython](http://rpython.r-forge.r-project.org/), [tensorflow](https://tensorflow.rstudio.com/))

]

---

# Components of R

<br>
<table class="tg" style="cellspacing:0; cellpadding:0; border:none;" width="95%">
<col width="25%">
<col width="35%">
<col width="25%">
<tr valign="top">
<td style="padding:20px">
<p align = "center">
<font style="font-size:28px"><i>R</i></font><br><br>
<img src="image/R.png" height = 130px><br>
<font style="font-size:10px">adapted from <a href="https://cei.org/file/internet-server-farm">cei.org</a></font>
</p>
</td>
<td style="padding:20px">
<p align = "center">
<font style="font-size:28px"><i>RStudio</i></font><br><br>
<img src="image/rstudio2.png" height = 130px><br>
<font style="font-size:10px">adapted from <a href="https://rstudio.com/">rstudio.com</a></font>
</p>
</td>
<td style="padding:20px">
<p align = "center">
<font style="font-size:28px"><i>R Packages</i></font><br><br>
<img src="image/packages.png" height = 130px><br>
<font style="font-size:10px">adapted from <a href="https://towardsdatascience.com/ten-random-useful-things-in-r-that-you-might-not-know-about-54b2044a3868">towardsdatascience.com</a></font>
</p>
</td>
</tr>
</table>

---

class: middle, center

<h1><a href=https://therbootcamp.github.io/I2R_2019Nov/index.html>Schedule</a></h1>