class: center, middle, inverse, title-slide # First Steps on Using R as a GIS ## Introduction ### Stefan Jünger
July 08, 2021 --- layout: true --- ## About This Course <br> <br> This (short) course will teach you how to exploit `R` and apply some of its geospatial techniques.* By the end of this course, you should... - be less frightened with using geospatial data in `R` - including importing, wrangling, and exploring geospatial data - be able to create (simple) maps based on your very own processed geospatial data in `R` .footnote[*Some materials are part of a previous workshop, for which I'd like to thank [Anne-Kathrin Stroppe](https://twitter.com/AStroppe).] --- ## Prerequisites for This Course .pull-left[ - At least basic knowledge of `R`, its syntax, and internal logic - Affinity for using script-based languages - Don't be scared to wrangle data with complex structures - Working versions of `R` (and `Rstudio`) on your computer - Ideally, with the packages installed, I asked you upfront ] .pull-right[ <br> <br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\i-love-programming.jpg" width="75%" style="display: block; margin: auto;" /> .tinyisher[.center[[Source](https://makeameme.org/meme/i-love-programming)]] ] --- ## About Me .pull-left[ <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\stefan.png" width="50%" style="display: block; margin: auto;" /> ] .pull-right[ - Postdoctoral researcher in the team Data Augmentation at the GESIS department Survey Data Curation - Ph.D. in social sciences, University of Cologne ] - Research interests: - quantitative methods - social inequalities & attitudes towards minorities - data management & data privacy - reproducible research .small[ [stefan.juenger@gesis.org](mailto:stefan.juenger@gesis.org) | [@StefanJuenger](https://twitter.com/StefanJuenger) | [https://stefanjuenger.github.io](https://stefanjuenger.github.io) ] --- ## About You .pull-left[ - What's your name? - Where do you work/research? What are you working on/researching? - What is your experience with `R` or other programming languages? - Do you already have experience with geospatial data? ] .pull-right[ <br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\so-tell-me-5be02d.jpg" width="800" style="display: block; margin: auto;" /> .tinyisher[.center[[Source](https://makeameme.org/meme/so-tell-me-5be02d)]] ] - **What do you want to use `R` and geospatial data for?** --- ## Preliminaries <br> <br> - The workshop consists of a combination of a few lectures and hands-on exercises - Feel free to ask questions at any time - Slides and other materials are available at: .center[`https://github.com/StefanJuenger/esra-workshop-first-steps-R-GIS`] --- ## Course Schedule <table class="table" style="margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:left;"> Time </th> <th style="text-align:left;"> Title </th> </tr> </thead> <tbody> <tr> <td style="text-align:left;"> 01:00pm-01:20pm </td> <td style="text-align:left;"> Introduction </td> </tr> <tr> <td style="text-align:left;"> 01:20pm-01:30pm </td> <td style="text-align:left;"> Exercise 1: R Warm up </td> </tr> <tr> <td style="text-align:left;"> 01:30pm-02:00pm </td> <td style="text-align:left;"> Data Processing & Spatial Linking </td> </tr> <tr> <td style="text-align:left;"> 02:00pm-02:30pm </td> <td style="text-align:left;"> Exercise 2: Geospatial Data Wrangling </td> </tr> <tr> <td style="text-align:left;"> 02:30pm-02:45pm </td> <td style="text-align:left;"> Break </td> </tr> <tr> <td style="text-align:left;"> 02:45pm-03:15pm </td> <td style="text-align:left;"> Easy Maps </td> </tr> <tr> <td style="text-align:left;"> 03:15pm-03:45pm </td> <td style="text-align:left;"> Excercise 3: Build your own map </td> </tr> <tr> <td style="text-align:left;"> 03:45pm-04:00pm </td> <td style="text-align:left;"> Closing, Q & A </td> </tr> </tbody> </table> --- ## Now <table class="table" style="margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:left;"> Time </th> <th style="text-align:left;"> Title </th> </tr> </thead> <tbody> <tr> <td style="text-align:left;background-color: yellow !important;"> 01:00pm-01:20pm </td> <td style="text-align:left;background-color: yellow !important;"> Introduction </td> </tr> <tr> <td style="text-align:left;"> 01:20pm-01:30pm </td> <td style="text-align:left;"> Exercise 1: R Warm up </td> </tr> <tr> <td style="text-align:left;"> 01:30pm-02:00pm </td> <td style="text-align:left;"> Data Processing & Spatial Linking </td> </tr> <tr> <td style="text-align:left;"> 02:00pm-02:30pm </td> <td style="text-align:left;"> Exercise 2: Geospatial Data Wrangling </td> </tr> <tr> <td style="text-align:left;"> 02:30pm-02:45pm </td> <td style="text-align:left;"> Break </td> </tr> <tr> <td style="text-align:left;"> 02:45pm-03:15pm </td> <td style="text-align:left;"> Easy Maps </td> </tr> <tr> <td style="text-align:left;"> 03:15pm-03:45pm </td> <td style="text-align:left;"> Excercise 3: Build your own map </td> </tr> <tr> <td style="text-align:left;"> 03:45pm-04:00pm </td> <td style="text-align:left;"> Closing, Q & A </td> </tr> </tbody> </table> --- ## What Are Geospatial Data? .pull-left[ Data with a direct spatial reference `\(\rightarrow\)` **geo-coordinates** - Information about geometries - Optional: Content in relation to the geometries Can be projected jointly in one single space - Allows data linking and extraction of substantial information ] .pull-right[ <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\fig_geometries.png" width="85%" style="display: block; margin: auto;" /> .tinyisher[Sources: OpenStreetMap / GEOFABRIK (2018), City of Cologne (2014), and the Statistical Offices of the Federation and the Länder (2016) / Jünger, 2019] ] --- ## Geospatial Data in This Course In the folder called `./data` in the same folder as the other materials for this workshop, you can find the data files prepped for all the exercises and slides. - Covid-19 cases for Cologne across the city's districts available at the [Open Data Portal of Cologne](https://www.offenedaten-koeln.de/dataset/corona-fallzahlen-koeln) - Hospital locations in Cologne are also distributed via the [Open Data Portal of Cologne](https://www.offenedaten-koeln.de/dataset/krankenh%C3%A4user-k%C3%B6ln) - Cologne's road network; [guess from where](https://www.offenedaten-koeln.de/dataset/strasse) - Number of immigrants and inhabitants from German Census 2011 data are provided by the [Federal Statistical Office Germany, Wiesbaden 2020](https://www.zensus2011.de/EN/Home/home_node.html) **Please make sure that if you reuse any of the provided data to cite the original data sources.** --- ## What Is GIS? Most common understanding: Geographic Information Systems (GIS) as specific software to process geospatial data for - Visualization - Analysis <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\gis.png" width="70%" style="display: block; margin: auto;" /> .center[.tinyisher[Screenshot of the Open Source GIS [`QGIS`](https://qgis.org)]] --- ## Data Specifics .pull-left[ </br> </br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\fig_3d_simple.png" width="9083" style="display: block; margin: auto;" /> .tinyisher[Sources: OpenStreetMap / GEOFABRIK (2018) and City of Cologne (2014)] ] .pull-right[ Formats - Vector data (points, lines, polygons) - Raster data (grids) Coordinate reference systems (CRS) - Allow the projection on earth's surface - Differ in precision for specific purposes ] --- ## Layers Must Match! .pull-left[ .center[ **Projected CRS** (EPSG:3857) ] ] .pull-right[ .center[ **Geographic CRS** (EPSG:3035) ] ] <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\fig_projections.png" width="9956" style="display: block; margin: auto;" /> .tinyisher[Source: Statistical Office of the European Union Eurostat (2018) / Jünger, 2019] --- ## Types of CRS <br> .pull-left[ **Projected CRS** - projection of geometries on a flat surface (planar) - distance between two points is a straight line ] .pull-right[ **Geographic CRS** - (unprojected) description of specific points on a sphere - distance between two points is a bent line ] <br> (There are also geocentric CRS requiring a z-coordinate...) --- class: middle <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\layers_meme.jpg" width="70%" style="display: block; margin: auto;" /> --- ## CRS Definitions Old Standard: `PROJ.4` Strings ``` +proj=laea +lat_0=52 +lon_0=10 +x_0=4321000 +y_0=3210000 +ellps=GRS80 +towgs84=0,0,0,0,0,0,0 +units=m +no_defs ``` -- .small[Source: https://epsg.io/3035] New Kid in Town: `WKT` (“Well Known Text”) .small[ ``` PROJCS["ETRS89 / LAEA Europe", GEOGCS["ETRS89", DATUM["European_Terrestrial_Reference_System_1989", SPHEROID["GRS 1980",6378137,298.257222101, AUTHORITY["EPSG","7019"]], TOWGS84[0,0,0,0,0,0,0], AUTHORITY["EPSG","6258"]], PRIMEM["Greenwich",0, AUTHORITY["EPSG","8901"]], UNIT["degree",0.0174532925199433, AUTHORITY["EPSG","9122"]], AUTHORITY["EPSG","4258"]], PROJECTION["Lambert_Azimuthal_Equal_Area"], PARAMETER["latitude_of_center",52], PARAMETER["longitude_of_center",10], PARAMETER["false_easting",4321000], PARAMETER["false_northing",3210000], UNIT["metre",1, AUTHORITY["EPSG","9001"]], AUTHORITY["EPSG","3035"]] ``` ] .tinyisher[Source: https://epsg.io/3035] --- ## EPSG Codes .pull-left[ Eventually, it's not as challenging to work with CRS in `R` as it may seem - we don't have to use PROJ.4 or WKT strings directly. Most of the times it's enough to use so-called EPSG Codes ("European Petroleum Survey Group Geodesy") - Small digit sequence ] .pull-right[ <br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\ImageEPSGlogo.gif" style="display: block; margin: auto;" /> .tinyisher[.center[[Source](http://geotiff.maptools.org/proj_list/guid7.html)]] ] --- ## More Details on Geospatial Data Let's learn about geospatial data as we learn about specific formats: - vector data - raster data .pull-left[ <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\Raster-and-Vector-data-types-as-representative-of-real-world-geographic-space_W640.jpg" width="75%" style="display: block; margin: auto;" /> .tinyisher[.center[[Source](https://dx.doi.org/10.13140/2.1.3030.1767)]] ] .pull-right[ Be assured: `R` can serve as a full-blown Geographic Information System (GIS) for all these different data types. ] --- ## `R` Packages for Geospatial Data There have been packages for geospatial data in `R` already for a long time. .pull-left[ - [`sp`](https://cran.r-project.org/web/packages/sp/index.html) for vector data - [`raster`](https://cran.r-project.org/web/packages/raster/index.html) for raster data - the successor: [`terra`](https://cran.r-project.org/web/packages/terra/index.html) Cutting-edge for vector data and raster data (cubes) - [`sf`](https://cran.r-project.org/web/packages/sf/index.html) - [`stars`](https://r-spatial.github.io/stars/) ] .pull-right[ <br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\sf.jpg" width="80%" style="display: block; margin: auto;" /> .tinyisher[Illustration by [Allison Horst](https://github.com/allisonhorst/stats-illustrations)] ] --- ## Packages in This Course We will use plenty of different packages during the course, but only a few are our main drivers (e.g., the `sf` package). Here's the list of packages - [`dplyr`](https://cran.r-project.org/web/packages/dplyr/index.html) - [`mapview`](https://cran.r-project.org/web/packages/mapview/index.html) - [`sf`](https://cran.r-project.org/web/packages/sf/index.html) - [`stars`](https://cran.r-project.org/web/packages/stars/index.html) - [`tmap`](https://cran.r-project.org/web/packages/tmap/index.html) - [`tmaptools`](https://cran.r-project.org/web/packages/tmaptools/index.html) *Note*: Some additional packages will be installed as dependencies. --- ## What You'll See During the Course: Piping In `R` Usually, in `R` we apply functions as follows: ```r f(x) ``` In the logic of pipes this function is written as: ```r x %>% f(.) ``` -- We can use pipes on more than one function: ```r x %>% f_1() %>% f_2() %>% f_3() ``` More details: https://r4ds.had.co.nz/pipes.html --- class: middle ## Exercise 1: R Warm-Up [Exercise](https://stefanjuenger.github.io/esra-workshop-first-steps-R-GIS/exercises/1_R_Warm_Up.html) [Solution](https://stefanjuenger.github.io/esra-workshop-first-steps-R-GIS/solutions/1_R_Warm_Up.html) --- layout: false class: center background-image: url(data:image/png;base64,#../assets/img/the_end.png) background-size: cover .left-column[ </br> <img src="data:image/png;base64,#C:\Users\mueller2\talks_presentations\esra-workshop-first-steps-R-GIS\content\img\stefan.png" width="90%" style="display: block; margin: auto;" /> ] .right-column[ .left[.small[<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M464 64H48C21.49 64 0 85.49 0 112v288c0 26.51 21.49 48 48 48h416c26.51 0 48-21.49 48-48V112c0-26.51-21.49-48-48-48zm0 48v40.805c-22.422 18.259-58.168 46.651-134.587 106.49-16.841 13.247-50.201 45.072-73.413 44.701-23.208.375-56.579-31.459-73.413-44.701C106.18 199.465 70.425 171.067 48 152.805V112h416zM48 400V214.398c22.914 18.251 55.409 43.862 104.938 82.646 21.857 17.205 60.134 55.186 103.062 54.955 42.717.231 80.509-37.199 103.053-54.947 49.528-38.783 82.032-64.401 104.947-82.653V400H48z"></path></svg> [`stefan.juenger@gesis.org`](mailto:stefan.juenger@gesis.org)] </br> .small[<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M459.37 151.716c.325 4.548.325 9.097.325 13.645 0 138.72-105.583 298.558-298.558 298.558-59.452 0-114.68-17.219-161.137-47.106 8.447.974 16.568 1.299 25.34 1.299 49.055 0 94.213-16.568 130.274-44.832-46.132-.975-84.792-31.188-98.112-72.772 6.498.974 12.995 1.624 19.818 1.624 9.421 0 18.843-1.3 27.614-3.573-48.081-9.747-84.143-51.98-84.143-102.985v-1.299c13.969 7.797 30.214 12.67 47.431 13.319-28.264-18.843-46.781-51.005-46.781-87.391 0-19.492 5.197-37.36 14.294-52.954 51.655 63.675 129.3 105.258 216.365 109.807-1.624-7.797-2.599-15.918-2.599-24.04 0-57.828 46.782-104.934 104.934-104.934 30.213 0 57.502 12.67 76.67 33.137 23.715-4.548 46.456-13.32 66.599-25.34-7.798 24.366-24.366 44.833-46.132 57.827 21.117-2.273 41.584-8.122 60.426-16.243-14.292 20.791-32.161 39.308-52.628 54.253z"></path></svg> [`@StefanJuenger`](https://twitter.com/StefanJuenger)] </br> .small[<svg viewBox="0 0 496 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M165.9 397.4c0 2-2.3 3.6-5.2 3.6-3.3.3-5.6-1.3-5.6-3.6 0-2 2.3-3.6 5.2-3.6 3-.3 5.6 1.3 5.6 3.6zm-31.1-4.5c-.7 2 1.3 4.3 4.3 4.9 2.6 1 5.6 0 6.2-2s-1.3-4.3-4.3-5.2c-2.6-.7-5.5.3-6.2 2.3zm44.2-1.7c-2.9.7-4.9 2.6-4.6 4.9.3 2 2.9 3.3 5.9 2.6 2.9-.7 4.9-2.6 4.6-4.6-.3-1.9-3-3.2-5.9-2.9zM244.8 8C106.1 8 0 113.3 0 252c0 110.9 69.8 205.8 169.5 239.2 12.8 2.3 17.3-5.6 17.3-12.1 0-6.2-.3-40.4-.3-61.4 0 0-70 15-84.7-29.8 0 0-11.4-29.1-27.8-36.6 0 0-22.9-15.7 1.6-15.4 0 0 24.9 2 38.6 25.8 21.9 38.6 58.6 27.5 72.9 20.9 2.3-16 8.8-27.1 16-33.7-55.9-6.2-112.3-14.3-112.3-110.5 0-27.5 7.6-41.3 23.6-58.9-2.6-6.5-11.1-33.3 2.6-67.9 20.9-6.5 69 27 69 27 20-5.6 41.5-8.5 62.8-8.5s42.8 2.9 62.8 8.5c0 0 48.1-33.6 69-27 13.7 34.7 5.2 61.4 2.6 67.9 16 17.7 25.8 31.5 25.8 58.9 0 96.5-58.9 104.2-114.8 110.5 9.2 7.9 17 22.9 17 46.4 0 33.7-.3 75.4-.3 83.6 0 6.5 4.6 14.4 17.3 12.1C428.2 457.8 496 362.9 496 252 496 113.3 383.5 8 244.8 8zM97.2 352.9c-1.3 1-1 3.3.7 5.2 1.6 1.6 3.9 2.3 5.2 1 1.3-1 1-3.3-.7-5.2-1.6-1.6-3.9-2.3-5.2-1zm-10.8-8.1c-.7 1.3.3 2.9 2.3 3.9 1.6 1 3.6.7 4.3-.7.7-1.3-.3-2.9-2.3-3.9-2-.6-3.6-.3-4.3.7zm32.4 35.6c-1.6 1.3-1 4.3 1.3 6.2 2.3 2.3 5.2 2.6 6.5 1 1.3-1.3.7-4.3-1.3-6.2-2.2-2.3-5.2-2.6-6.5-1zm-11.4-14.7c-1.6 1-1.6 3.6 0 5.9 1.6 2.3 4.3 3.3 5.6 2.3 1.6-1.3 1.6-3.9 0-6.2-1.4-2.3-4-3.3-5.6-2z"></path></svg> [`StefanJuenger`](https://github.com/StefanJuenger)] </br> .small[<svg viewBox="0 0 576 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M280.37 148.26L96 300.11V464a16 16 0 0 0 16 16l112.06-.29a16 16 0 0 0 15.92-16V368a16 16 0 0 1 16-16h64a16 16 0 0 1 16 16v95.64a16 16 0 0 0 16 16.05L464 480a16 16 0 0 0 16-16V300L295.67 148.26a12.19 12.19 0 0 0-15.3 0zM571.6 251.47L488 182.56V44.05a12 12 0 0 0-12-12h-56a12 12 0 0 0-12 12v72.61L318.47 43a48 48 0 0 0-61 0L4.34 251.47a12 12 0 0 0-1.6 16.9l25.5 31A12 12 0 0 0 45.15 301l235.22-193.74a12.19 12.19 0 0 1 15.3 0L530.9 301a12 12 0 0 0 16.9-1.6l25.5-31a12 12 0 0 0-1.7-16.93z"></path></svg> [`https://stefanjuenger.github.io`](https://stefanjuenger.github.io)]] </br> ]