Course Catalog
Fast Track to R Programming for Data Science
Code: TTDS6683
Duration: 3 Day
$2195 USD

OVERVIEW

R is a functional programming environment for business analysts and data scientists. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when the analyst has a statistical, numerical, or probabilities-based problem based on real data and they've pushed Excel past its limits.

Introduction to R Programming for Data Science & Analytics is a comprehensive hands-on course that presents common scenarios encountered in analysis and present practical solutions. In this course, special attention is paid to data science theory including AI grouping theory. A discussion of using R with AI libraries like Madlib are included.

DELIVERY FORMAT

This course is available in the following formats:

Virtual Classroom

Duration: 3 Day
Classroom

Duration: 3 Day

CLASS SCHEDULE

Delivery Format: Virtual Classroom
Date: Jun 12 2024 - Jun 14 2024 | 10:00 - 18:00 EST
Location: Online
Course Length: 3 Day

$ 2195

Delivery Format: Virtual Classroom
Date: Aug 14 2024 - Aug 16 2024 | 10:00 - 18:00 EST
Location: Online
Course Length: 3 Day

$ 2195

Delivery Format: Virtual Classroom
Date: Sep 25 2024 - Sep 27 2024 | 10:00 - 18:00 EST
Location: Online
Course Length: 3 Day

$ 2195

GOALS

This course provides indoctrination in the practical use of the umbrella of technologies that are on the leading edge of data science development focused on R and related tools.  Working in a hands-on learning environment, led by our expert practitioner, students will learn R and its ecosystem, and where it’s a better a tool than Excel.

This course is approximately 50% hands-on, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises.  Our engaging instructors and mentors are highly experienced practitioners who bring years of current "on-the-job" experience into every classroom.  Working in a hands-on learning environment, guided by our expert team, attendees will learn about and explore:

  • R Language and Mathematics
  • How to work with R Vectors
  • How to read and write data from files, and how to categorize data in factors
  • How to work with Dates and perform Date math
  • How to work with multiple dimensions and DataFrame essentials
  • Essential Data Science and how to use R with it
  • Visualization in R
  • How R can be used in Spark (Optional / Overview)
OUTLINE

  1. From Excel or SAS to R (Optional)
  • Common challenges with Excel / SAS
  • The R Environment
  • Hello, R
  1. Working with R Studio
  • Rshiny
  • Rpresentations
  • Rmarkdown
  1. R Basics
  • Simple Math with R
  • Working with Vectors
  • Functions
  • Comments and Code Structure
  • Using Packages
  1. Vectors
  • Vector Properties
  • Creating, Combining, and Iteratorating
  • Passing and Returning Vectors in Functions
  • Logical Vectors
  1. Reading and Writing
  • Text Manipulation
  • Factors
  1. Dates
  • Working with Dates
  • Date Formats and formatting
  • Time Manipulation and Operations
  1. Multiple Dimensions
  • Adding a second dimension
  • Indices and named rows and columns in a Matrix
  • Matrix calculation
  • n-Dimensional Arrays
  • Data Frames
  • Lists
  1. R in Data Science
  • AI Grouping Theory
  • K-means
  • Linear Regression
  • Logistic Regression
  • Elastic Net
  1. R with MadLib
  • Importing and Exporting static Data (CSV, Excel)
  • Using Libraries with CRAN
  • K-means with Madlib
  • Regression with Madlib
  • Other libraries
  1. Data Visualization
  • Powerful Data through Visualization: Communicating the Message
  • Techniques in Data Visualization
  • Data Visualization Tools
  • Examples
  1. Databases, Data lakes & Additional Topics
  • Building connections to Databases and Data lakes, for both Python and R (using Hive server)
  • Methods to “query” data from database and data lakes, for both Python and R
  • Creating and passing macro variables. Specifically, R sprint, paste, paste0, and paste3 (not sure of the equivalent in Python).
  • Optional - Time Permitting Topics
  1. R with Hadoop
  • Overview of Hadoop
  • Overview of Distributed Databases
  • Overview of Pig
  • Overview of Mahout
  • Exploiting Hadoop clusters with R
  • Hadoop, Mahout, and R
  1. Business Rule Systems
  • Rule Systems in the Enterprise
  • Enterprise Service Busses
  • Drools & Using R with Drools
  1. From Excel or SAS to R (Optional)
  • Common challenges with Excel / SAS
  • The R Environment
  • Hello, R
  1. Working with R Studio
  • Rshiny
  • Rpresentations
  • Rmarkdown
  1. R Basics
  • Simple Math with R
  • Working with Vectors
  • Functions
  • Comments and Code Structure
  • Using Packages
  1. Vectors
  • Vector Properties
  • Creating, Combining, and Iteratorating
  • Passing and Returning Vectors in Functions
  • Logical Vectors
  1. Reading and Writing
  • Text Manipulation
  • Factors
  1. Dates
  • Working with Dates
  • Date Formats and formatting
  • Time Manipulation and Operations
  1. Multiple Dimensions
  • Adding a second dimension
  • Indices and named rows and columns in a Matrix
  • Matrix calculation
  • n-Dimensional Arrays
  • Data Frames
  • Lists
  1. R in Data Science
  • AI Grouping Theory
  • K-means
  • Linear Regression
  • Logistic Regression
  • Elastic Net
  1. R with MadLib
  • Importing and Exporting static Data (CSV, Excel)
  • Using Libraries with CRAN
  • K-means with Madlib
  • Regression with Madlib
  • Other libraries
  1. Data Visualization
  • Powerful Data through Visualization: Communicating the Message
  • Techniques in Data Visualization
  • Data Visualization Tools
  • Examples
  1. Databases, Data lakes & Additional Topics
  • Building connections to Databases and Data lakes, for both Python and R (using Hive server)
  • Methods to “query” data from database and data lakes, for both Python and R
  • Creating and passing macro variables. Specifically, R sprint, paste, paste0, and paste3 (not sure of the equivalent in Python).
  • Optional - Time Permitting Topics
  1. R with Hadoop
  • Overview of Hadoop
  • Overview of Distributed Databases
  • Overview of Pig
  • Overview of Mahout
  • Exploiting Hadoop clusters with R
  • Hadoop, Mahout, and R
  1. Business Rule Systems
  • Rule Systems in the Enterprise
  • Enterprise Service Busses
  • Drools & Using R with Drools
LABS

This hands-on course focuses on ‘learning by doing’, combining expert lecture, practical demonstrations and group discussions with plenty of machine-based real-world programming labs and exercises. Student machines are required.

This hands-on course focuses on ‘learning by doing’, combining expert lecture, practical demonstrations and group discussions with plenty of machine-based real-world programming labs and exercises. Student machines are required.

WHO SHOULD ATTEND

This comprehensive course provides a solid foundation in core R programming for experienced Data Science Analysts, Developers, Administrators, Architects, and technical Managers who need to leverage R for analytics.

PREREQUISITES

This course, geared for Data Analysts and Data Scientists who need to learn the essentials of how to program in R. Incoming students should have prior experience working with Excel or SAS, and should know the basics of SQL. Students should have intermediate-level experience in their field, and prior experience working with programming languages.