Get in Touch

Course Outline

Day One: Language Basics

  • Course introduction
  • About data science
    • Defining data science
    • The process of conducting data science
  • Introducing the R language
  • Variables and data types
  • Control structures (loops and conditionals)
  • R scalars, vectors, and matrices
    • Defining R vectors
    • Matrices
  • String and text manipulation
    • Character data type
    • File I/O
  • Lists
  • Functions
    • Introducing functions
    • Closures
    • lapply/sapply functions
  • DataFrames
  • Practical labs for all sections

Day Two: Intermediate R Programming

  • DataFrames and file I/O
  • Reading data from files
  • Data preparation
  • Built-in datasets
  • Visualisation
    • Graphics package
    • plot(), barplot(), hist(), boxplot(), and scatter plots
    • Heat maps
    • ggplot2 package (qplot(), ggplot())
  • Exploration with dplyr
  • Practical labs for all sections

Day Three: Advanced Programming With R

  • Statistical modelling with R
    • Statistical functions
    • Handling missing values (NA)
    • Distributions (Binomial, Poisson, Normal)
  • Regression
    • Introduction to linear regression
  • Recommendations
  • Text processing (tm package and word clouds)
  • Clustering
    • Introduction to clustering
    • K-Means
  • Classification
    • Introduction to classification
    • Naive Bayes
    • Decision trees
    • Training using the caret package
    • Evaluating algorithms
  • R and big data
    • Connecting R to databases
    • The big data ecosystem
  • Practical labs for all sections

Requirements

  • A basic programming background is preferred

Setup

 21 Hours

Number of participants


Price per participant

Testimonials (7)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories