Starting off as a muggle that naïve to the Math's and Data Science world.

Data Management

Day 15

  • Get started to SAS Studio

Day 16

  • Load data into SAS Studio
  • Select Statement
  • Where Clause
  • Sort
  • Aggregation
  • Joins

Day 17

  • Aggregation (Con’t)
  • Data manipulate (Insert, Update & Delete)
  • Backup table

Day 18

  • Table structure/dictionary
  • Univariate analysis categorical variable

Day 19

  • Univariate analysis continuous variable
  • Bivariate analysis

Day 20

  • Data cleansing (empty/null value)
  • Data cleansing (unwanted symbol)

Day 21

  • Data cleansing (continuous)

Day 22

  • Exploratory data analysis (EDA)
    • summarize
    • statistical table
    • n-miss
    • cross tabulation analysis
    • gplot aka scatter plot
    • histogram
    • correlation analysis
    • factor analysis
  • Feature engineering
    • One hot encoding

Day 23

  • Feature engineering (Con’t)
    • Label encoding
  • Data encapsulate aka view

Day 24

  • Data warehouse
  • Data lake
  • Data mart
  • Hadoop
    • hive
    • pig

Leave a comment