Big Data Analytics

Introduction

Analytics is defined as the process of extracting and creating information from new data by the process of filtering and bringing data into context

Types of Analytics

Descriptive Analytics

  • This comprises of analyzing past data to present it in a summarized form which can be easily interpreted.

  • The aim is to answer the question What happened

Diagnostic Analytics

  • This is the analysis of past data to diagnose the reasons as to why certain events happened

  • The aim is to answer the question Why did it happen

Predictive Analytics

  • This comprises of predicting the occurence of an event or likely outcome of an event

  • The aim is to answer What is likely to happen

Prescriptive Analytics

  • This uses multiple prediction models to predict the various outcomes and best course of action for each outcome

  • The aim is to answer What can we do to make it happen

A summary is given in Table below:

Analytics What it Entails
Descriptive What happened
Diagnostic Why did it happen
Predictive What is likely to happen
Prescriptive What can we do to make it happen

Big Data

  • This is a collection of datasets whose volume, velocity or variety is too large that it is difficult to store and manage it using traditional databases

Characteristics of Big Data

  • Volume: The data is so large that it cannot fit on a single machine therefore need specialized tools.

  • Velocity This refers to how fast the data is generated

  • Variety: This refers to the forms of the data; structured, unstructured and semi-structured.

  • Veracity This is how accurate the data is

  • Value The usefulness of the data for the intended purpose