Exploring and Analyzing Data with Splunk (EADS)

 

Course Overview

This 9-hour course is for users who want to attain operational intelligence level 4, (business insights) and covers exploratory data analysis by using statistical tools and custom visualizations.

Prerequisites

To be successful, students should have a solid understanding of the following courses:

Course Objectives

  • Analytics Framework
  • Exploring and visualizing data
  • Cleaning and Preprocessing Data
  • Numerical and String based clustering
  • Data Correlation
  • Meta Transactions
  • Detecting Anomalies
  • Forecasting

Outline: Exploring and Analyzing Data with Splunk (EADS)

Topic 1 – What is Data Science

  • Define terms related to analytics and data science
  • Describe the analytics workflow
  • Describe Artificial Intelligence and Machine Learning
  • Examine common Machine Learning myths
  • Describe Splunk’s Machine Learning tools

Topic 2 – Exploratory Data Analysis

  • Use bin and makecontinuous to restructure and visualize data
  • Examine field statistics with fieldsummary
  • Transform fields with eval and fillnull
  • Clean text with the rex and cleantext commands
  • Solve Anscombe’s Quartet
  • Apply boxplots and 3d scatterplots to visualize data

Topic 3 – Event Clustering

  • Take a behavioral based approach to cluster data
  • Cluster numerical fields using the kmeans command
  • Cluster based of string similarity with the cluster command
  • Find patterns in clusters

Topic 4– Correlations and Transactions

  • Define correlation and co-occurrence
  • Use SPL correlation commands
  • Use the statistical tests from the Machine Learning Toolkit to correlate fields
  • Use streamstats and chart commands to correlate data

Topic 5– Anomaly Detection

  • Define Statistical Outliers
  • Use Add-hoc methods of numerical anomaly detection
  • Find numerical or categorical anomalies with the AnomalyDetection command

Topic 6 – Forecasting

  • Define forecasting use cases
  • Use the predict command to forecast future timeseries

Prices & Delivery methods

Online Training

Duration
9 hours

Price
  • Online Training: CAD 1,320
  • Online Training: US $ 1,000
  • Splunk Training Units: 100 SPC
Classroom Training

Duration
9 hours

Price
  • Canada: CAD 1,320
  • Splunk Training Units: 100 SPC

Schedule

Currently there are no training dates scheduled for this course.