2.1.0
User Documentation for Apache MADlib
Data Types and Transformations

Detailed Description

Data types and operations that transform and shape data.

Modules

 Arrays and Matrices
 Mathematical operations for arrays and matrices.
 
 Encoding Categorical Variables
 Functions to encode categorical variables to prepare data for input into predictive algorithms.
 
 Path
 A function to perform complex pattern matching across rows and extract useful information about the matches.
 
 Pivot
 Pivoting and data summarization tools for preparing data for modeling operations.
 
 Sessionize
 Session reconstruction of data consisting of a time-stamped sequence of events.
 
 Stemming
 Provides Porter stemmer operations supporting other MADlib modules.