User Documentation for Apache MADlib

Detailed Description

Useful utilities for data science workflows.


 Columns to Vector
 Create a new table with all feature columns inserted into a single column as an array.
 Database Functions
 Provides a collection of user-defined functions for performing common tasks in the database.
 Linear Solvers
 Methods that implement solutions for systems of consistent linear equations.
 Mini-Batch Preprocessor
 Utility that prepares input data for use by models that support mini-batch as an optimization option.
 PMML Export
 Implements the PMML XML standard to describe and exchange models produced by data mining and machine learning algorithms.
 Term Frequency
 Provides a collection of functions for performing common tasks related to text analytics.
 Vector to Columns
 Converts a feature array in a single column of an output table into multiple columns.