Statistics¶
- statistics.get_top_deviations(scores, metric='mpe', n=5)[source]¶
Given a matrix that each row contains scores of how well a segment fits a model, find the indices of the top most deviant segments.
- Args:
scores: A 2-D numpy array (NxM) that contains M scores for each one of the N segments. metric: A string that specifies which score to consider. n: number of the deviant segments
- Return:
The indices of the segments.
- statistics.mape1(y_true, y_pred)[source]¶
Computes the Mean Absolute Percentage Error between the 2 given time series
- Args:
y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series.
- Return:
Mean Absolute Percentage Error value.
- statistics.mpe1(y_true, y_pred)[source]¶
Computes the Mean Percentage Error between the 2 given time series.
- Args:
y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series.
- Return:
Mean Absolute Error value.
- statistics.multi_corr(df, dep_column)[source]¶
Computation of the coefficient of multiple correlation. The input consists of a dataframe and the column corresponding to the dependent variable.
- Args:
df: Date/Time DataFrame or any Given DataFrame. dep_column: The corresponding the column to the dependent variable.
- Return:
The coefficient of multiple correlation between the dependant column and the rest.
- statistics.score(y_true, y_pred)[source]¶
Computes a set of values that measure how well a predicted time series matches the actual time series.
- Args:
y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series.
- Return:
Returns a value for each of the following measures: r-squared, mean absolute error, mean error, mean absolute percentage error, mean percentage error