Statistics¶
- statistics.get_top_deviations(scores, metric='mpe', n=5)[source]¶
- Given a matrix that each row contains scores of how well a segment fits a model, find the indices of the top most deviant segments. - Args:
- scores: A 2-D numpy array (NxM) that contains M scores for each one of the N segments. metric: A string that specifies which score to consider. n: number of the deviant segments 
- Return:
- The indices of the segments. 
 
- statistics.mape1(y_true, y_pred)[source]¶
- Computes the Mean Absolute Percentage Error between the 2 given time series - Args:
- y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series. 
- Return:
- Mean Absolute Percentage Error value. 
 
- statistics.mpe1(y_true, y_pred)[source]¶
- Computes the Mean Percentage Error between the 2 given time series. - Args:
- y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series. 
- Return:
- Mean Absolute Error value. 
 
- statistics.multi_corr(df, dep_column)[source]¶
- Computation of the coefficient of multiple correlation. The input consists of a dataframe and the column corresponding to the dependent variable. - Args:
- df: Date/Time DataFrame or any Given DataFrame. dep_column: The corresponding the column to the dependent variable. 
- Return:
- The coefficient of multiple correlation between the dependant column and the rest. 
 
- statistics.score(y_true, y_pred)[source]¶
- Computes a set of values that measure how well a predicted time series matches the actual time series. - Args:
- y_true: A numpy array that contains the actual values of the time series. y_pred: A numpy array that contains the predicted values of the time series. 
- Return:
- Returns a value for each of the following measures: r-squared, mean absolute error, mean error, mean absolute percentage error, mean percentage error 
 
