Operations · Logs · Anomaly Detection P1778698833
Executive Summary

Executive Summary

Overall model accuracy and which signal characteristics best identify anomalies

Total Observations
2000
Training Segments
1499
Test Segments
501
Model Accuracy
0.9641
Anomaly Detection AUC
0.9897
Precision (Anomalies)
0.9592
Recall (Sensitivity)
0.8704
F1-Score
0.9126
Specificity
0.9898
A Random Forest classifier achieved 96.4% accuracy (AUC 0.990) in detecting 413 anomalies across 2000 telemetry segments. The model demonstrates strong sensitivity (87.0%) with acceptable specificity (99.0%), balanced by F1-score of 0.913. The strongest anomaly discriminator is peak_count (importance 97.023), suggesting signal-level characteristics are primary indicators.
Interpretation

A Random Forest classifier achieved 96.4% accuracy (AUC 0.990) in detecting 413 anomalies across 2000 telemetry segments. The model demonstrates strong sensitivity (87.0%) with acceptable specificity (99.0%), balanced by F1-score of 0.913. The strongest anomaly discriminator is peak_count (importance 97.023), suggesting signal-level characteristics are primary indicators.

Overview

Analysis Overview

Renderer-generated

Data Preparation

Data Quality Assessment

Renderer-generated

Visualization

Anomaly Distribution

Class balance between anomalous and normal segments

Interpretation

Out of 2000 telemetry segments, 413 (20.6%) are anomalies while 1587 (79.3%) are normal operations. The dataset shows class imbalance, with anomalies representing the minority class. This distribution affects model training and inference thresholds.

Visualization

Feature Correlation Matrix

Pairwise feature correlations; strongest relationships indicate redundant information

Interpretation

Feature correlations range from -0.785 to 0.998, with the strongest pairwise relationship being 0.998. Most feature pairs show weak to moderate correlation (|r| < 0.5), suggesting features capture distinct signal properties. Strong correlations with anomaly label indicate predictive signal characteristics.

Visualization

Feature Importance Ranking

Top 10 features ranked by importance (mean decrease in Gini); shows which signal characteristics best discriminate anomalies

Interpretation

The top 10 features contributing to anomaly detection span statistical moments (mean, variance, kurtosis), structural patterns (peak counts, smoothed peaks), and normalized energy metrics. Peak/Structural features dominate (97.023 importance), with the single most important feature being 'Peak Count'. This suggests peak/structural characteristics are the primary anomaly discriminators.

Visualization

ROC Curve Analysis

True positive rate vs false positive rate across all classification thresholds (AUC measures discriminative power)

Interpretation

The ROC curve exhibits strong discriminative power with AUC = 0.990, indicating the classifier ranks anomalies higher than normal segments 99.9% of the time on average. The curve bows well above the random baseline (diagonal), demonstrating the model's ability to trade off sensitivity and specificity. At any threshold, operators can choose operating points balancing missed anomalies against false alarms.

Visualization

Confusion Matrix

Actual vs predicted labels; diagonal (correct predictions) vs off-diagonal (errors)

Interpretation

The confusion matrix shows 4 false positives (1.0% of normal segments incorrectly flagged) and 14 false negatives (13.0% of true anomalies missed). The model prioritizes sensitivity (minimizing missed anomalies) over strict precision, suitable for safety-critical anomaly detection where missing an anomaly carries higher risk than false alarms.

Data Table

Model Performance Metrics

Summary of key classification metrics on test set

MetricValue
Test Set Size501
Accuracy96.41%
Precision95.92%
Recall (Sensitivity)87.04%
F1-Score0.9126
Specificity98.98%
AUC (ROC)0.9897
Interpretation

The Random Forest classifier achieves 96.41% accuracy on the holdout test set with F1-score of 0.9126. Precision (95.92%) and Recall (87.04%) are well-balanced, enabling reliable anomaly detection without excessive false alarms. The model demonstrates strong discriminative power with ROC AUC of 0.9897.

Your data has more stories to tell. Run any analysis on your own data — 60+ validated R modules, interactive reports, AI insights, and PDF export. 2,000 free credits on signup.
Try Free — No Signup Sign Up Free

Report an Issue

Tell us what's wrong. You'll get a free re-run of this analysis so you can try again with different parameters. If the re-run still doesn't meet your expectations, we'll refund your credits.

Want to run this analysis on your own data? Upload CSV — Free Analysis See Pricing