AI4I 2020 Predictive Maintenance Dataset

The AI4I 2020 Predictive Maintenance Dataset is a synthetic dataset that reflects real predictive maintenance data encountered in industry.

Associated TasksClassification, Regression, Causal-Discovery
Data SourceSynthetic
Dataset CharacteristicsMultivariate, Time-Series
Date Donated2020-08-29
Feature TypeReal
LabeledYes
Missing ValuesNo
NameAI4I 2020 Predictive Maintenance Dataset
Number of Features6
Number of Instances10000
SourceUCI Machine Learning Repository
Time SeriesYes

Description

Since real predictive maintenance datasets are generally difficult to obtain and in particular difficult to publish, we present and provide a synthetic dataset that reflects real predictive maintenance encountered in industry to the best of our knowledge.

The dataset consists of 10,000 data points stored as rows with 14 features in columns. UID is a unique identifier ranging from 1 to 10000. Product ID consists of letter L, M, or H indicating product quality variants and variant-specific serial number. Air temperature [K] and Process temperature [K] are generated using a random walk process with normalization. Rotational speed [rpm] is calculated with noise, Torque [Nm] values are normally distributed. Tool wear [min] is modified according to product quality variant.

The machine failure label indicates if the machine has failed based on five independent failure modes: tool wear failure (TWF), heat dissipation failure (HDF), power failure (PWF), overstrain failure (OSF), and random failures (RNF). A failure occurs if any one of these modes is true and the machine failure label is set to 1. This makes it non-transparent to machine learning methods which failure mode caused the failure.

Tags

References

Back to Index