Machine Learning for detecting drivers' drowsiness

With a keen interest in neuroscience and curiosity in HCI technology, I was drawn to the topic of brain-computer interaction and worked as an undergraduate research intern for approximately seven months at DGIST's BRAIN LAB. There, I focused on building an AI model to predict a driver's level of sleepiness based on brain signals. I managed entire learning process from designing and extracting training data from raw data, constructing models, training adn analyzing the results.

1. Extracting train dataset from raw brain signal data

There are various devices for collecting brain signals. I was able to work with fNIRS, PPG, and EEG data, which respectively represent brain oxygenation, blood flow, and electrical signals. Each data type has n channels depending on sensor and electrode placement. For machine learning, I used fNIRS data as the initial training data. I needed to analyze drowsiness level based on m-minute data from n channels according to brain region. I believed that changes in oxygenation contain information on brain activity and drowsiness, so I extracted statistical information such as average, peak, and skewness from 10-second windows. By aggregating k data extracted for each window, n channels, and m' windows, I obtained a (k x n x m) size dataset.

2. Training machine learning models

I utilized various machine learning models such as LDA, KNN, and SVM through sklearn. Among them, SVM showed the best performance, so I continued the experiment with SVM using linear, Gaussian, and polynomial kernels. However, the results showed a low performance with an average f1 score of 0.4. Therefore, I decided to conduct a data evaluation experiment, considering that the preprocessed data may not fully capture the necessary information.

3. Analyzing training dataset

The first issue is that the data used consisted of randomly shuffled non-sleep/half-sleep/sleep state fNIRS signals from multiple subjects. The goal was to detect the half-sleep state, but due to the imbalance between the non-sleep/sleep state signals and the half-sleep state signals, the experiment was conducted by randomly extracting data and equalizing the amount of each label data. The problem with this approach is that the performance of such a classifier cannot be trusted because the training and evaluation sets are divided within the randomly selected data. For example, if the randomly selected data happens to belong to one subject, the classification performance is very likely to be good.

Also, due to the large size of the dataset, feature selection and reduction were performed by training the machine learning model based on the importance of the features. However, there were issues with this feature selection process, such as even the most important features having evaluation importance of 2% or less.

Despite trying various feature selection methods, I could not achieve good classification performance in machine learning. Therefore, I decided to introduce deep learning.

4. Building & Training CNN+LSTM deep learning model

After researching various papers, it was decided to use CNN+LSTM. Since the data was 2-dimensional in terms of channels and signals over time, a 1-second window's features were extracted using CNN and passed through LSTM to preserve spatial locality between channels (i.e., brain regions) and temporal locality over time. The model was implemented using Keras, and the final classification performance was recorded as approximately 62% for accuracy, precision, and recall metrics.