English Abstract
Abstract :
Deep learning is a type of machine learning known for its competitive advantage in discovering complex relationships in all data types. However, the insurance applications of deep learning were used for damage detection and churn prediction applications, while the premium prediction received low attention from previous researchers. This study aims to build an incremental deep leaming model to predict insurance premiums. The model contributes to the previously studied Usage-Based Insurance (UBI) concept.
This thesis proposes a deep learning model consistent with the UBI concept that considers the available factors affecting the premium to predict the insurance premium. The proposed model consists of two parts. Part one is the Convolutional Neural Network (CNN) for deep features extraction. Part two is the Support Vector Regression (SVR) built on the extracted deep features to predict the premium. The proposed model is called CNN-SVR after combining the two parts of CNN and SVR.
The dataset was collected from an insurance company to train the proposed model and evaluate its performance compared to the other classical models adopted previously by other researchers, namely the Neural Network (NN), Random Forests (RF), Decision Trees (DT), Linear Regression and Support Vector Regression (SVR).
The model performance evaluation was achieved using some metrics and the execution time needed to add a new data point to the model. The selected metrics are the Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Percentage of Error (MAPE), Explained Variance (EV), Correlation Coefficient (R), and t-test.
The proposed CNN-SVR model reported the best averages among the other models of 1363.935 MSE, 36.838 RMSE, 18.774 MAE, 11.940 ΜΑΡΕ, 0.957 R, and one minus P-Value values close to lin the t-test. The proposed incremental model reported a faster execution time than the classical models, which need to be retrained fally to add a new data point.
The study concluded that CNN-SVR model outperforms the other models in prediction performance and execution time, which supports the hypothesis. A possible future direction for this study is to use a larger dataset with more factors affecting the premium for a better contribution to the UBI and predictions.
Keywords: CNN, SVR, Deep Features, Prediction, Insurance, UBI, Incremental SVR,
Premium