Traffic accident data multiclass classification

Multiclass classification is the problem of classifying instances into one of three or more classes. It can be naturally solved by extending the binary classification technique for some algorithm, such as support vector machines (SVMs)

Note

In machine learning, classification refers to a predictive problem where a class label is predicted for a given input data.

Traffic accidents can be classified in 4 classes according to its impact on traffic. In this project, we apply two classification algorithms (SVMs and random forest) to a Traffic accident Dataset.

Info

The car accident dataset covers 49 states of the USA and was collected from February 2016 to December 2021. There are 2.8 million accident records.

The goal is to predict the degree of impact of an accident given the value of a set of explanatory variables and study the performance of the predictors.

You can download the project here and find the source code in GitHub-classification.

Written on August 13th, 2022 by Guillermo Villanueva