This project develops a speech emotion recognition system using Mel spectrograms and Convolutional Neural Networks (CNNs). The dataset is the Acted Emotional Speech Dynamic Database (AESDD), which contains audio files categorized into five emotions: angry, disgust, fear, happy, and sad.
- Transform raw audio files into numerical representations using Mel spectrograms (a minimal extraction sketch follows this list).
- Train CNNs to classify emotions from these spectrograms (an illustrative model sketch also follows the list).
- Address challenges such as:
  - Variable input shapes.
  - Small dataset size.
- Compare performance across three setups: the original spectrograms trained with a batch size of 1, resized spectrograms, and an artificially augmented dataset.
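
As a rough illustration of the first step, the sketch below converts an audio file to a log-scaled Mel spectrogram with `librosa`. The parameter values (`sr`, `n_mels`) and the file name are placeholders; the exact settings used in `notebook.ipynb` may differ.

```python
import librosa
import numpy as np

# Illustrative parameters -- the values used in notebook.ipynb may differ.
def audio_to_mel(path, sr=22050, n_mels=128):
    y, _ = librosa.load(path, sr=sr)                 # load and resample the clip
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)      # log scale for the CNN input

spec = audio_to_mel("example.wav")                   # hypothetical file name
print(spec.shape)  # (n_mels, time_frames) -- the time axis varies per clip
```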
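And a minimal sketch of how a CNN can accept spectrograms of variable shape: an adaptive (global) average-pooling layer collapses whatever time length arrives, which is one way to train on the original, unresized spectrograms one at a time (batch size 1). This is written in PyTorch purely for illustration; the layer counts and channel sizes are assumptions, not the architecture from the notebook.

```python
import torch
import torch.nn as nn

class EmotionCNN(nn.Module):
    # Hypothetical architecture -- illustrates the variable-input idea only.
    def __init__(self, n_classes=5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
        )
        self.pool = nn.AdaptiveAvgPool2d(1)  # collapses any spatial size to 1x1
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):                    # x: (batch, 1, n_mels, time)
        x = self.pool(self.features(x))
        return self.fc(x.flatten(1))

model = EmotionCNN()
# Two different time lengths pass through the same network unchanged.
print(model(torch.randn(1, 1, 128, 300)).shape)  # torch.Size([1, 5])
print(model(torch.randn(1, 1, 128, 512)).shape)  # torch.Size([1, 5])
```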
- The best results were obtained using the original dataset with a batch size of 1, achieving an accuracy of 74.38%.
- Resized spectrograms resulted in lower accuracy, likely due to loss of crucial information during resizing.
- Artificially augmented data achieved accuracy comparable to the original dataset, though the augmentation itself may introduce some information loss.
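
For reference, raw-audio augmentation might look like the sketch below. Time stretching, pitch shifting, and additive noise are common choices and are assumptions here, not necessarily the transformations used in this project.

```python
import numpy as np
import librosa

# Hypothetical augmentations -- common choices, not necessarily those used here.
def augment(y, sr):
    rate = np.random.uniform(0.9, 1.1)                      # mild tempo change
    stretched = librosa.effects.time_stretch(y, rate=rate)
    steps = np.random.randint(-2, 3)                        # up to 2 semitones
    shifted = librosa.effects.pitch_shift(y, sr=sr, n_steps=steps)
    noisy = y + 0.005 * np.random.randn(len(y))             # light white noise
    return [stretched, shifted, noisy]
```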
- Clone the repository:

  ```bash
  git clone https://github.com/SigurdST/emotion_recognition.git
  cd emotion_recognition
  ```
- Explore `notebook.ipynb` to review the full code implementation and processing steps, and `REPORT.md` for detailed explanations, results, and insights from the project.