This repository contains different CNN methods for audio classification. It starts with canceling noise from audio. Then it converts the audio into a mel-spectrogram and trains with CNN models.
-
Updated
Sep 9, 2023 - Python
This repository contains different CNN methods for audio classification. It starts with canceling noise from audio. Then it converts the audio into a mel-spectrogram and trains with CNN models.
A text-to-speech program using VAE on Mel spectrograms of phonemes.
Embed chiptunes in 2D with Convolutional Auto Encoder and Mel Spectrograms
This repository is to introduce the application of Activation Maximization for audio-domain data.
Music Genre Classification
A pet project on music genre classification. Assigning the correct genre to the provided audio track.
Speech emotion recognition models for the Moody web application.
Open Source Repository for the MASA Project
This project was for the pattern recognition course I studied in college. This was the beginning of dealing with neural networks and 2 CNN models were made, 1-d model and 2-d model to deal with different forms of the data, audio and image, respectively.
Music genre classification with CNN (exam project)
Step onto the stage with Saxophone Hero, where your tenor saxophone is the key to unlocking a rhythmic adventure through a world of sheet music. In this game, your character scores points by hitting the right notes. Powered by machine learning, the game captures the pitch from your saxophone and translates it to player movement in real time.
Leveraged Dynamic Time Warping (DTW) to assess the similarity between specific audio tracks
[Big Data] Detection of Animal Species in Tropical Soundscapes
Bali has a diversity of arts that has been recognized by the world, where one of the most famous Balinese arts is the Karawitan art, especially the Kendang Tunggal instrument. Notation documentation or more commonly known as music transcription, can make learning a song easier, and in the case of this research, it makes it easier to learn to pla…
Simple neural net to classify the emotion in an audio
Project to classify wav audio files using a CNN.
Different Signal Processing Tasks
MAIC VOICE AI 대회. 음성 멜-스펙트럼 데이터를 이용한 음성 질환 진단 및 분류.
Speech Emotion Recognition (SER) using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)
Add a description, image, and links to the mel-spectrogram topic page so that developers can more easily learn about it.
To associate your repository with the mel-spectrogram topic, visit your repo's landing page and select "manage topics."