audio-augmentation

Here are 17 public repositories matching this topic...

AgaMiko / data-augmentation-review

A list of useful data augmentation resources. It includes some uncommon techniques, libraries, links to GitHub repos, papers, and more.

review machine-learning survey generative-adversarial-network style-transfer data-generation data-augmentation image-augmentation data-synthesis autoaugment audio-augmentation data-augmentations augmentation-policies nlp-augmentation graph-data-augmentation

Updated Aug 14, 2024

KentoNishi / torch-pitch-shift

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation pitch-shift gpu-support torchaudio audio-augmentation

Updated Sep 25, 2024
Python

KentoNishi / torch-time-stretch

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation gpu-support torchaudio time-stretch audio-augmentation

Updated Sep 5, 2022
Python

Lallapallooza / fast-audiomentations

⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.

audio python machine-learning gpu dsp pytorch triton data-augmentation audio-effects audio-augmentation augmentations audio-data-augmentation

Updated Jan 19, 2024
Python

zhaoyi2 / audio_augment

A tool/script for batch speech data augmentation using speed, volume, RIRS, and MUSAN.

speed optional volume musan audio-augmentation rirs

Updated Jun 28, 2020
Shell

zabir-nabil / audioperm

A Python library for generating different permutations of audible segments from audio files.

audio-classification speaker-recognition audio-processing augmentation speech-augmentation audio-augmentation

Updated Jun 13, 2022
Jupyter Notebook

zabir-nabil / torch-speech-dataloader

A ready-to-use PyTorch dataloader for audio classification, speech classification, speaker recognition, and similar tasks, with in-GPU augmentations.

speech torch audio-augmentation torch-dataloader pytorch-speech-dataloader gpu-augmentation speech-augmentation-gpu

Updated Nov 6, 2022
Python

LarsMonstad / amt-augmentor

Python augmentation toolkit for Automatic Music Transcription datasets

audio music amt augmentation automatic-music-transcription audio-augmentation music-augmentation music-transc

Updated Sep 30, 2025
Python

lucas-fpaiva / survey-audio-aug

Implementation of audio, image, and spectrogram augmentation techniques provided by librosa, Keras, and audiomentations.

music-information-retrieval automatic-speech-recognition data-augmentation audio-augmentation environmental-sound-classification

Updated May 24, 2022
Jupyter Notebook

DBraun / audiotree

Audio data loading and augmentations in JAX

audio dataloader jax audio-augmentation

Updated Mar 1, 2026
Python

moego0 / custom_KWS

End-to-end pipeline for training a custom keyword detection model with TensorFlow & TFLite export

deep-learning tensorflow keras speech-recognition mfcc keyword-spotting cnn-model voice-detection audio-processing tflite audio-processing-with-python edge-ai audio-augmentation esc50

Updated Feb 24, 2026
Python

hperer02 / Bird-sound-classification

This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I achieved a rank of 55th out of 974 participants, earning a bronze medal. The goal of this competition was to build a model that can accurately classify bird sounds.

pytorch librosa audio-processing torchaudio mel-spectrogram audio-augmentation efficientnet

Updated Jun 20, 2024
Jupyter Notebook

lgpearson1771 / openwakeword-trainer

Train custom wake word models with openWakeWord. A granular 13-step pipeline with compatibility patches for torchaudio 2.10+, Piper TTS, and speechbrain. Generates tiny ONNX models (~200 KB) for real-time keyword detection — like building your own "Hey Siri" trigger. WSL2/Linux + CUDA required.

python text-to-speech deep-learning speech-recognition speech-to-text keyword-spotting voice-assistant wake-word-detection onnx on-device training-pipeline edge-ai audio-augmentation wsl2 speechbrain openwakeword piper-tts wake-word-training custom-wake-word

Updated Feb 13, 2026
Python

nicolagulmini / augdio

your new pocket-sized audio wizard

desktop-app macos audio-processing audio-augmentation

Updated Nov 26, 2025

laurencecliffe / SoundScaper

SoundScaper is an audio augmented reality mobile application that lets users author, save, and reload virtual, spatially interactive, three-dimensional binaural soundscapes within physical, real-world spaces.

augmented-reality mobile-app soundscapes augmented-reality-applications audio-augmentation

Updated Jan 1, 2021

imane-ayouni / Text-to-Speech-using-Tacotron2

Converting text to audio and applying audio augmentation

text-to-speech audio-data audio-augmentation tacotron2

Updated Oct 28, 2023
HTML

AndreasScharnetzki / EmotionClassifier

A convolutional neural network that distinguishes between a speaker's emotions. Comes with multiple preprocessors to improve the model's performance.

natural-language-processing supervised-learning convolutional-neural-networks transfer-learning preprocessing human-computer-interaction audio-processing multi-class-classification audio-augmentation variable-length-data speech-emotion-classification

Updated Jan 20, 2022
Python
