ai-audio-generation

Star

Here are 11 public repositories matching this topic...

serp-ai / ai-text-to-audio-latent-diffusion

Sponsor

Star

text-to-audio-latent-diffusion

text-to-audio latent-diffusion audio-diffusion text-to-audio-ai latent-audio-diffusion audio-ai ai-audio-generation

Updated Aug 25, 2023
Python

RhythrosaLabs / soundstorm

Star

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

midi chatbot sound sound-processing gpt algorithmic-music algorithmic-composition sounds audio-processing random-music audio-tools sound-design text-to-audio audio-toolbox ai-audio gpt-4 chatgpt chat-gpt ai-audio-generation

Updated May 4, 2024
Python

stefaner1 / SRT2SoundFX

Star

An audiobook sound effect generator that transforms SRT files into immersive audio experiences. It parses SRT files, uses ChatGPT to create sound effect prompts, generates sounds via the ElevenLabs API, and syncs the audio on an MP3 timeline.

sound-effects ai-audio-generation elevenlabs

Updated Nov 21, 2024
Python

Yuan-ManX / SoundHub

Star

AI Audio Framework 🎵

deep-learning ai-framework ai-audio ai-audio-generation

Updated Apr 28, 2024
Python

ibrahimm7004 / AI-Voice-Agents

Star

Production-ready voice agents and speech pipelines: STT → LLM/Agent → TTS, voice receptionists, telephony, call recording, tool/function calling. Built with Twilio, OpenAI Whisper, ElevenLabs, Vapi/Retell, FastAPI, WebSockets, ffmpeg; designed for deployment, monitoring, and real-world reliability

text-to-speech automation ai twilio openai speech-to-text ai-agents ai-agent ai-audio-generation ai-speech ai-voice-assistant ai-voice-agent

Updated Feb 22, 2026
TypeScript

shinshekai / VoxForge-Pro

Star

VoxForge Pro is a premium, offline audiobook generator powered by Kokoro-82M & Chatterbox TTS. Transform PDFs and text into professional audio using 47 lifelike voices across 6 languages. Features include voice cloning, smart OCR for scanned documents, and multi-speaker narration support.

ai tts ai-audio-generation pinokio pinokio-community

Updated Mar 5, 2026
Python

dragonhub0710 / soundscroll

Star

SoundScroll is an AI audiobook generator

text-to-speech speech-to-text ai-audio-generation

Updated Apr 28, 2025
JavaScript

Ayushverma135 / AudFake-AI-Audio-Generator

Sponsor

Star

This project demonstrates real-time audio processing using Python. It captures audio from a microphone, converts the speech to text, and then synthesizes the text back to speech using a different voice. This can be useful for applications such as voice changers, real-time translation, and more.

audio python speech-recognition speech-to-text audio-processing pyttsx3 ai-audio-generation