A library for real-time voice processing in web browsers
-
Updated
Feb 11, 2025 - TypeScript
A library for real-time voice processing in web browsers
A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.
Advanced Topics in Speech and Language Processing
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
Live microphone quality detection system in browser Js
An audio signal processing project that detects speaker gender from recorded voice samples and enhances speech using spectral subtraction techniques in MATLAB.
A cutting-edge AI-powered phone agent designed for seamless voice interactions, dynamic data handling, and scalable communication. Perfect for modern sales and customer engagement solutions.
Timbre Transfer for R2D2-alike Robot voice turning into instrument using Diffusion Model
This repository is made in lieu of submission towards the solution of problem statement 2 of the OPEN AI NLP hackathon. The objective here is to classify the voice recordings of a call center proceeding by treating them as consumer complaints into the said categories of the automotive industry.
AI-powered platform for creative content generation and management, featuring advanced AI integrations, seamless accessibility, and community collaboration.
Curso de procesado digital de la señal (24-25) : Aplicación al procesado de la voz.
🖼️ framed picture cloud base smart photo frame with voice activation paired with an android app
This is an algorithm to identify human voice and do segmentation automatically. The result will be compared to the manual segmentation data, then a accuracy report will be generated based on match rate, insertion rate and omission rate.
Universal Function Library of Scientific Calculation
A Telegram bot that processes voice messages using Sber's speech recognition API. This bot converts audio formats, generates authentication tokens, and transcribes voice messages into text, enabling seamless communication via Telegram.
Web Application that Identifies Animal from their Sound. Right now restricted to binary classification between cat and dog sounds.
This repository presents a comprehensive PyTorch implementation of an end-to-end Speaker Verification system, incorporating state-of-the-art deep learning architectures and language models. The system features robust speaker recognition capabilities, with specialized support for the Vietnamese
Final_Project_of_Siganls_&_Sytems_Spring_1401
Add a description, image, and links to the voice-processing topic page so that developers can more easily learn about it.
To associate your repository with the voice-processing topic, visit your repo's landing page and select "manage topics."