Blaizzy / mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
See what the GitHub community is most excited about today.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
Open-Source Frontier Voice AI
An autonomous agent that conducts deep research on any data using any LLM providers.
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.