Open Source Voice Recognition

Logic Intelligence Releases the World's First Open Source Voice Large Model Framework LLaSO

In the rapid wave of artificial intelligence development, Beijing Deep Logic Intelligence Technology Co., Ltd. has recently launched a remarkable innovation — LLaSO. This groundbreaking research ...

Logic Intelligence Launches the World's First Fully Open-Source End-to-End Voice Model Framework LLaSO, Empowering AI Technology for Shared Innovation

Beijing Deep Logic Intelligent Technology Co., Ltd. has recently stirred up a new wave in the field of artificial ...

Slator

Alibaba’s New Speech Recognition Model Pushes Accuracy But Keeps Weights Closed

Alibaba unveils a new speech recognition model covering 11 languages, noise-robust transcription, and even singing voice ...

InfoQ

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Digi Times

Xiaomi open-sources voice AI model to enter automotive and smart home markets

Xiaomi has launched a 7-billion-parameter version of its open-source voice model, MiDashengLM, which incorporates Alibaba's open-source Qwen 2.5 series. This model focuses on in-car systems and smart ...

Geeky Gadgets

Professional Quality Voice Cloning : Open Source vs ElevenLabs

What if you could replicate a voice so convincingly that even the closest of listeners couldn’t tell the difference? The rise of professional-quality voice cloning has made this a reality, ...

Business Wire

SignalWire Launches Open Source Agent Builder Beta, Accelerating Voice AI Development

PALO ALTO, Calif.--(BUSINESS WIRE)--SignalWire, the leader in Programmable Unified Communications (PUC), today announced the early access beta release of its open source SignalWire Agent Builder. This ...

The Punch on MSN

FG unveils AI model for local languages

ATLAS, an open-source, multilingual and multimodal large language model that supports Yoruba, Hausa, Igbo and Nigerian-accented English.Minister of Communications, Innovation, and Digital Economy, Dr ...

12d

Voice AI Needs an Accurate Evaluation Layer. Podonos Just Raised $2.4M to Build It

Podonos, a startup building the infrastructure layer for evaluating voice AI, has raised $2.4 million in pre-seed funding to bring structure and speed to one of the most overlooked parts of voice AI ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results