Jargonic Sets New Standards for Japanese ASR

Explore Benchmarks

Jargonic Sets New Standards for Japanese ASR

Explore Benchmarks

aiOla Research: Shaping the Future of Speech Intelligence

At aiOla, we advance the frontier of Speech and Voice AI—focusing on enterprise-grade speech recognition, audio intelligence, and jargon understanding to solve real-world challenges across industries.

Meet the Minds Behind Our IP

aiOla’s research team is a world-class powerhouse in voice and speech AI, with seven PhDs from top companies and academic institutions.

Led by Gil Hetz PhD, Professor Yossi Keshet and Professor Bhiksha Raj, our experts are redefining industry standards, pioneering breakthroughs in ASR and Conversational AI. Their cutting-edge work drives aiOla’s unmatched accuracy and adaptability, empowering enterprises to unlock the full potential of spoken data.

Gil Hetz
Gil Hetz
VP AI. PhD
Aviv Navon
Aviv Navon
Head of Research, PhD
Aviv Shamsian
Aviv Shamsian
Research Tech Lead, PhD Candidate
Neta Glazer
Neta Glazer
Senior Data Scientist. PhD Candidate
Yael Segal-Feldman
Yael Segal-Feldman
Senior Data Scientist. PhD
Prof. Yossi Keshet
Prof. Yossi Keshet
Chief Scientist
Prof. Bhiksha Raj
Prof. Bhiksha Raj
Distinguished Researcher

Jargonic Sets New Standards for Japanese ASR

After setting new benchmarks in English, Spanish, French, and more, Jargonic V2 now leads in Japanese as well—delivering not just superior transcription accuracy, but also unmatched recall of specialized terms across industries like manufacturing, logistics, healthcare, and finance.

Industry-ready ASR for Enterprises: Jargonic

An enterprise-grade speech recognition model that outperforms all competitors across both academic benchmarks and real-world business environments. In comprehensive testing, Jargonic achieved the highest accuracy on standard datasets and superior jargon recognition capabilities, establishing it as the industry’s most accurate speech-to-text solution available. 

Whisper in Medusa’s Ear: Multi-head Efficient Decoding for Transformer-based ASR

A novel multi-head efficient decoding approach for transformer-based Automatic Speech Recognition (ASR), improving inference speed and accuracy.

WhisperNER-tag-and-mask: Enterprise-level Speech Privacy

A privacy-focused speech recognition approach that enables entity recognition while anonymizing sensitive information, meeting enterprise-grade security and compliance requirements. 

WhisperNER: Unified Open Named Entity and Speech Recognition

An advanced framework that integrates named entity recognition (NER) into speech-to-text pipelines, enhancing real-time voice data processing.

Combining Language Models for Specialized Domains: A Colorful Approach

A novel method for combining multiple language models to improve speech recognition across specialized industries, ensuring more accurate jargon recognition.

Keyword-Guided Adaptation of Automatic Speech Recognition

An advanced adaptation model that enhances ASR performance in specialized domains by guiding recognition with contextual keyword injection.

Open-vocabulary Keyword-spotting with Adaptive Instance Normalization

A cutting-edge technique enabling open-vocabulary keyword spotting using adaptive instance normalization to enhance real-time voice interaction and command execution.