News

2026-04-07
Update: SAPC2 Deadline Extension
The submission deadline has been extended: End of April 2026August 31, 2026
This extension aims to accommodate ongoing submissions, particularly for Track 2, and teams with pending Data Use Agreements (DUAs). We hope this provides additional time for participation and preparation.
2026-03-18
Update: Inference Time Limit (Track 1)
The Ingestion time limit per submission has been increased: 15000s21600s
2026-03-12
Starter Template Update: Local decoding script is now available for both tracks (Track 1; Track 2). In addition, the Dev_streaming subset for latency measurement is shared through Box, and the corresponding code has been added to preprocess.sh.
2026-02-27
Competition Pages & Leaderboards: The Codabench API for (Track 1; Track 2) has been released.
2026-02-27
Starter Template: To help you begin, SAPC-template (https://github.com/xiuwenz2/SAPC-template) is now available.
2025-12-03
Data Access: Request the SAP corpus via (the official website), by submitting the (DUA) and a one-page proposal to email.
Note: Approval typically takes ~2–4 weeks. Upon approval, access will be immediately granted to the SAP Research Release, which contains most of the same waveforms that will be part of the official competition release, but in a different data format. The official competition release will be made available on 2026-03-01 to early approvals, or immediately once approved after that date.
2025-12-03
Team Registration: Team registration is now open through link.
2025-12-03
SAPC2 Challenge website launched!

Introduction

Welcome to the Speech Accessibility Project Challenge 2(SAPC2).

SAPC2 builds on the success of the Interspeech 2025 Speech Accessibility Project Challenge (Challenge API), which demonstrated significant progress in dysarthric speech recognition — reducing Word Error Rate (WER) from the Whisper-large-v2 baseline of 17.82% to 8.11%. This new edition introduces a larger, more diverse, and etiology-balanced corpus, further promoting fairness, robustness, and inclusivity in impaired-speech ASR. The challenge invites the research community to push the state of the art, develop innovative modeling techniques, and set new standards for accessible speech technology.

Challenge Tracks

The challenge features two complementary tracks:

  1. Unconstrained ASR Track: Participants may use models of any size or architecture, aiming to advance the state of the art in dysarthric speech recognition.
  2. Streaming ASR Track: Submitted systems will be placed on a Pareto chart of system latency and system accuracy, promoting lightweight and deployable solutions for real-world use.

Competitors will submit trained model parameters and inference code through Codabench (Track 1; Track 2) up to a maximum number of permitted submissions. Results on test1 will be released within three days of submission. Results on test2 will be released after the close of competition.

Evaluation Metrics

Prizes & Publication

A total prize of U.S. $10,000 will be divided equally among all teams with a system on the Pareto frontier of accuracy and latency, as measured using the sequestered test2 set.

To clarify how winners are selected across tracks:

Teams submitting to the competition will be invited to present their work at a competition workshop, scheduled in conjunction with a major conference (TBA).

References

Acknowledgements

The Speech Accessibility Project is funded by a grant from the AI Accessibility Coalition. Computational resources for the challenge are provided by the National Center for Supercomputing Applications (NCSA). We would also like to thank Rob Kooper (NCSA), Wei Kang (Xiaomi Corp.), and Maisy Wieman (SoundHound AI) for their expertise and invaluable assistance in setting up the challenge.