Research Output
ASPIRE - Real noisy audio-visual speech enhancement corpus
  ASPIRE is a a first of its kind, audiovisual speech corpus recorded in real noisy environment (such as cafe, restaurants) which can be used to support reliable evaluation of multi-modal Speech Filtering technologies. This dataset follows the same sentence format as the audio-visual Grid corpus. The recorded audiovisual speech corpus can be used for reliable evaluation of next generation multi-modal Speech Filtering technologies.

  • Date:

    01 November 2020

  • Publication Status:

    Published

  • DOI:

    10.5281/zenodo.4585619

  • Funders:

    Engineering and Physical Sciences Research Council

Citation

Gogate, M., Dashtipour, K., Adeel, A., & Hussain, A. (2020). ASPIRE - Real noisy audio-visual speech enhancement corpus. [Dataset]. https://doi.org/10.5281/zenodo.4585619

Authors

Keywords

speech enhancement, speech separation, audio-visual, deep learning

Monthly Views:

Available Documents