Recherche et sélection de publications
Interface en ou

Drum extraction in single channel audio signals using Multi-Layer Non negative Matrix Factor Deconvolution

Clément Laroche #1 #2, Hélène Papadopoulos #1, Matthieu Kowalski #1, Gaël Richard #2
#1 Laboratoire des signaux et systèmes (L2S)
  • UMR8506 CNRS
  • SUPELEC
  • Univ Paris-Sud
#2 Télécom ParisTech
  • Institut Mines-Télécom
References
ICASSP, Nouvelle Orleans, USA, March 2017,
Abstract

In this paper, we propose a supervised multilayer factorization method designed for harmonic/percussive source separation and drum extraction. Our method decomposes the audio signals in sparse orthogonal components which capture the harmonic content, while the drum is represented by an extension of non negative matrix factorization which is able to exploit time-frequency dictionaries to take into account non stationary drum sounds. The drum dictionaries represent various real drum hits and the decomposition has more physical sense and allows for a better interpretation of the results. Experiments on real music data for a harmonic/percussive source separation task show that our method outperforms other state of the art algorithms. Finally, our method is very robust to non stationary harmonic sources that are usually poorly decomposed by existing methods.

Keywords
Category
Paper in proceedings
Research Area(s)
Computer Science/Signal and Image Processing
Identifier(s)
Bibliographic key laroche2017MLNMFD
Export
Last update
on january 18, 2017 by Clément Laroche


Responsable du service
Dominique Asselineau dominique.asselineau@telecom-paristech.fr
Copyright © 1998-2017, Télécom ParisTech/Dominique Asselineau