Musical Instrument Recognition in User-generated Videos using a Multimodal Convolutional Neural Network Architecture | Publicación