Correspondence Between Audio And Visual Deep Models For Musical Instrument Detection In Video Recordings | Publicación