HERSHEY, J. Audio-vision: Using audio-visual synchrony to locate sounds. Advances in Neural Information Processing Systems. 2000, 12
SLANEY, M. FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks. Advances in Neural Information Processing Systems. 2001, 13
LI, D. Multimedia content processing through cross-modal association. Proceedings of the 11th ACM International Conference on Multimedia. 2003, 604-611
FISHER, J. Speaker association with signal-level audiovisual fusion. IEEE Transactions on Multimedia. 2004, 6, 3, 406-413