Unsupervised person clustering in videos with cross-modal communication | IEEE Conference Publication | IEEE Xplore