论文部分内容阅读
The problem of automatically labelling the appearances of characters in video with their names is challenging due to the huge variation in the appearance of each character and the weakness and ambiguity of available annotations.We can achieve high precision by combining multiple sources of information,both visual and textual.The principal novelties that we introduce in this paper are:(i)extracting face features in video by neural network;(ii)strengthening the mapping between names and faces by analyzing the co-occurrence of names and faces;(iii)automatically and efficiently labelling appearances of main characters with their names.