Video Highlight Prediction Using Audience Chat Reactions
read the original abstract
Sports channel video portals offer an exciting domain for research on multimodal, multilingual analysis. We present methods addressing the problem of automatic video highlight prediction based on joint visual features and textual analysis of the real-world audience discourse with complex slang, in both English and traditional Chinese. We present a novel dataset based on League of Legends championships recorded from North American and Taiwanese Twitch.tv channels (will be released for further research), and demonstrate strong results on these using multimodal, character-level CNN-RNN model architectures.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
E-Sports Talent Scouting Based on Multimodal Twitch Stream Data
Neural features from Twitch streams are pooled via hierarchical Bayesian model to estimate CS:GO gamer intrinsic skill, validated by correlation with subsequent public ranks.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.