Context modulated dynamic networks for actor and action video segmentation with language queries

Publication
Proceedings of the AAAI Conference on Artificial Intelligence