Anwar, N. (2025). From Temporal Coherence to Cross-Modal Intelligence: A Modular Framework for Video Object Detection [Doctoral thesis, Polytechnique Montréal]. Available