So You Think You Know Text-to-Video Diffusion Models?