Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Written on November 21, 2024