Apple trained a large language model to efficiently understand long-form video
Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means.
Expand Expanding Close