Evaluation frame counts inconsistent #32

my-other-github-account · 2024-10-30T04:47:04Z

Hello,

I see in the paper that default MLLM configs were largely used, but frame counts were increased where applicable.

Certain models such as LongVA appear to support video contexts up to 1000 frames, but only 128 are used in the benchmark. If models can handle the extra frame context, it seems like it could potentially help their performance.

What determines the frame counts?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation frame counts inconsistent #32

Evaluation frame counts inconsistent #32

my-other-github-account commented Oct 30, 2024

Evaluation frame counts inconsistent #32

Evaluation frame counts inconsistent #32

Comments

my-other-github-account commented Oct 30, 2024