Skip to content

showlab/MovieBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MovieBench

A Hierarchical Movie Level Dataset for Long Video Generation

🎶 Updates

  • Release Dataset with Character Bank and Shot-level Description.
  • Dec. 16, 2024. Release DataSplit, Scene Split.
  • Dec. 16, 2024. Release the Scripts for Shot-Level Annotation Generation with GPT4.
  • Nov. 22, 2024. Rep initialization.

🎶 Todo

  • Building Leaderboard.
  • Release Metric Scripts.

🐱 Abstract

MovieBench is a hierarchical dataset designed for advancing research in long video generation. It addresses the challenges associated with generating coherent, movie-length videos by providing a dataset that includes:

  1. Rich, coherent storylines and multi-scene narratives, capturing the essence of movie-level storytelling.
  2. Consistency of character appearance and audio across scenes, enabling character-focused generation.
  3. Hierarchical data structure, featuring both high-level movie information and detailed shot-level descriptions for fine-grained analysis.

This dataset aims to inspire new research directions and tackle challenges such as maintaining character ID consistency across multiple scenes for various characters.


image.


image.

Key Features

  • Movie-Length Video Descriptions: Comprehensive descriptions capturing scene transitions and narrative coherence.
  • Character Bank: Includes detailed information on character identities, roles, and visual/audio consistency across scenes.
  • Shot-Level Details: Fine-grained descriptions for each shot, including visual and narrative cues.
  • Open Access and Continuous Updates: The dataset is publicly available and will be regularly updated to include more diverse content and refined annotations.

⏬ Download Data

Video

This work is annotated and tested based on the LSMDC dataset. For details about the source movies, please refer to the dataset documentation.

Please make sure to request access to the MPII Movie Description dataset (MPII-MD) first and cc the approval email to [email protected].

Character Bank & Shot Level Description

The Annotation of MovieBench is available for download from the following locations:

Both sources contain the full dataset, including character bank information, shot-level descriptions. Make sure to check the platform that best suits your needs.

├── MovieBench/ 
|   ├── Annotation_Shot_Desc_11.15_V2_160movies
|   |   ├—— 0001_American_Beauty
|   |   ├—— 0002_As_Good_As_It_Gets
|   |   |   ├—— 0002_As_Good_As_It_Gets_00.00.43.459-00.00.43.636.json
|   |   |   ├—— ...
|   ├── Character_Bank_All_11.16
|   |   ├—— 0001_American_Beauty
|   |   ├—— 0002_As_Good_As_It_Gets
|   |   |   ├—— Helen_Hunt-Carol_Connelly
|   |   |   |   ├—— best.jpg
|   |   |   |   ├—— ...
|   ├── movies_scenes.json
|   ├── data_split.json
|   ├── mad2plot.json

⏬ Shot-Level Annotation Generation with GPT4

We developed our Shot-Level Annotation Generation system based on MovieSeq, leveraging GPT-4 to enhance its functionality.

image description

Using a Visual Language Model (e.g., GPT-4), you can generate detailed annotations that include the following elements:

{
    "Characters":
    {
        "Character Name 1": "Description for appearance and behavior of Character 1, within 30 words",
        "Character Name 2": "Description for appearance and behavior of Character 2, within 30 words", 
    },
    "Style Elements":
    [
        "Element 1", "Element 2", "Element 3"
    ],
    "Plot":"A concise summary focusing on the main event or emotion, within 80 words",
    "Background Description":"A concise summary focusing on the main event or emotion, within 40 words",
    "Camera Motion":"A concise summary focusing camera motion, within 30 words."
}

For detailed environment setup and usage instructions, please refer to the corresponding README.

📖BibTeX

@misc{wu2024moviebenchhierarchicalmovielevel,
  title={MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation}, 
  author={Weijia Wu and Mingyu Liu and Zeyu Zhu and Xi Xia and Haoen Feng and Wen Wang and Kevin Qinghong Lin and Chunhua Shen and Mike Zheng Shou},
  year={2024},
  eprint={2411.15262},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2411.15262}, 
  }

🤗Acknowledgements

  • Thanks to Diffusers for the wonderful work.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published