MovieBench

A Hierarchical Movie Level Dataset for Long Video Generation

🎶 Updates

Release Dataset with Character Bank and Shot-level Description.
Dec. 16, 2024. Release DataSplit, Scene Split.
Dec. 16, 2024. Release the Scripts for Shot-Level Annotation Generation with GPT4.
Nov. 22, 2024. Rep initialization.

🎶 Todo

Building Leaderboard.
Release Metric Scripts.

🐱 Abstract

MovieBench is a hierarchical dataset designed for advancing research in long video generation. It addresses the challenges associated with generating coherent, movie-length videos by providing a dataset that includes:

Rich, coherent storylines and multi-scene narratives, capturing the essence of movie-level storytelling.
Consistency of character appearance and audio across scenes, enabling character-focused generation.
Hierarchical data structure, featuring both high-level movie information and detailed shot-level descriptions for fine-grained analysis.

This dataset aims to inspire new research directions and tackle challenges such as maintaining character ID consistency across multiple scenes for various characters.

Key Features

Movie-Length Video Descriptions: Comprehensive descriptions capturing scene transitions and narrative coherence.
Character Bank: Includes detailed information on character identities, roles, and visual/audio consistency across scenes.
Shot-Level Details: Fine-grained descriptions for each shot, including visual and narrative cues.
Open Access and Continuous Updates: The dataset is publicly available and will be regularly updated to include more diverse content and refined annotations.

⏬ Download Data

Video

This work is annotated and tested based on the LSMDC dataset. For details about the source movies, please refer to the dataset documentation.

Please make sure to request access to the MPII Movie Description dataset (MPII-MD) first and cc the approval email to [email protected].

Character Bank & Shot Level Description

The Annotation of MovieBench is available for download from the following locations:

Hugging Face: Access the dataset on Hugging Face's platform at:
MovieBench on Hugging Face
Google Drive: Alternatively, you can download the dataset from Google Drive:
MovieBench on Google Drive

Both sources contain the full dataset, including character bank information, shot-level descriptions. Make sure to check the platform that best suits your needs.

├── MovieBench/ 
|   ├── Annotation_Shot_Desc_11.15_V2_160movies
|   |   ├—— 0001_American_Beauty
|   |   ├—— 0002_As_Good_As_It_Gets
|   |   |   ├—— 0002_As_Good_As_It_Gets_00.00.43.459-00.00.43.636.json
|   |   |   ├—— ...
|   ├── Character_Bank_All_11.16
|   |   ├—— 0001_American_Beauty
|   |   ├—— 0002_As_Good_As_It_Gets
|   |   |   ├—— Helen_Hunt-Carol_Connelly
|   |   |   |   ├—— best.jpg
|   |   |   |   ├—— ...
|   ├── movies_scenes.json
|   ├── data_split.json
|   ├── mad2plot.json

⏬ Shot-Level Annotation Generation with GPT4

We developed our Shot-Level Annotation Generation system based on MovieSeq, leveraging GPT-4 to enhance its functionality.

Using a Visual Language Model (e.g., GPT-4), you can generate detailed annotations that include the following elements:

{
    "Characters":
    {
        "Character Name 1": "Description for appearance and behavior of Character 1, within 30 words",
        "Character Name 2": "Description for appearance and behavior of Character 2, within 30 words", 
    },
    "Style Elements":
    [
        "Element 1", "Element 2", "Element 3"
    ],
    "Plot":"A concise summary focusing on the main event or emotion, within 80 words",
    "Background Description":"A concise summary focusing on the main event or emotion, within 40 words",
    "Camera Motion":"A concise summary focusing camera motion, within 30 words."
}

For detailed environment setup and usage instructions, please refer to the corresponding README.

📖BibTeX

@misc{wu2024moviebenchhierarchicalmovielevel,
  title={MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation}, 
  author={Weijia Wu and Mingyu Liu and Zeyu Zhu and Xi Xia and Haoen Feng and Wen Wang and Kevin Qinghong Lin and Chunhua Shen and Mike Zheng Shou},
  year={2024},
  eprint={2411.15262},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2411.15262}, 
  }

🤗Acknowledgements

Thanks to Diffusers for the wonderful work.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
MovieBench		MovieBench
asset		asset
data		data
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MovieBench

A Hierarchical Movie Level Dataset for Long Video Generation

🎶 Updates

🎶 Todo

🐱 Abstract

Key Features

⏬ Download Data

Video

Character Bank & Shot Level Description

⏬ Shot-Level Annotation Generation with GPT4

📖BibTeX

🤗Acknowledgements

About

Releases

Packages

Languages

showlab/MovieBench

Folders and files

Latest commit

History

Repository files navigation

MovieBench

A Hierarchical Movie Level Dataset for Long Video Generation

🎶 Updates

🎶 Todo

🐱 Abstract

Key Features

⏬ Download Data

Video

Character Bank & Shot Level Description

⏬ Shot-Level Annotation Generation with GPT4

📖BibTeX

🤗Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages