VIDEO_CAPTIONING_PIPELINE

We attempt to tackle the challenging task of recipe generation from videos using only pre-trained models. We divided the process of recipe generation into various modules which include event generation, frame extraction, featurizing frames, removing frame redundancy, frame enhancement, frame captioning, and summarization using LLM. We used various pre-trained models to perform different tasks required to achieve desired results at each stage of our recipe generation pipeline. We used the temporal nature of videos, and the power of image embeddings, and harnessed the power of LLMs to extract meaningful content and generate recipes in an efficient manner. We have demonstrated the quality of the recipe generated using various metrics which highlight the impact of our work.

Experiment 1

Experiment 2

Experiment 3

Experiment 4

For detailed explanation refer to the report and the video presentation which contains demos. https://docs.google.com/presentation/d/1R0FjAj_QXoLjxR3NsZRVTFgYu-KnKj2BpN4EcOvyuGI/edit#slide=id.g21eccad0113_0_38

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
Results		Results
----bv0V6ZjWI.mp4		----bv0V6ZjWI.mp4
---AfxeTnCbVQ.mp4		---AfxeTnCbVQ.mp4
.DS_Store		.DS_Store
COMP646_Report.pdf		COMP646_Report.pdf
Pipeline.ipynb		Pipeline.ipynb
README.md		README.md
Running_PDVC.ipynb		Running_PDVC.ipynb
bard.py		bard.py
captions.py		captions.py
download_youcookii_videos.py		download_youcookii_videos.py
dvc_results.json		dvc_results.json
dvc_results_test.json		dvc_results_test.json
frames.py		frames.py
meteor_metric.py		meteor_metric.py
model.py		model.py
transcipts.py		transcipts.py
trial.txt		trial.txt
valid_transcript_id.txt		valid_transcript_id.txt
yc2_test.json		yc2_test.json
yc2_train.json		yc2_train.json
yc2_val.json		yc2_val.json
yolo.py		yolo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VIDEO_CAPTIONING_PIPELINE

Experiment 1

Experiment 2

Experiment 3

Experiment 4

About

Releases

Packages

Languages

hemanthkumar17/VIDEO_CAPTIONING_PIPELINE

Folders and files

Latest commit

History

Repository files navigation

VIDEO_CAPTIONING_PIPELINE

Experiment 1

Experiment 2

Experiment 3

Experiment 4

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages