"You think that's funny?" Topic modelling and text generation using Amazon and Netflix stand-up comedy scripts

This project explores topic extraction techniques as well as the construction of two language generation model on an atypical collection of documents: stand-up comedy scripts.

The goal is two-fold:

a) The first is using standard Natural Language Processing (NLP) techniques to investigate and summarize the topics debated in a corpus of 143 scripts of stand-up comedy shows released by Amazon and Netflix between 2013 and 2021;
b) The second involves building on the knowledge acquired from the first part to create two Recurrent Neural Network models with different architectures and test the language capabilities of the most performing one in generating new text.

The project is structured in 5 parts:

1) Construction of the dataset
2) Exploratory Data Analysis
3) Topic modelling
4) Text generation
5) Conclusions

The notebook "You_think_thats_funny_SINGLE_FILE.ipynb" merges the five sections in a unique file.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Stand_up_specials_subs		Stand_up_specials_subs
01_Construction_dataset.ipynb		01_Construction_dataset.ipynb
02_Exploratory_Data_Analysis.ipynb		02_Exploratory_Data_Analysis.ipynb
03_Topic_modelling.ipynb		03_Topic_modelling.ipynb
04_Text_generation.ipynb		04_Text_generation.ipynb
05_Conclusions.ipynb		05_Conclusions.ipynb
Installed packages Capstone project.txt		Installed packages Capstone project.txt
LICENSE		LICENSE
List_stand_up_comedy_full.csv		List_stand_up_comedy_full.csv
README.md		README.md
Stand_up_comedy_dataset.csv		Stand_up_comedy_dataset.csv
You_think_thats_funny_SINGLE_FILE.ipynb		You_think_thats_funny_SINGLE_FILE.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"You think that's funny?" Topic modelling and text generation using Amazon and Netflix stand-up comedy scripts

About

Releases

Packages

Languages

License

slashlan/standupcomedynlp

Folders and files

Latest commit

History

Repository files navigation

"You think that's funny?" Topic modelling and text generation using Amazon and Netflix stand-up comedy scripts

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages