New_Bechdel

A program that builds an alternative to the Bechdel test.

The dataset is derived from the Cornell Movie-Dialogs Corpus.
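As a rough illustration of how the corpus files can be read, the sketch below parses rows that use the corpus's ` +++$+++ ` field separator and tallies utterances by speaker gender. The sample rows, the character and movie names, and the helper names (`parse_characters`, `lines_by_gender`, `female_share`) are invented for this example and are not taken from the repository's notebooks.

```python
from collections import Counter

SEP = "+++$+++"  # field separator used by the corpus text files

# Made-up rows in the layout of movie_characters_metadata.txt:
# characterID, name, movieID, movie title, gender, credit position
CHARACTERS = """
u0 +++$+++ ALICE +++$+++ m0 +++$+++ example movie +++$+++ f +++$+++ 1
u1 +++$+++ BOB +++$+++ m0 +++$+++ example movie +++$+++ M +++$+++ 2
u2 +++$+++ GUARD +++$+++ m0 +++$+++ example movie +++$+++ ? +++$+++ ?
"""

# Made-up rows in the layout of movie_lines.txt:
# lineID, characterID, movieID, character name, utterance text
LINES = """
L1 +++$+++ u0 +++$+++ m0 +++$+++ ALICE +++$+++ Did you read the report?
L2 +++$+++ u1 +++$+++ m0 +++$+++ BOB +++$+++ Not yet.
L3 +++$+++ u0 +++$+++ m0 +++$+++ ALICE +++$+++ Please do.
L4 +++$+++ u2 +++$+++ m0 +++$+++ GUARD +++$+++ Halt!
"""


def parse_characters(text):
    """Map characterID -> lowercase gender code ('f', 'm' or '?')."""
    genders = {}
    for row in text.strip().splitlines():
        fields = [f.strip() for f in row.split(SEP)]
        char_id, gender = fields[0], fields[4]
        genders[char_id] = gender.lower()
    return genders


def lines_by_gender(text, genders):
    """Count utterances per speaker gender; unknown speakers count as '?'."""
    counts = Counter()
    for row in text.strip().splitlines():
        # maxsplit=4 keeps any separator-like text inside the utterance intact
        fields = [f.strip() for f in row.split(SEP, 4)]
        counts[genders.get(fields[1], "?")] += 1
    return counts


def female_share(counts):
    """Fraction of gender-labelled utterances spoken by female characters."""
    labelled = counts["f"] + counts["m"]
    return counts["f"] / labelled if labelled else 0.0


genders = parse_characters(CHARACTERS)
counts = lines_by_gender(LINES, genders)
share = female_share(counts)
```

Characters whose gender is unlabelled (`?`) are left out of the share here; whether to drop them or infer a gender for them is a design choice this sketch does not settle.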

Steps to consider in the model

Explore the data.
Create and identify features.
Gather crew data and link it with the existing dataset.
Compute the percentage of each movie's dialogue lines spoken by female characters.
Consider dialogues of the form f-f, f-m, and m-f (female-to-female, female-to-male, and male-to-female).
Identify part-of-speech (POS) tags per movie, and count the f-f, f-m, and m-f utterances.
Identify the sentiment of each utterance in the movie, and compare the sentiments across f-f, f-m, and m-f exchanges.
Link the overall sentiment of the movie with its genres.
Identify how all of the above parameters affect the ROI of the movie.
We define ROI as the ratio of the movie's budget to its revenue.
We obtain principal components from the parameters produced by the steps above, in order to capture the variability in the independent variables (features).
We then compute the participation of each feature in the first few principal components, and use it to establish a score, as follows:
Score = b1*P1 + b2*P2 + b3*P3 + ...
where b1, b2, ... are the participations of feature1, feature2, ... respectively in the first few principal components.
We regress ROI (the dependent variable) on the principal components obtained from the features above, to obtain their regression coefficients and see how ROI is related to them.
Finally, we look up the existing Bechdel score and show how the new score is better at identifying gender equality or gender diversity in a particular movie.
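The PCA, scoring, and regression steps above can be sketched with NumPy alone. This is a minimal sketch under stated assumptions, not the repository's implementation: the feature names and data are synthetic, "participation" is interpreted as the absolute loading of a feature on the retained components weighted by each component's explained variance, and P1, P2, ... are taken to be the standardized feature values. ROI follows the definition above (budget divided by revenue), so smaller values mean the movie earned more relative to its cost.

```python
import numpy as np

rng = np.random.default_rng(0)
n_movies = 50

# Hypothetical feature names; values are synthetic, for illustration only.
feature_names = ["female_line_share", "ff_share", "sentiment_ff", "sentiment_fm"]
X = rng.normal(size=(n_movies, len(feature_names)))

# Synthetic budgets and revenues; ROI = budget / revenue as defined above.
budget = rng.uniform(1.0, 10.0, n_movies)
revenue = rng.uniform(1.0, 30.0, n_movies)
roi = budget / revenue


def pca(X, k):
    """PCA via SVD of the standardized data, keeping the first k components."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    variance = S**2 / (len(Z) - 1)       # variance captured by each component
    ratio = variance / variance.sum()    # explained-variance ratio
    loadings = Vt[:k].T                  # shape (n_features, k)
    pcs = Z @ loadings                   # component scores, shape (n_movies, k)
    return Z, pcs, loadings, ratio[:k]


def feature_weights(loadings, ratio):
    """Assumed 'participation': |loading| weighted by explained variance,
    normalized so the weights b1, b2, ... sum to 1."""
    w = np.abs(loadings) @ ratio
    return w / w.sum()


Z, pcs, loadings, ratio = pca(X, k=2)
weights = feature_weights(loadings, ratio)

# Score = b1*P1 + b2*P2 + ... with P_i taken as standardized feature values.
score = Z @ weights

# Ordinary least squares of ROI on the retained components (with intercept).
A = np.column_stack([np.ones(n_movies), pcs])
coef, *_ = np.linalg.lstsq(A, roi, rcond=None)

for name, b in zip(feature_names, weights):
    print(f"{name}: weight {b:.3f}")
print("intercept and component coefficients:", coef)
```

Because the data here is random, the printed weights and coefficients carry no meaning; the sketch only shows how the pieces fit together. Other definitions of participation (for example squared loadings, or loadings after a varimax rotation) would change the weights.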

Folders and files

.ipynb_checkpoints
Code
Data
Sentiment-BOW
cornell movie-dialogs corpus
gender_recognition
images
Analysis_Latest.xlsx
Analysis_csv.csv
Bechdel_Approved.csv
Big_dump.pickle
Crew_Dataset.csv
Crew_Info_Tagging.ipynb
Cumulative_plot.PNG
Dialog_Denormalizing_Krishna.ipynb
FINAL_DATASET.csv
Final_Analysis.ipynb
Gender_Analysis.html
Gender_Character_Analysis.ipynb
Genre_Tagging.html
Genre_Tagging.ipynb
Genre_Tagging2.ipynb
Good_Book.csv
Heatmap.PNG
LICENSE
Loadings.PNG
New Bechdel.html
New Bechdel.ipynb
New_Bechdel_Class_Labels.csv
Normalised_New_Bechdel_Class_Labels.csv
Normalised_PCA_Clustering.ipynb
PCA.ipynb
PCA_Scratch.ipynb
POS.csv
README.md
SVC.ipynb
Sentiment_Analysis.html
Sentiment_Analysis.ipynb
Untitled.ipynb
Varimax_rotated.PNG
Year.csv
ba_mtm_csv.csv
bechdel_json_parser.py
cornell_movie_dialogs_corpus.zip
file.csv
final.csv
iris.pdf
iris2.pdf
mc_csv.csv
mcm_csv.csv
ml_csv.csv
mtm_csv.csv
nn_vb_jj_pos_dicts.pickle
p579-alm.pdf
pickle_dumps.py
pickle_dumps2.py
pickle_dumps3.py
pos_tags_utterances.pickle
result_df.pickle
sentiment_analysis_model.pkl
test.py
test1.py

Done with the above Steps

ck090/Identifying-Missing-Component-in-the-Bechdel-Test-Using-PCA
