eto_swe_interview: Option 1

Minimal normalization of a json file containing organization affiliation data from https://arxiv.org/

Data was initially explored using jupyter notebook. I looked at how many entries there were versus the number of unique organization entries, the most common words, and the most common abbreviations (defined as having all characters capitalized). Though I could have done more exploration and leveraged these, I decided to normalize the data in a way more similar to the example given in the problem statement and fix abbreviations for common words such as "U." or "U" for "University".

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.gitignore		.gitignore
README.md		README.md
challenge.py		challenge.py
eto_swe_interview_data.json		eto_swe_interview_data.json
explore_org_data.ipynb		explore_org_data.ipynb
test_challenge.py		test_challenge.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

eto_swe_interview: Option 1

About

Releases

Packages

Languages

Coniferish/eto_swe_interview

Folders and files

Latest commit

History

Repository files navigation

eto_swe_interview: Option 1

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages