
Code to accompany EMNLP paper on nonstandard word dissemination online.


ianbstewart/nonstandard_word_dissemination


Nonstandard word dissemination

This repository accompanies the 2018 EMNLP paper concerning dissemination of nonstandard words online.

I. Stewart and J. Eisenstein. Making "fetch" happen: The influence of social and linguistic context on nonstandard word growth and decline. Proceedings of EMNLP 2018. Paper here.

Structure

  • data/

Directory for the term frequency, social dissemination, and linguistic dissemination data that the scripts compute.

  • scripts/

Directory for all the data processing and analysis scripts.

  • writing/

Directory for all the writing (NWAV, EMNLP, ML@GT).

Before starting

We assume that you have already downloaded the raw Reddit comment data from here for 2013-06 through 2016-05. Each file is about 5 GB zipped, so make sure your server or computer has enough free disk space before downloading.
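As a rough planning aid, the sketch below lists the months in the 2013-06 through 2016-05 range and estimates the total download size. It assumes one zipped file per month at about 5 GB each, which this README does not state explicitly. The names `month_range`, `APPROX_GB_PER_FILE`, and `DOWNLOAD_DIR` are hypothetical and do not come from the repository's scripts.

```python
import shutil
from datetime import date

# Assumption: one zipped comment file per month, about 5 GB each.
APPROX_GB_PER_FILE = 5
# Hypothetical download location; change it to wherever you store the raw data.
DOWNLOAD_DIR = "."


def month_range(start, end):
    """Yield 'YYYY-MM' strings for every month from start to end, inclusive."""
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield f"{year:04d}-{month:02d}"
        month += 1
        if month > 12:
            year, month = year + 1, 1


months = list(month_range(date(2013, 6, 1), date(2016, 5, 1)))
total_gb = len(months) * APPROX_GB_PER_FILE

# shutil.disk_usage reports sizes in bytes for the filesystem holding the path.
free_gb = shutil.disk_usage(DOWNLOAD_DIR).free / 1e9

print(f"{len(months)} months, roughly {total_gb} GB zipped")
print(f"Free space at {DOWNLOAD_DIR!r}: {free_gb:.0f} GB")
if free_gb < total_gb:
    print("Warning: probably not enough free space for the full download.")
```

The estimate covers only the zipped files. Any unzipped copies or derived data will need additional space.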

The scripts have the following dependencies:

  • Python
    • scipy
    • numpy
    • pandas
    • matplotlib
    • sklearn
    • statsmodels
    • nltk
    • lifelines
    • langid
    • stopwords
  • R
    • relaimpo
    • causaldrf
    • survival
  • notebook
    • jupyter-notebook
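Before running the scripts, it can help to confirm that the Python dependencies listed above are importable. The following is a minimal sketch and is not part of the repository. It assumes each listed name is also the module's import name, which is uncertain for `stopwords` in particular. The helper `missing_modules` is hypothetical. The R packages and the Jupyter notebook are not checked.

```python
import importlib.util

# Import names copied from the dependency list above.
# Assumption: each entry is importable under the same name as listed.
PYTHON_DEPENDENCIES = [
    "scipy",
    "numpy",
    "pandas",
    "matplotlib",
    "sklearn",
    "statsmodels",
    "nltk",
    "lifelines",
    "langid",
    "stopwords",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(PYTHON_DEPENDENCIES)
if missing:
    print("Missing Python modules:", ", ".join(missing))
else:
    print("All listed Python modules were found.")
```

Note that `sklearn` is the import name, while the package is distributed as `scikit-learn`.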
