Skip to content

ZeweiChu/MQR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

MQR

The is the repository for the paper How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions

Train Data

Dev/Test Data and Model Predictions

  • The dev and test datasets are under directory data
  • The rewritten dev/test splits can be found under each subdirectory of data

Annotation

License

The MQR dataset is under cc-by-sa 4.0 license, intended to be shared and remixed.

The MQR dataset is partially constructed from the Stack Exchange data dumps

We used Quora Question Pairs dataset as part of the training data

We also used the Paralex dataset for training

Reference

@inproceedings{chu-mqr-20,
  author    = {Zewei Chu and Mingda Chen and Jing Chen and Miaosen Wang and Kevin Gimpel and Manaal Faruqui and Xiance Si},
  title     = {How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions},
  booktitle = {Proc. of {AAAI}},
  year      = {2020}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published