A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification

This repository contains the code and resources from the EMNLP 2018 paper of the same title (see Citation below).

Repo Structure:

word_complexity_lexicon: A lexicon with complexity scores for the ~15,000 most frequent words in the Google Ngram Corpus. The scores are calculated by aggregating human ratings. We release both the aggregated ratings and the individual ratings from each annotator.
SimplePPDBpp: The SimplePPDB++ resource, consisting of around 14.1 million paraphrase rules along with their readability scores.
neural_readability_ranker: Code for our neural readability ranking model.
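
As a rough illustration of how per-annotator ratings could be combined into one score per word, here is a minimal Python sketch. It assumes a simple arithmetic mean and a hypothetical (word, annotator, rating) row layout; the README does not specify the aggregation method or the file format, so treat both as assumptions and check the files in word_complexity_lexicon before relying on it. The function name and the sample ratings are invented for this example.

```python
from collections import defaultdict
from statistics import mean


def aggregate_ratings(rows):
    """Average per-annotator ratings into a single score per word.

    rows: iterable of (word, annotator_id, rating) tuples.
    This layout is hypothetical, not the repository's actual format.
    """
    by_word = defaultdict(list)
    for word, _annotator, rating in rows:
        by_word[word].append(float(rating))
    # Assumption: plain arithmetic mean; the paper may aggregate differently.
    return {word: mean(ratings) for word, ratings in by_word.items()}


# Made-up ratings for illustration only; not taken from the released lexicon.
example_rows = [
    ("cat", "a1", 1), ("cat", "a2", 2),
    ("obfuscate", "a1", 5), ("obfuscate", "a2", 6),
]
scores = aggregate_ratings(example_rows)
```

Keeping the individual ratings alongside the aggregate, as the release does, makes it possible to try other aggregation choices (for example, a median) without re-annotating.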

Citation

Please cite the following paper if you use the above resources in your research:

@InProceedings{EMNLP-2018-Maddela,
  author = 	"Maddela, Mounica and Xu, Wei",
  title = 	"A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification",
  booktitle = 	"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)",
  year = 	"2018",
}
