seq2seq-keyphrase-pytorch

Note: this repository is basically deprecated. Please move to our latest code/data/model release for keyphrase generation at https://github.com/memray/OpenNMT-kpg-release.

Current code is developed on PyTorch 0.4, not sure if it works on other versions.

A subset of data (20k docs) is provided here for you to test the code. Unzip and place it to data/.

If you need to train on the whole kp20k dataset, download the json data and run preprocess.py first. No trained model will be released in the near future.

Update I will not be updating this repo for a while. But please see the information below to help you run the code. Some Some test datasets in JSON format: download

preprocess.py: entry for preprocessing datasets in JSON format.
train.py: entry for training models.
predict.py: entry for generating phrases with well-trained models (checkpoints).

You can refer to these scripts as examples.

Note that duplicate papers that appear in popular test datasets (e.g. Inspec, SemEval) are also included in the KP20k training dataset. Please be sure to remove them before training.

Name		Name	Last commit message	Last commit date
Latest commit History 230 Commits
pykp		pykp
script		script
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
beam_search.py		beam_search.py
config.py		config.py
evaluate.py		evaluate.py
logger_test.py		logger_test.py
output.txt		output.txt
predict.py		predict.py
preprocess.py		preprocess.py
preprocess_testset.py		preprocess_testset.py
requirements.txt		requirements.txt
run_examples.sh		run_examples.sh
stat_print.py		stat_print.py
train.py		train.py
train_rl.py		train_rl.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

seq2seq-keyphrase-pytorch

Note: this repository is basically deprecated. Please move to our latest code/data/model release for keyphrase generation at https://github.com/memray/OpenNMT-kpg-release.

About

Releases

Packages

Contributors 4

Languages

License

memray/seq2seq-keyphrase-pytorch

Folders and files

Latest commit

History

Repository files navigation

seq2seq-keyphrase-pytorch

Note: this repository is basically deprecated. Please move to our latest code/data/model release for keyphrase generation at https://github.com/memray/OpenNMT-kpg-release.

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages