Grover is a model introduced in Defending Against Neural Fake News that is designed to both generate and detect neural fake news. For this project, Falsified Scientific Literature Generation, we leverage the data created in a prior project to generate fake scientific literature that is nearly indistinguishable from human-written literature. We then test whether Grover classifies the manipulated and plagiarised papers as fake scientific literature.
For more information about Grover, visit the original authors' project page at rowanzellers.com/grover, try the AI2 online demo, read the full paper at arxiv.org/abs/1905.12616, or see the original repo.
A few areas could use improvement. For example, while Grover generated believable body text, it did poorly at generating fake titles, which a model better suited to that task could improve. We also used a DCGAN to generate fake author faces; for scientific papers, it would be better to also include scientific images that correspond to the fake text. Finally, when classifying fake versus human-written text, Grover estimated that, on average, a human-written paper from the Bik dataset had a 68.65% chance of being machine-written. We believe this accuracy could be improved by pretraining the model on scientific literature rather than on news articles, which is what it was built for.
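The reported 68.65% figure is an average of per-paper discriminator scores. A minimal sketch of that aggregation, using made-up placeholder scores rather than the actual Bik-dataset outputs:

```python
def mean_machine_probability(scores):
    """Average the discriminator's P(machine-written) over per-paper scores."""
    if not scores:
        raise ValueError("no scores given")
    return sum(scores) / len(scores)

# Placeholder per-paper probabilities, one per human-written paper;
# the real values come from running Grover's discriminator.
placeholder_scores = [0.72, 0.61, 0.74, 0.68]
print(f"{100 * mean_machine_probability(placeholder_scores):.2f}%")
```

A lower average here would indicate the discriminator is less prone to mislabeling genuine papers as machine-written.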
- Generate fake titles using GPT-2
- Generate fake scientific images to accompany a fake scientific paper (e.g. using a Convolutional VAE)
- Pre-train Grover on scientific papers
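Whichever generator is used, decoding strategy matters: the Grover paper recommends nucleus (top-p) sampling, which samples only from the smallest set of tokens whose cumulative probability reaches p. A minimal stdlib sketch of that filtering step (the token names and probabilities here are illustrative, not from any real model):

```python
import random

def top_p_filter(probs, p=0.9):
    """Keep the smallest set of tokens whose cumulative probability
    reaches p, then renormalize (the nucleus-sampling candidate set)."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, cum = [], 0.0
    for tok, pr in ranked:
        kept.append((tok, pr))
        cum += pr
        if cum >= p:
            break
    total = sum(pr for _, pr in kept)
    return {tok: pr / total for tok, pr in kept}

def sample_token(probs, p=0.9, rng=None):
    """Draw one token from the top-p filtered, renormalized distribution."""
    rng = rng or random.Random()
    filtered = top_p_filter(probs, p)
    toks, weights = zip(*filtered.items())
    return rng.choices(toks, weights=weights, k=1)[0]
```

Truncating the long tail this way avoids the degenerate, low-probability continuations that make generated text easy to spot.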
@inproceedings{zellers2019grover,
title={Defending Against Neural Fake News},
author={Zellers, Rowan and Holtzman, Ari and Rashkin, Hannah and Bisk, Yonatan and Farhadi, Ali and Roesner, Franziska and Choi, Yejin},
booktitle={Advances in Neural Information Processing Systems 32},
year={2019}
}