Getting started

The original readme is below. Here I describe how to get started with experimentation using this slightly-modified setup.

install docker. Build the container with ./build
run the container and attach with ./enter
install the requirements with pip install -r requirements.txt

Run some demos, e.g.:

bash demo.sh wordavg

para-nmt-50m

Code to train models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations".

The code is written in python and requires numpy, scipy, theano, and the lasagne libraries.

To get started, run setup.sh to download a pre-trained 600d Trigram-Word model, a pre-trained 4096d BiLTSM Avg. model, and required files such as sample training data and evaluation data. The full 50M paraphrase corpora as well as a filtered, tokenized, 5M paraphrase corpora are available at http://www.cs.cmu.edu/~jwieting.

There is also a demo script that takes the model that you would like to train as a command line argument (check the script to see available choices). Check main/train.py for command line options.

If you use our code, models, or data for your work please cite:

@inproceedings{wieting-17-millions, author = {John Wieting and Kevin Gimpel}, title = {Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations}, booktitle = {arXiv preprint arXiv:1711.05732}, year = {2017} }

@inproceedings{wieting-17-backtrans, author = {John Wieting, Jonathan Mallinson, and Kevin Gimpel}, title = {Learning Paraphrastic Sentence Embeddings from Back-Translated Bitext}, booktitle = {Proceedings of Empirical Methods in Natural Language Processing}, year = {2017} }

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
shared/src		shared/src
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build		build
enter		enter

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Getting started

para-nmt-50m

About

Uh oh!

Releases

Packages

Languages

hunan-rostomyan/para-nmt-50m

Folders and files

Latest commit

History

Repository files navigation

Getting started

para-nmt-50m

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages