Hi, I'm trying to run ZOE on a new dataset and the following questions were raised:
-
In the main.py, should I comment out runner.elmo_processor.load_cached_embeddings("target.min.embedding.pickle", "wikilinks.min.embedding.pickle")? If yes, could you show me how these two files are generated and what are the format for the raw version of these two files? Currently I found running new data is extremely slow (processed 30 sentences after one night). Anything idea how I can speed up things?
-
Are there any other files/data I need to generate for testing on new dataset? (maybe vocab_test.txt?)
Thank you!
Hi, I'm trying to run ZOE on a new dataset and the following questions were raised:
In the main.py, should I comment out runner.elmo_processor.load_cached_embeddings("target.min.embedding.pickle", "wikilinks.min.embedding.pickle")? If yes, could you show me how these two files are generated and what are the format for the raw version of these two files? Currently I found running new data is extremely slow (processed 30 sentences after one night). Anything idea how I can speed up things?
Are there any other files/data I need to generate for testing on new dataset? (maybe vocab_test.txt?)
Thank you!