Web: https://www.reddit.com/r/LanguageTechnology/comments/sesv7z/replicate_webnlg_2017_challenge_with_opennmttf/

Jan. 28, 2022, 3:06 p.m. | /u/Dario_Della

Natural Language Processing reddit.com

Hello guys, I'm a data science student and I'm trying to replicate the WebNLG 2017 challenge with OpenNMT-tf.

I have already performed the same challenge with OpenNMT-py and everything went well.

With the TensorFlow version, a few questions came up:

  • how to build vocabularies from the output files of webnlg_baseline_input.py: ['train-webnlg-all-delex.triple', 'train-webnlg-all-delex.lex', 'dev-webnlg-all-delex.triple', 'dev-webnlg-all-delex.lex'], since the TensorFlow version requires an extra step that converts them into a BPE file;
  • how to build the default OpenNMT-py model (an LSTM with 2 layers of 500 units) in OpenNMT-tf.
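For the first point, here is a minimal sketch of a word-level vocabulary builder in plain Python, with no BPE involved. It rests on two assumptions that are not confirmed in this post: that OpenNMT-tf accepts a plain text vocabulary with one token per line, and that it expects the special tokens `<blank>`, `<s>` and `</s>` at the top of the file. The helper names `build_vocab` and `write_vocab` are hypothetical, and this is not the library's own vocabulary tool.

```python
from collections import Counter
from typing import Iterable, List

# Assumption: these three entries are reserved at the top of the vocabulary file.
SPECIAL_TOKENS = ["<blank>", "<s>", "</s>"]


def build_vocab(lines: Iterable[str], max_size: int = 0,
                min_frequency: int = 1) -> List[str]:
    """Count whitespace-separated tokens and return them ranked by frequency."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    # Sort by descending frequency, then alphabetically so the order is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    tokens = [token for token, freq in ranked if freq >= min_frequency]
    if max_size > 0:
        tokens = tokens[:max_size]
    return SPECIAL_TOKENS + tokens


def write_vocab(tokens: List[str], path: str) -> None:
    """Write one token per line, UTF-8 encoded."""
    with open(path, "w", encoding="utf-8") as handle:
        for token in tokens:
            handle.write(token + "\n")
```

Under these assumptions, reading 'train-webnlg-all-delex.triple' line by line into `build_vocab` and passing the result to `write_vocab` would give a source vocabulary, and the same two calls on 'train-webnlg-all-delex.lex' would give the target vocabulary. Only the training files are used here, so the dev files do not leak tokens into the vocabulary.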

I tried …

