Oct. 19, 2023, 10:07 p.m. | Lennart Langouche

Towards Data Science - Medium towardsdatascience.com

Part 2: Predicting clinical trial outcomes using XGBoost

In the first part of this series I focused on embedding multi-modal real-world data derived from ClinicalTrials.gov. In this article I will implement a basic XGBoost model, train it on the embeddings we created in Part 1 and compare its performance to that of the HINT model (a hierarchical graph neural net) by which this project was inspired.

Workflow schematic (image by author)

These are the steps I will follow in …

article basic clinical clinical trial clinical trials data embedding embeddings gov health-data-science hierarchical large language models multi-modal part performance prediction predictive modeling series series i towards-data-science train world xgboost

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Applied Scientist

@ Microsoft | Redmond, Washington, United States

Data Analyst / Action Officer

@ OASYS, INC. | OASYS, INC., Pratt Avenue Northwest, Huntsville, AL, United States