March 11, 2024, 6:24 p.m. | /u/analyze_hunter

Machine Learning www.reddit.com

I’m working on a research project that uses machine learning to discover patterns in gene promoters. I’m concerned about data leakage in our model, but I want some outside opinions before I push my lab too hard to change our methodology. Perhaps someone in this community can help me understand this issue better.

Our model is an L1-regularized logistic regression that uses sequence patterns as features to predict a binary outcome for a transcription start …
