Feb. 26, 2024, 5:44 p.m. | /u/Primary-Wasabi292

Machine Learning www.reddit.com

I am currently trying to generate embeddings I have of a heterogeneous datasource I am working with. The problem is, that for every sample, I have a varying amount of input features. For the ones interested, the problem is biological. In particular I am trying to calculate a gene-level representation of this data and for every gene I have a different number of variants, with each variant having one feature value associated to it.


My solution to homogenise these features …

decoder embeddings encoder encoder-decoder every features gene generate machinelearning question representation sample transformer transformer encoder

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Scientist, gTech Ads

@ Google | Mexico City, CDMX, Mexico

Lead, Data Analytics Operations

@ Zocdoc | Pune, Maharashtra, India