May 23, 2022, 5:50 p.m. | /u/No_Technology1455

Machine Learning www.reddit.com

I’m doing PCA for different sentence embeddings (word2vec, BERTtweet, InferSent…) of my data. My question is, should I scale these embeddings before putting them into PCA. I know it’s a standard practice in ML when using PCA, but idk if it still stands for sentence embeddings. Also if I should, will standard scaler be ok?

machinelearning scaling

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Research Associate (Data Science/Information Engineering/Applied Mathematics/Information Technology)

@ Nanyang Technological University | NTU Main Campus, Singapore

Associate Director of Data Science and Analytics

@ Penn State University | Penn State University Park

Student Worker- Data Scientist

@ TransUnion | Israel - Tel Aviv

Vice President - Customer Segment Analytics Data Science Lead

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India

Middle/Senior Data Engineer

@ Devexperts | Sofia, Bulgaria