June 28, 2022, 3:05 p.m. | /u/BlockDesigns

Machine Learning www.reddit.com

Transformers are awesome for so many things in 2022, but one thing I've found them to struggle with is generating embeddings for long documents.

I put together a blog post going through some interesting techniques. Let me know if it helped you!

[Blog post](https://www.notia.ai/articles/clustering-long-documents)

clustering machinelearning transformers

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Research Associate (Data Science/Information Engineering/Applied Mathematics/Information Technology)

@ Nanyang Technological University | NTU Main Campus, Singapore

Associate Director of Data Science and Analytics

@ Penn State University | Penn State University Park

Student Worker- Data Scientist

@ TransUnion | Israel - Tel Aviv

Vice President - Customer Segment Analytics Data Science Lead

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India

Middle/Senior Data Engineer

@ Devexperts | Sofia, Bulgaria