all AI news
The Biggest Pre-Training Dataset to Build LLMs from Scratch!!
Oct. 30, 2023, 8:29 p.m. | 1littlecoder
1littlecoder www.youtube.com
Redpajama Data v2 Announcement - https://together.ai/blog/redpajama-data-v2
Redpajama based Projects - https://huggingface.co/search/full-text?q=redpajama
Redpajama Data Processing Scripts - https://github.com/togethercomputer/RedPajama-Data
Redpajama Data v2 on Hugging Face - https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2
Common Crawl - https://commoncrawl.org/
❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - …
annotations build data data quality dataset filtering languages llms pre-training quality raw redpajama support tokens training
More from www.youtube.com / 1littlecoder
This Is The #1 "open" Coding LLM (with a twist)
4 days, 9 hours ago |
www.youtube.com
WARNING: Bad News for CHATGPT!
5 days, 6 hours ago |
www.youtube.com
They Mixed Every small LLM Into One LARGE Expert!!!
6 days, 3 hours ago |
www.youtube.com
I wish every AI Engineer could watch this.
1 week, 4 days ago |
www.youtube.com
NEW PC with (Paranoid) JARVIS AI!!!
1 week, 6 days ago |
www.youtube.com
Poorman's ChatGPT-4o Works!! 🤣
2 weeks, 4 days ago |
www.youtube.com
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer
@ GPTZero | Toronto, Canada
ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)
@ HelloBetter | Remote
Doctoral Researcher (m/f/div) in Automated Processing of Bioimages
@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena
Seeking Developers and Engineers for AI T-Shirt Generator Project
@ Chevon Hicks | Remote
Principal Data Architect - Azure & Big Data
@ MGM Resorts International | Home Office - US, NV
GN SONG MT Market Research Data Analyst 11
@ Accenture | Bengaluru, BDC7A