Dec. 29, 2023, 11:24 a.m. | Matthias Bastian

THE DECODER the-decoder.com


The Chinese government has released a dataset to train language models that reflect their political views. This is another example of how the Chinese government is trying to control generative AI.


The article CCP releases politically approved LLM dataset with 50 billion tokens appeared first on THE DECODER.

ai and language ai in china ai in practice article artificial intelligence billion chinese control dataset decoder example generative government language language models llm political releases the decoder tokens train

More from the-decoder.com / THE DECODER

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

DevOps Engineer (Data Team)

@ Reward Gateway | Sofia/Plovdiv