The technique, “reinforcement learning from human feedback,” is now driving the development of artificial intelligence across the industry.
For years, companies like Google and OpenAI have relied on such workers to prepare data used to train A.I.
Reinforcement learning from human feedback is far more sophisticated than the rote data-tagging work that fed A.I.
Last year, OpenAI and one of its competitors, Anthropic, used freelance workers in the United States through the website Upwork.
Hugging Face, another prominent lab, is using U.S. workers hired through the data curation start-ups Scale AI and Surge.
Organizations:
Google, Workers
Locations:
United States, India, Africa, chatbots