AI companies are too cheap to pay for legit books

Big tech companies are using published books to train their artificial intelligence models—not just without obtaining authorization from their authors, but also by pirating the books and denying the authors their sales royalties.

Big Tech is profiting off pirated books and cheap content

The piracy habit illustrates Big Tech’s tendency to pinch pennies wherever people can be exploited. A software engineer at OpenAI who works with content from these books can make an annual salary of up to $370,000. Many book authors, though, never see that kind of income from their writing in their lifetimes, and yet their work is being used to refine and commercialize AI engines.

Although OpenAI’s valuation rose to $29 billion in June, the company has also been accused of hiring Sama, a California-based agency that allegedly underpaid Kenyan workers to refine ChatGPT. Those workers made between $1.32 and $2 per hour, a small fraction of California’s minimum wage of $16.99 per hour.

Meta, too, has found itself under fire for similar reasons. This past March, Meta indicated that its largest investments would henceforth be in AI, and a month later, it announced that it would spend $33 billion to “introduce AI agents to billions of people in ways that will be useful and meaningful.” In June, it launched Llama 2, its latest large language model for commercial use.

But despite these grand announcements of expenditure, Meta has been hit with accusations that its subcontracted employees, recruited through Sama, work in poor conditions. Last year, one such former employee sued Meta and Sama in Nairobi, alleging labor exploitation and the suppression of union organizing efforts.

Google has invested $300 million in Anthropic, a company founded by ex-OpenAI employees and the developer of Claude, an AI chatbot that rivals ChatGPT. It is unclear how much Google has invested in its own Bard chatbot, which has been released to a wide audience in more than 40 languages.

Yet many of the people hired to train Bard are reportedly overworked, undertrained, and underpaid. Some contractors, pressured to deliver complex text audits on tight deadlines, make as little as $14 per hour. In contrast, the median salary for an AI engineer at Google is $230,745.

