Reddit said it first The company's IPO, to be announced next week, said it could generate $203 million in revenue over the next few years by licensing user submissions to Google and others for AI projects. The community-driven platform was forced to reveal on Friday that U.S. regulators already have questions about its new line of business.
Reddit received a letter from the U.S. Federal Trade Commission on Thursday asking about “selling, licensing, or sharing user-generated content to train AI models with third parties,” the company said in a regulatory filing. said.
The FTC, the U.S. government's primary antitrust regulator, has the authority to sanction companies found to engage in unfair or deceptive trade practices. The idea of licensing user-generated content for AI projects has drawn questions from lawmakers and rights groups about privacy risks, fairness, and copyright.
Reddit isn't the only company trying to profit from using licensed data, including user-generated data, for AI. Programming Q&A site Stack Overflow has signed a deal with Google, The Associated Press has signed a deal with OpenAI, and Tumblr owner Automattic is working with “select AI companies” but will It said it would allow you to opt-out of data transfer. None of the licensors immediately responded to requests for comment. And Reddit isn't the only company to receive an FTC letter regarding data licensing, Axios reported Friday, citing unnamed former officials.
It's unclear whether the letter to Reddit is directly related to the reviews of other companies.
Reddit said in a disclosure Friday that it does not believe the company engaged in any unfair or deceptive conduct, but warned that responding to government investigations could be costly and time-consuming. did. “The letter indicates that FTC staff is interested in meeting with us to learn more about our plans and intends to request information and documents from us as the FTC continues its investigation. ”, the filing states. According to Reddit, the FTC's letter states the investigation is related to a “private investigation.”
Reddit's 17 billion posts and comments are seen by AI experts as valuable for training chatbots in conversational skills, and last month Reddit announced a deal to license its content to Google. Reddit and Google did not immediately respond to requests for comment. The FTC declined to comment. (Advance Magazine Publishers, the parent company of WIRED publisher Condé Nast, owns a stake in Reddit.)
AI chatbots, such as OpenAI's ChatGPT and Google's Gemini, are seen as a competitive threat to Reddit, publishers, and other ad-supported, content-driven businesses. Last year, the prospect of licensing data to AI developers emerged as a potential benefit of generative AI for some companies.
But using data collected online to train AI models has raised many questions in boardrooms, courtrooms, and Congress. For Reddit and other companies whose data is generated by users, those questions include who really owns the content and whether it's fair to license it without giving the creator a cut. included. Security researchers have discovered that AI models can leak personal data contained in the materials used to create them. And some critics suggest the deal could give powerful companies even more control.