OpenAI and Google reportedly used transcriptions of YouTube videos to train their AI models

OpenAI and Google trained their AI models on text transcribed from YouTube videos, potentially violating creators’ copyrights, according to The New York Times. The report, which describes the lengths OpenAI, Google and Meta have gone to in order to maximize the amount of data they can feed to their AIs, cites numerous people with knowledge of the companies’ practices. It comes just days after YouTube CEO Neal Mohan addressed, in an interview, OpenAI’s alleged use of YouTube videos to train its new text-to-video generator, Sora.

According to the NYT, OpenAI used its Whisper speech recognition tool to transcribe more than one million hours of YouTube videos, which were then used to train GPT-4. Earlier reporting had indicated that OpenAI used YouTube videos and podcasts to train the two AI systems. OpenAI president Greg Brockman was reportedly among the people on the team involved. Per Google’s rules, “unauthorized scraping or downloading of YouTube content” is not allowed, Matt Bryant, a spokesperson for Google, told the NYT, adding that the company was unaware of any such use by OpenAI.

The report, however, claims there were people at Google who knew but did not take action against OpenAI because Google was using YouTube videos to train its own AI models. Google told NYT it only does so with videos from creators who have agreed to take part in an experimental program. Engadget has reached out to Google and OpenAI for comment.

The NYT report also claims Google tweaked its privacy policy in June 2022 to more broadly cover its use of publicly available content, including Google Docs and Google Sheets, to train its AI models and products. Bryant told NYT that this is only done with the permission of users who opt into Google’s experimental features, and that the company “did not start training on additional types of data based on this language change.”
