OpenAI transcribed over a million hours of YouTube videos to train GPT-4

France Nouvelles Nouvelles

OpenAI transcribed over a million hours of YouTube videos to train GPT-4
France Dernières Nouvelles,France Actualités
  • 📰 verge
  • ⏱ Reading Time:
  • 19 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 11%
  • Publisher: 67%

A new report details how OpenAI, Google, and Meta have tried to get over the limits of good data available online — for OpenAI, that includes generating YouTube transcripts.

Earlier this week, The Wall Street Journal reported that AI companies were running into a wall when it comes to gathering high-quality training data. Today, The New York Times detailed some of the ways companies have dealt with this. Unsurprisingly, it involves doing things that fall into the hazy gray area of AI copyright law.

Bryant said Google takes “technical and legal measures” to prevent such unauthorized use “when we have a clear legal or technical basis to do so.” Google also gathered transcripts from YouTube, according to the Times’ sources. Bryant said that the company has trained its models “on some YouTube content, in accordance with our agreements with YouTube creators.

Nous avons résumé cette actualité afin que vous puissiez la lire rapidement. Si l'actualité vous intéresse, vous pouvez lire le texte intégral ici. Lire la suite:

verge /  🏆 94. in US

France Dernières Nouvelles, France Actualités

Similar News:Vous pouvez également lire des articles d'actualité similaires à celui-ci que nous avons collectés auprès d'autres sources d'information.

OpenAI and Google reportedly used transcriptions of YouTube videos to train their AI modelsOpenAI and Google reportedly used transcriptions of YouTube videos to train their AI modelsCheyenne is Engadget’s weekend editor and covers a little bit of everything. She’s particularly interested in emerging technology and niche gadgets, climate change, space, privacy, and internet culture. She’ll talk your ear off about Tamagotchis if you get her started.
Lire la suite »

OpenAI announces GPT-4 AI language modelOpenAI announces GPT-4 AI language modelOpenAI has revealed its latest AI model, GPT-4. After a huge response to the launch of ChatGPT last year, expectations are high for the new system that can accept both text and image inputs.
Lire la suite »

OpenAI Reportedly Looking to Release GPT-5 This SummerOpenAI Reportedly Looking to Release GPT-5 This SummerScience and Technology News and Videos
Lire la suite »

OpenAI’s GPT Store Is Triggering Copyright ComplaintsOpenAI’s GPT Store Is Triggering Copyright ComplaintsA publisher says some chatbots in OpenAI's GPT Store were created using its copyrighted textbooks. OpenAI has taken down some of the bots but could face more complaints from rights holders.
Lire la suite »

GPT-4 vs. GPT-3.5: how much difference is there?GPT-4 vs. GPT-3.5: how much difference is there?Does one use ChatGPT-4 or GPT-3.5? Both versions of the OpenAI chatbot are great, but what are the differences? Let’s find out!
Lire la suite »

Unveiling Sam Altman's Insights from Lex Fridman InterviewUnveiling Sam Altman's Insights from Lex Fridman InterviewSam Altman's insights on AI, GPT-5, AGI, OpenAI, Sora and many more
Lire la suite »



Render Time: 2025-02-26 10:46:36