Tech Crowder

Apple, Anthropic and Other AI Companies Have Reportedly Trained AI Models on Thousands of YouTube Videos

By Rudra Sharma

Apple, Anthropic, and other major artificial intelligence (AI) companies have reportedly trained AI models on data from more than 170,000 YouTube videos. A new report claims that several AI companies used a publicly available dataset called the Pile, which contained the plain text of videos’ subtitles without any video imagery. The data was collected from popular YouTube creators such as MrBeast, Marques Brownlee, and PewDiePie, as well as Indian YouTube creators such as CarryMinati, BB ki Vines, and Ashish Chanchlani.

Several AI Models Reportedly Trained on YouTube Videos

Proof News conducted an investigation and found that subtitle data from as many as 173,536 YouTube videos was taken from more than 48,000 channels. As per the report, EleutherAI, a non-profit AI research lab, curated this dataset. Later, it was used by companies such as Apple, Anthropic, Nvidia, Salesforce, and more. Notably, the AI lab published a research paper detailing the dataset.

EleutherAI created an 800GB data repository dubbed the Pile and made it publicly available for those who wanted to train AI models but could not afford large datasets. The majority of the dataset was taken from publicly available sources such as English Wikipedia, e-books, and more. However, it also contained the subtitles from all the videos compiled in a dataset called YouTube Subtitles.

The report claimed that the Pile was used to train Apple’s OpenELM AI model, on the basis of the research paper’s description. Research papers for AI models from Salesforce, Nvidia, and Anthropic also reportedly mention the use of the dataset.

Anthropic spokesperson Jennifer Martinez told the publication in a statement, “The Pile includes a very small subset of YouTube subtitles. YouTube’s terms cover direct use of its platform, which is distinct from use of the Pile dataset. On the point about potential violations of YouTube’s terms of service, we’d have to refer you to the Pile authors.”

Notably, YouTube’s terms of service prohibit anyone from accessing the videos on the platform using automated means such as robots, botnets or scrapers. YouTube Subtitles would fall under the scraping category. A Google spokesperson told Proof News in an email response that the tech giant has taken “action over the years to prevent abusive, unauthorised scraping.” However, no comments were made about AI companies’ use of the data.

In a post on X (formerly known as Twitter), Marques Brownlee called out Apple for sourcing data from companies that included his videos’ transcripts, but he also highlighted that it was not the iPhone maker’s fault since it did not collect the data.

Apple has sourced data for their AI from several companies

One of them scraped tons of data/transcripts from YouTube videos, including mine

Apple technically avoids “fault” here because they’re not the ones scraping

But this is going to be an evolving problem for a long time https://t.co/U93riaeSlY

— Marques Brownlee (@MKBHD) July 16, 2024

While this dataset was collected and distributed publicly, there could be other instances of data scraping on platforms such as YouTube. With AI companies scrambling to find more data to train their large language models (LLMs), data procurement might continue to enter similar legally grey areas.
