Experts warn we could run out of data to train AI by 2026

Published 19:35 13 Nov 2023 GMT

This data loss could alter the trajectory of the AI revolution

Rhianna Benson

Featured Image Credit: Getty Stock Images

Topics: Artificial Intelligence, Science, Technology, Money, Social Media

Rhianna is an Entertainment Journalist at LADbible Group, working across LADbible, UNILAD and Tyla. She has a Masters in News Journalism from the University of Salford and a Masters in Ancient History from the University of Edinburgh. She previously worked as a Celebrity Reporter for OK! and New Magazines, and as a TV Writer for Reach PLC.

X: @rhiannaBjourno

There's no doubt that artificial intelligence is reaching the peak of its popularity with social media users around the world.

I mean, is there anything more fun than sussing out what a celebrity couple's future baby will look like? Or hearing how the late Freddie Mercury might have performed Doja Cat's 'Paint the Town Red'?

But according to some scientists, humans might run out of the type of data needed to fully train artificial intelligence by 2026.

Running out of this data, which fuels powerful AI systems across the globe, could slow the growth of AI models, particularly large language models.

This shortage may even alter the trajectory of the AI revolution.

This data is needed to train accurate, high-quality AI algorithms. ChatGPT, for example, was trained using 570 gigabytes of text data (around 300 billion words).

If there isn't enough data to train such programs (including DALL-E, Lensa and Midjourney), they could produce inaccurate or low-quality outputs.

Scientists predict we could run out of data needed to train AI models.
Getty/Westend61

The quality of this data is also hugely important. Low-quality data (e.g. blurry pictures and social media posts) is easy to source, but it isn't sufficient to train high-performing AI models.

Text taken from social media platforms might also be biased or prejudiced, or may include disinformation or illegal content, which the model could in turn replicate.

This explains why high-quality content such as text from books, online articles, scientific papers, Wikipedia, and certain filtered web content is being sought out.

In a paper published last year, a group of researchers predicted that we could run out of this high-quality data by 2026 if current AI training trends continue.

They also estimated that low-quality language data could run out between 2030 and 2050.

Some experts say the situation isn't as bad as it seems.
Getty/TEK IMAGE/Science Photo Library

This has left many computer and data scientists around the world feeling concerned, given that AI is also predicted to contribute up to US$15.7 trillion to the world's economy by 2030.

Other experts are reassuring tech users that the situation may not be as bad as it seems, since there are still hundreds of unknowns about how AI models will develop in the future.

They also say there are ways of addressing potential data shortages, including AI developers improving algorithms so they use data more efficiently.

These scientists say that, in the coming years, it is likely that less data and less computational power will be needed to train high-performance models, which will in turn reduce AI's carbon footprint.

A move towards using AI to create the synthetic data needed to train systems is also being suggested.
