serveurs

Scientists are finding that more can be achieved by removing three-quarters of a neural net.

IST Austria

A major pursuit in the science of artificial intelligence (AI) is the balance between how big a program is and how much data it uses. After all, it costs real money, tens of millions of dollars, to buy Nvidia GPU chips to run AI, and to gather billions of bytes of data to train neural networks -- and how much you need is a question with very practical implications.

Google's DeepMind unit last year codified the exact balance between computing power and training data as a kind of law of AI. That rule of thumb, which has come to be called "The Chinchilla Law", says you can reduce the size of a program to just a quarter of its initial size if you also increase the amount of data it's trained on by four times the initial size.

Also: Can generative AI solve computer science's greatest unsolved problem?

The point of Chinchilla, and it's an important one, is that programs can achieve an optimal result in terms of accuracy while being less gigantic. Build smaller programs, but train for longer on the data, says Chinchilla. Less is more, in other words, in deep-learning AI, for reasons not yet entirely understood.

In a paper published this month, DeepMind and its collaborators build upon that insight by suggesting it's possible to do even better by stripping away whole parts of the neural network, pushing performance further once a neural net has hit a wall.

Also: Generative AI will far surpass what ChatGPT can do. Here's why

According to lead author Elias Frantar of Austria's Institute of Science and Technology, and collaborators at DeepMind, you can get the same results in term of accuracy from a neural network that's half the size of another if you employ a technique called "sparsity".

Sparsity, an obscure element of neural networks that has been studied for years, is a technique that borrows from the actual structure of human neurons. Sparsity refers to turning off some of the connections between neurons. In human brains, these connections are known as synapses.

The vast majority of human synapses don't connect. As scientist Torsten Hoefler and team at the ETH Zurich observed in 2021, "Biological brains, especially the human brain, are hierarchical, sparse, and recurrent structures," adding, "the more neurons a brain has, the sparser it gets."

The thinking goes that if you could approximate that natural phenomenon of the very small number of connections, you could do a lot more with any neural net with a lot less effort -- and a lot less time, money, and energy.

Also: Microsoft, TikTok give generative AI a sort of memory

In an artificial neural network, such as a deep-learning AI model, the equivalent of synaptic connections are "weights" or "parameters". Synapses that don't have connections would be weights that have zero values -- they don't compute anything, so they don't take up any computing energy. AI scientists refer to sparsity, therefore, aszeroing-outthe parameters of a neural net.

In the new DeepMind paper, posted on the arXiv pre-print server, Frantar and team ask, if smaller networks can equal the work of larger networks, as the prior study showed, how much can sparsity help push performance even further by removing some weights?

The researchers discover that if you zero out three-quarters of the parameters of a neural net -- making it more sparse -- it can do the same work as a neural net over two times its size.

As they put it: "The key take-away from these results is that as one trains significantly longer than Chinchilla (dense compute optimal), more and more sparse models start to become optimal in terms of loss for the same number of non-zero parameters." The term "dense compute model" refers to a neural net that has no sparsity, so that all its synapses are operating.

"This is because the gains of further training dense models start to slow down significantly at some point, allowing sparse models to overtake them." In other words, normal, non-sparse models -- dense models -- start to break down where sparse versions take over.

Also:We will see a completely new type of computer, says AI pioneer Geoff Hinton

The practical implication of this research is striking. When a neural network starts to reach its limit in terms of performance, actuallyreducingthe amount of its neural parameters that function -- zeroing them out -- will extend the neural net's performance further as you train the neural net for a longer and longer time.

"Optimal sparsity levels continuously increase with longer training," write Frantar and team. "Sparsity thus provides a means to further improve model performance for a fixed final parameter cost."

For a world worried about the energy cost of increasingly power-hungry neural nets, the good news is that scientists are finding even more can be done with less.

Artificial Intelligence

Generative AI will far surpass what ChatGPT can do. Here's everything on how the tech advancesChatGPT's new web browsing feature is a big disappointment. Use this plugin insteadWhat is Amazon Bedrock? 4 ways it can help businesses use generative AI toolsCan generative AI solve computer science's greatest unsolved problem?

Generative AI will far surpass what ChatGPT can do. Here's everything on how the tech advances
ChatGPT's new web browsing feature is a big disappointment. Use this plugin instead
What is Amazon Bedrock? 4 ways it can help businesses use generative AI tools
Can generative AI solve computer science's greatest unsolved problem?

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

serveurs

Nouvelles chaudes

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Less is a lot more when it comes to AI, says Google's DeepMind

Artificial Intelligence

Tags chauds: Intelligence artificielle Innovation et Innovation

Ordering Guide

Ressources ressources

À propos de nous

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

serveurs

Nouvelles chaudes

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

​Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Less is a lot more when it comes to AI, says Google's DeepMind

Artificial Intelligence

Tags chauds: Intelligence artificielle Innovation et Innovation

Ordering Guide

Ressources ressources

À propos de nous

Introduction to Huawei CloudEngine S6730-H Series Switches