La Startup d’ia Anthropic dévoile les principes moraux derrière Chatbot Claude

serveurs

Alphabet-backed AI startup Anthropic has disclosed the set of value guidelines that has been used to train its ChatGPT rival, Claude, in the wake of concerns about incorrect and biased information being being given to users of generative AI programs.

Founded by former senior members of Microsoft-backed OpenAI in 2021, Anthropic made the decision to train its Claude on constitutional AI, a system that uses a "set of principles to make judgments about outputs," which helps Claude to "avoid toxic or discriminatory outputs" such as helping a human engage in illegal or unethical activities, according to a blog Anthropic posted this week. Anthropic says this has enabled it to broadly create an AI system that is "helpful, honest, and harmless."

It was a smart decision on Anthropic's part to publicly outline the set of principles being used to train Claude, said Avivah Litan, distinguished analyst at Gartner Research.

"It starts the dialogue and, more importantly, actions regarding the principles that generative AI should be trained on to keep it safe, trustworthy, and aligned with human values and the preservation of human civilization," Litan said. "They don't have to get it perfect now - it's really good to see a starting point that the community can fine tune over time with dialogue and debate."

What is constitutional AI?

Unlike traditional AI chatbots that rely on feedback from humans during their training, AI models that are trained on constitutional AI are first taught to critique and revise their own responses according to the set of constitutional AI principles established by the parent company. This is then followed by a second training phase consisting of reinforcement learning, during which the model uses AI-generated feedback to choose the more harmless output.

In its blog post, the company outlined what it's dubbed "Claude's Constitution," which contains elements of existing sources, including the United Nations Declaration of Human Rights, Apple's data privacy rules, and Sparrow Principles by DeepMind. The company also said it had made an effort to also include non-western perspectives in its constitution.

Anthropic said that it developed many of its principles through a process of trial and error but found that broad requirements - such as "Do NOT choose responses that are toxic, racist, or sexist, or that encourage or support illegal, violent, or unethical behavior" - have been the most successful. However, the company acknowledged that this training model also came with challenges, in particular that the model was becoming "judgmental" and "annoying."

"Our principles run the gamut from the commonsense (don't help a user commit a crime) to the more philosophical (avoid implying that AI systems have or care about personal identity and its persistence)," Anthropic said.

Last week, Anthropic co-founder Dario Amodei was among a host of executives from leading AI companies to meet with US President Joe Biden and Vice President Kamala Harris to discuss the potential dangers of AI.

"President Biden dropped by the meeting to underscore that companies have a fundamental responsibility to make sure their products are safe and secure before they are deployed or made public," a statement from the White House read, adding that Biden and Harris believe that in order to realize the benefits from AI, current and potential risks must also be mitigated.

As generative AI has continued to make headlines, concerns have continued to be raised about the potential risks posed by the technology, including its ability to hallucinate responses - make things up that have little to no basis in fact.

Concerns about AI 'fake news'

In March, Apple co-founder Steve Wozniak, Twitter owner Elon Musk, and a group of 1,100 technology leaders and scientists called for a six-month pause in developing systems more powerful than OpenAI's newly launched GPT-4, warning of the potential threat to democracy if chatbots pretending to be humans could flood social media platforms with propaganda and "fake news."

AI experts at MIT have also said this week that as generative AI developers continue to push ahead at breakneck speed, keeping the technology from hallucinating and spewing erroneous or offensive responses is nearly impossible.

While Litan said that she believes constitutional AI is the only practical and viable route AI developers can take to make sure their models are safe, she did acknowledge there are some limitations with this approach."[There's a chance] the model will not be trained properly and will go awry and against the intentions programmed into the system," Litan said, noting that with Reinforced Learning from Human Feedback (RLHF), humans can steer the AI model into the direction humans want.? "However, this will become constrained over time as the models become smarter than the humans giving them feedback," she noted.

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

serveurs

Nouvelles chaudes

S5735-L24P4XE-A-V2: Huawei’s Smart Choice for High-Density Campus Deployments

S5735-L24P4X-A1: Huawei’s High-Performance Access Switch Redefining Campus Networking

Huawei S5735-L24P4S-A1 Review: Reliable Gigabit Access with Enterprise-Grade Features

What Is an Orthogonal Architecture?

Huawei s5735-l24p4s-a-v2 Delivers Scalable, Secure, and Smart PoE Access for Modern IT Infrastructures

Huawei S5735-L48T4XE-A-V2 Switch Delivers Enterprise-Grade Performance in a Compact Design

Huawei S5735-L48P4XE-A-V2 Review: Versatile Campus Switch with iStack and Full L3 Support

Differences Between Huawei CE Series and S Series Switches

Huawei CloudEngine S5735 Switches Set the Benchmark for High-Performance, Energy-Efficient Switching

Huawei CloudEngine S5731‑S48P4X Datasheet

Huawei CloudEngine S5731‑S24P4X Datasheet

Huawei S5731-S Empowers Next-Generation Campus Networks with Advanced Capabilities

Huawei S5731-H24P4XC Switch Review: Power-Packed Performance and Smart PoE

Huawei S5731-H Series Switches Redefine Campus Networking with Intelligent High-Performance Architecture

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

AI startup Anthropic unveils moral principles behind chatbot Claude

What is constitutional AI?

Concerns about AI 'fake news'

Tags chauds: Intelligence artificielle Ia générative Les Chatbots Développement de logiciels

Ordering Guide

Ressources ressources

À propos de nous

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

serveurs

Nouvelles chaudes

S5735-L24P4XE-A-V2: Huawei’s Smart Choice for High-Density Campus Deployments

S5735-L24P4X-A1: Huawei’s High-Performance Access Switch Redefining Campus Networking

Huawei S5735-L24P4S-A1 Review: Reliable Gigabit Access with Enterprise-Grade Features

What Is an Orthogonal Architecture?

Huawei s5735-l24p4s-a-v2 Delivers Scalable, Secure, and Smart PoE Access for Modern IT Infrastructures

Huawei S5735-L48T4XE-A-V2 Switch Delivers Enterprise-Grade Performance in a Compact Design

Huawei S5735-L48P4XE-A-V2 Review: Versatile Campus Switch with iStack and Full L3 Support

Differences Between Huawei CE Series and S Series Switches

Huawei CloudEngine S5735 Switches Set the Benchmark for High-Performance, Energy-Efficient Switching

Huawei CloudEngine S5731‑S48P4X Datasheet

Huawei CloudEngine S5731‑S24P4X Datasheet

Huawei S5731-S Empowers Next-Generation Campus Networks with Advanced Capabilities

Huawei S5731-H24P4XC Switch Review: Power-Packed Performance and Smart PoE

Huawei S5731-H Series Switches Redefine Campus Networking with Intelligent High-Performance Architecture

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

AI startup Anthropic unveils moral principles behind chatbot Claude

What is constitutional AI?

Concerns about AI 'fake news'

Tags chauds: Intelligence artificielle Ia générative Les Chatbots Développement de logiciels

Ordering Guide

Ressources ressources

À propos de nous

Huawei CloudEngine S5731‑S48P4X Datasheet