
How fake security reports are swamping open-source projects, thanks to AI

Feb. 11, 2025 Hi-network.com
AI is being used to fake security reports, patches, and feature requests aimed at open-source projects

You'd think artificial intelligence (AI) would be a boon for developers. After all, a recent Google survey found that 75% of programmers rely on AI. On the other hand, almost 40% report having "little or no trust" in AI. Open-source project maintainers -- the people who keep these projects running -- can certainly understand that distrust.

Many AI LLMs cannot deliver usable code

First, many AI large language models (LLMs) cannot deliver usable code for even simple projects. Far more troubling, however, is that open-source maintainers are finding that hackers are weaponizing AI to undermine open-source projects' foundations.

Also: Dumping open source for proprietary rarely pays off: Better to stick a fork in it

As Greg Kroah-Hartman, the Linux stable kernel maintainer, observed in early 2024, Common Vulnerabilities and Exposures (CVEs), the master list of security holes, are "abused by security developers looking to pad their resumes." They submit many "stupid things." With AI scanning tools, numerous CVEs are now being granted for bugs that don't exist. These security holes are rated for severity using the Common Vulnerability Scoring System (CVSS).
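For readers unfamiliar with how that scoring works, here is a minimal Python sketch that maps a CVSS v3.1 base score to its qualitative severity band. The bands are the standard CVSS v3.1 ratings; the cvss_severity function name and the example scores are illustrative, not taken from any project's tooling.

def cvss_severity(base_score: float) -> str:
    """Return the standard CVSS v3.1 qualitative severity for a 0.0-10.0 base score."""
    if not 0.0 <= base_score <= 10.0:
        raise ValueError("CVSS base scores range from 0.0 to 10.0")
    if base_score == 0.0:
        return "None"      # no impact
    if base_score <= 3.9:
        return "Low"
    if base_score <= 6.9:
        return "Medium"
    if base_score <= 8.9:
        return "High"
    return "Critical"

print(cvss_severity(9.8))  # "Critical": the kind of rating maintainers must drop everything for
print(cvss_severity(2.2))  # "Low"

The trouble is that the number alone says nothing about whether the bug behind it is real, which is exactly why a bogus report that lands a high score can eat so much of a maintainer's time.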

Worse still, as Dan Lorenc, CEO of security company Chainguard, observed, the National Vulnerability Database (NVD), which oversees CVEs, has been underfinanced and overwhelmed, so we can "expect a massive backlog of entries and false negatives."

Wasting valuable time on fake security issues

With government staffing cuts expected at the NVD's parent organization, the flood of bogus AI-generated security reports making it into the CVE lists will only increase. This, in turn, means programmers, maintainers, and users will all have to waste valuable time on fake security issues.

Some open-source projects, such as Curl, have given up on CVEs entirely. As Daniel Stenberg, the leader of the Curl project, said, "CVSS is dead to us."

Also: Why Mark Zuckerberg wants to redefine open source so badly

He's far from the only one to see this problem.

Seth Larson, Python Software Foundation security developer-in-residence, wrote: "Recently, I've noticed an uptick in extremely low-quality, spammy, and LLM-hallucinated security reports to open-source projects. The issue is that in the age of LLMs, these reports appear at first glance to be potentially legitimate and thus require time to refute." Larson believes these slop reports "should be treated as if they are malicious."

Patches introducing new vulnerabilities or backdoors

Why? Because these patches, while appearing legitimate at first glance, often contain code that is entirely wrong and nonfunctional. In the worst case, these patches will, the Open Source Security Foundation (OpenSSF) predicts, introduce new vulnerabilities or backdoors.

Alongside fake patches and security reports, AI is being employed to generate a deluge of feature requests across various open-source repositories. These requests, while sometimes seeming innovative or helpful, are often impractical, unnecessary, or simply impossible to implement. The sheer volume of these AI-generated requests overwhelms maintainers, making it hard to distinguish genuine user needs from artificial noise.

Also: We have an official open-source AI definition now, but the fight is far from over

Jarek Potiuk, a maintainer of Apache Airflow, an open-source workflow management platform, reported that the Outlier AI company had encouraged its members to post issues to the project "that make no sense and are either copies of other issues or completely useless and make no sense. This takes valuable time of maintainers who have to evaluate and close the issues. My investigation tracked to you as the source of problems -- where your instructional videos are tricking people into creating those issues to -- apparently train your AI."

These AI-driven issues have also been reported in Curl and React. To quote Potiuk: "This is wrong on so many levels. Please STOP. You are giving the community a disservice."

Fake contributions

The mechanics of deception behind these fake contributions are becoming increasingly sophisticated. AI models can now produce code snippets that, while nonfunctional, appear syntactically correct and contextually relevant. In addition, AI generates detailed explanations that mimic the language and style of a genuine contributor. Adding insult to injury, according to OpenSSF, some attackers use AI to create fake online identities, complete with GitHub histories containing thousands of minor but seemingly legitimate contributions.
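OpenSSF's point about manufactured contributor histories suggests one practical counter-check: look at the account itself before weighing its contribution. The sketch below uses the public GitHub REST API's GET /users/{username} endpoint to pull basic account metadata; the 30-day threshold and the looks_suspicious helper are assumptions for illustration, not a rule any project is known to apply.

import json
import urllib.request
from datetime import datetime, timezone

def fetch_user(username: str) -> dict:
    # Public, unauthenticated GitHub endpoint; rate limits apply.
    url = f"https://api.github.com/users/{username}"
    req = urllib.request.Request(url, headers={"Accept": "application/vnd.github+json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

def account_age_days(user: dict) -> int:
    # created_at is ISO 8601 with a trailing "Z" (UTC).
    created = datetime.fromisoformat(user["created_at"].replace("Z", "+00:00"))
    return (datetime.now(timezone.utc) - created).days

def looks_suspicious(user: dict) -> bool:
    # Assumed heuristic: a brand-new account with no visible following.
    return account_age_days(user) < 30 and user.get("followers", 0) == 0

user = fetch_user("octocat")
print(user["login"], account_age_days(user), "days old; flagged:", looks_suspicious(user))

A check like this only raises a flag for a human to review; as the OpenSSF warning implies, a patient attacker can age accounts and pad them with small commits, so no single signal is decisive.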

The consequences of this AI-driven open-source code spam campaign are far-reaching. Besides maintainers wasting time sifting through and debunking fake contributions, this influx of AI-generated spam undermines the trust that forms the bedrock of open-source collaboration.

Stricter guidelines and verification processes

The open-source community is not standing idly by in the face of this threat. Projects are implementing stricter contribution guidelines and verification processes to weed out AI-generated content. In addition, maintainers share experiences and best practices for identifying and dealing with AI-generated code spam.
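What those verification processes look like varies by project, but much of it comes down to simple triage scripting. Here is a minimal sketch, using only Python's standard library, of one such filter: flagging incoming issues whose text is a near-copy of another, the "copies of other issues" pattern Potiuk described. The example issue texts, the 0.8 threshold, and the flag_duplicates helper are assumptions for illustration, not any project's actual process.

from difflib import SequenceMatcher

def similarity(a: str, b: str) -> float:
    """Rough 0-to-1 similarity between two issue bodies."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def flag_duplicates(issues: list[str], threshold: float = 0.8) -> list[tuple[int, int]]:
    """Return index pairs of issues whose bodies look like near-copies."""
    pairs = []
    for i in range(len(issues)):
        for j in range(i + 1, len(issues)):
            if similarity(issues[i], issues[j]) >= threshold:
                pairs.append((i, j))
    return pairs

incoming = [
    "Please add an option to export workflow runs as CSV files.",
    "Please add an option to export the workflow runs as a CSV file.",
    "Scheduler crashes when a DAG file contains a syntax error.",
]
print(flag_duplicates(incoming))  # [(0, 1)]: the two near-identical requests get flagged

Heuristics like this are cheap and easy to tune, which is partly why maintainers trade them among themselves rather than relying on any one tool.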

Also: Red Hat's take on open-source AI: Pragmatism over utopian dreams

As the battle against AI-generated deception in open-source projects continues, the community faces a critical challenge: preserving the collaborative spirit of open-source development while defending against increasingly sophisticated and automated attempts at manipulation.

As open-source programmer Navendu Pottekkat wrote: "Please don't turn this into a 'let's spam open-source projects' fest." Please, please don't. If you value open source, don't play AI games with it.

