Connect with us

AI

AI Safety Under the Microscope: Researchers Develop Benchmark for Assessing Model Risks

Published

on

To go back to this article, navigate to My Profile and then select View saved stories.

Experts Have Assessed AI Models for Potential Dangers, Uncovering Vast Discrepancies

Bo Li, a faculty member at the University of Chicago with expertise in challenging and exposing the flaws in AI models, has emerged as a key resource for several advisory companies. These firms are increasingly prioritizing concerns over the legal, ethical, and regulatory risks posed by AI models over their intelligence levels.

Li, along with associates from various academic institutions, Virtue AI – a company initiated by Li, and Lapis Labs, have recently formulated a classification system for AI hazards. Additionally, they've created a standard that measures the extent to which various substantial language models deviate from established rules. Li conveyed to WIRED the necessity for establishing guidelines aimed at ensuring AI safety, both in the context of adhering to regulations and in everyday applications.

The team examined policies on AI governance and standards set by authorities from regions such as the United States, China, and the European Union, along with reviewing the operational policies of 16 leading AI firms globally.

The scientists developed AIR-Bench 2024, a benchmarking tool that employs a multitude of prompts to assess how well-known AI models perform regarding particular risks. For instance, it reveals that Anthropic's Claude 3 Opus is highly effective at declining to create cybersecurity threats, whereas Google's Gemini 1.5 Pro excels in not producing nonconsensual sexual content.

Databricks' DBRX Instruct, a newly developed model, performed poorly in all areas. Upon its launch in March, the company pledged ongoing enhancements to the safety mechanisms of DBRX Instruct.

Anthropic, Google, and Databricks have yet to reply to a request for a statement.

Grasping the landscape of risks, along with the advantages and disadvantages of particular models, could grow in importance for businesses aiming to implement AI in specific markets or for certain applications. For example, a business planning to utilize a Large Language Model (LLM) for customer support might prioritize the model's tendency to generate inappropriate language under provocation over its ability to conceptualize a nuclear device.

Bo highlights that the study uncovers intriguing challenges in the development and oversight of AI. Specifically, the team discovered that corporate guidelines are generally more detailed than governmental regulations, indicating potential areas for stricter regulatory measures.

The study also indicates that there is potential for numerous firms to enhance the security of their models. "When you evaluate certain models based on a company's internal guidelines, they don't always align," Bo notes. "This implies there's significant scope for enhancement."

This week, a pair of researchers from MIT introduced a comprehensive database they've created, which aggregates the potential dangers of AI drawn from 43 distinct AI risk frameworks. According to Neil Thompson, a research scientist at MIT who is part of the initiative, the aim is to provide clarity in the complex and often chaotic domain of AI risks. He notes that numerous organizations are still in the initial phases of integrating AI, highlighting the crucial need for advice on the potential hazards involved.

Peter Slattery, who is spearheading the initiative and is part of MIT’s FutureTech group focusing on advancements in computing, points out that the database reveals a disparity in the attention given to various AI risks. For example, while over 70 percent of the frameworks discussed highlight concerns about privacy and security, only about 40 percent address the issue of misinformation.

As artificial intelligence continues to advance, the strategies used to identify and assess its dangers must also progress. Li emphasizes the need to delve into new challenges, including the compelling nature of AI interactions. Her firm conducted an in-depth study of Meta's most advanced Llama 3.1 model, revealing that despite its enhanced functionality, its safety has not seen comparable advancements, indicating a wider issue. "There hasn't been a noticeable enhancement in safety," Li observes.

Discover More…

Explore the World of Politics: Subscribe to our newsletter and tune into our podcast.

The outcomes of distributing no-strings-attached cash to individuals

Weight loss isn't guaranteed for all Ozempic users

The Pentagon plans to allocate $141 billion towards the development of an apocalypse device.

Gathering Announcement: Be part of the Energy Tech Summit happening on October 10th in Berlin.

Additional Content from WIRED

Critiques and Manuals

© 2024 Condé Nast. All rights reserved. Purchases made through our site may generate revenue for WIRED as part of our Affiliate Partnerships with retail partners. Content from this site is prohibited from being copied, shared, distributed, or used in any form without explicit written consent from Condé Nast. Ad Choices

Choose a global website


Discover more from Automobilnews News - The first AI News Portal world wide

Subscribe to get the latest posts sent to your email.

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

SUBSCRIBE FOR FREE

Advertisement
Automakers & Suppliers1 hour ago

Unveiling Ferrari’s Latest Supercar Innovations: A Deep Dive into Maranello’s Masterpieces and Cutting-Edge Technologies

Sports2 hours ago

Nigel Mansell Criticizes Ferrari’s “Short-Sighted” Decision on Adrian Newey, Predicts Bright Future for Aston Martin

AI2 hours ago

Revealing the AI Gap: How U.S. Teens Outpace Their Parents in Generative AI Use and Understanding

Sports2 hours ago

Peter Windsor Dismisses Russell’s Pirelli Complaints as “Nonsense,” Questions Mercedes Driver’s Approach Post-Azerbaijan GP

AI3 hours ago

Revolutionizing Creativity: YouTube to Unleash Generative AI Video Creation with Veo Model Integration

Sports3 hours ago

Wolff Identifies Tyre Temperature Control as Mercedes’ Key Challenge at Singapore Grand Prix

AI3 hours ago

SocialAI: Navigating the Echo Chamber of AI-Generated Companions

AI3 hours ago

Into the AI Abyss: Navigating the Uncanny World of SocialAI

Sports3 hours ago

Nigel Mansell Weighs in on McLaren’s Team Strategy: Urges Lando Norris to “Step Up” Amid Title Race

AI4 hours ago

Lionsgate and AI Firm Runway Forge Groundbreaking Partnership: A New Era for Film Production and Copyright Concerns

Cars & Concepts4 hours ago

Renault Master H2 Tech: Der Wasserstoff-Revolutionär mit 700 km Reichweite stellt sich in Hannover vor

AI4 hours ago

UN Calls for Global AI Oversight with Urgency Matching Climate Change Initiatives

Cars & Concepts4 hours ago

AEC Erschließt Europäischen Markt mit GMC Yukon und Sierra – Luxuriöse US-Größen zu Stolzen Preisen

AI5 hours ago

Adult Industry Advocates Seek Inclusion in AI Regulation Talks, Highlighting Oversight Risks

Cars & Concepts5 hours ago

Alfa Romeo Junior (2024) Debütiert in Deutschland: Preise und Details zu Hybrid- und Elektromodellen

Politics5 hours ago

Unveiling the Westminster Accounts: A Comprehensive Guide to MPs’ Earnings and Donations

Politics5 hours ago

Unveiling Political Finances: Explore MPs’ Earnings and Donations with the New Westminster Accounts Tool

Business5 hours ago

Meituan’s Delivery Workers Earn $11 Billion in 2023 as CEO Wang Xing Addresses Gig Worker Welfare Concerns Amidst Policy Pressure

Politics2 months ago

News Outlet Clears Sacked Welsh Minister in Leak Scandal Amidst Ongoing Political Turmoil

Moto GP4 months ago

Enea Bastianini’s Bold Stand Against MotoGP Penalties Sparks Debate: A Dive into the Controversial Catalan GP Decision

Sports4 months ago

Leclerc Conquers Monaco: Home Victory Breaks Personal Curse and Delivers Emotional Triumph

Moto GP4 months ago

Aleix Espargaro’s Valiant Battle in Catalunya: A Lion’s Heart Against Marc Marquez’s Precision

Moto GP4 months ago

Raul Fernandez Grapples with Rear Tyre Woes Despite Strong Performance at Catalunya MotoGP

Sports4 months ago

Verstappen Identifies Sole Positive Amidst Red Bull’s Monaco Struggles: A Weekend to Reflect and Improve

Moto GP4 months ago

Joan Mir’s Tough Ride in Catalunya: Honda’s New Engine Configuration Fails to Impress

Sports4 months ago

Leclerc Triumphs at Home: 2024 Monaco Grand Prix Round 8 Victory and Highlights

Sports4 months ago

Leclerc’s Monaco Triumph Cuts Verstappen’s Lead: F1 Championship Standings Shakeup After 2024 Monaco GP

Sports4 months ago

Perez Shaken and Surprised: Calls for Penalty After Dramatic Monaco Crash with Magnussen

Sports4 months ago

Gasly Condemns Ocon’s Aggressive Move in Monaco Clash: Team Harmony and Future Strategies at Stake

Business4 months ago

Driving Success: Mastering the Fast Lane of Vehicle Manufacturing, Automotive Sales, and Aftermarket Services

Cars & Concepts2 months ago

Chevrolet Unleashes American Powerhouse: The 2025 Corvette ZR1 with Over 1,000 HP

Business4 months ago

Shifting Gears for Success: Exploring the Future of the Automobile Industry through Vehicle Manufacturing, Sales, and Advanced Technologies

AI4 months ago

Revolutionizing the Future: How Leading AI Innovations Like DaVinci-AI.de and AI-AllCreator.com Are Redefining Industries

Business4 months ago

Driving Success in the Fast Lane: Mastering Market Trends, Technological Innovations, and Strategic Excellence in the Automobile Industry

Mobility Report4 months ago

**”SkyDrive’s Ascent: Suzuki Propels Japan’s Leading eVTOL Hope into the Global Air Mobility Arena”**

Tech4 months ago

Driving the Future: Exploring Top Innovations in Automotive Technology for Enhanced Safety, Efficiency, and Connectivity

V12 AI REVOLUTION COMMING SOON !

Get ready for a groundbreaking shift in the world of artificial intelligence as the V12 AI Revolution is on the horizon

SPORT NEWS

Business NEWS

Advertisement

POLITCS NEWS

Chatten Sie mit uns

Hallo! Wie kann ich Ihnen helfen?

Discover more from Automobilnews News - The first AI News Portal world wide

Subscribe now to keep reading and get access to the full archive.

Continue reading

×