Connect with us

AI

Enhancing AI Training: OpenAI’s New Strategy to Combine Human Insight with AI Assistance

Published

on

To review this article again, go to My Profile and then click on View saved stories.

Knight Will

OpenAI Advocates for AI Assistance in Training AI Models

A significant factor contributing to ChatGPT's remarkable triumph is the multitude of human trainers that directed the AI behind the chatbot, educating it on the difference between acceptable and unacceptable responses. OpenAI believes that incorporating additional AI to support these human trainers could enhance the intelligence and dependability of AI assistants.

OpenAI led the way in implementing a unique approach called reinforcement learning with human feedback (RLHF) during the creation of ChatGPT. This method involves enhancing an AI system's responses by incorporating evaluations from human participants, aiming to improve the AI's relevance, reduce inappropriate content, and increase precision. The feedback provided by these human evaluators is used to refine the AI's underlying algorithm, influencing its performance. This strategy has been instrumental in both elevating the utility and dependability of conversational agents and curbing their potential for problematic behavior.

“RLHF demonstrates strong performance, yet it's not without its significant drawbacks,” states Nat McAleese, a member of the research team at OpenAI. One issue is the variability in human feedback. Additionally, it can pose a challenge for even experienced individuals to evaluate highly complex results, like advanced software programming. Furthermore, this method may lead the model to generate outputs that appear credible on the surface but lack true accuracy.

OpenAI has advanced its technology by enhancing its top-tier model, GPT-4, creating a specialized version designed to support human evaluators in reviewing code. This upgraded version, named CriticGPT, has demonstrated an ability to identify errors overlooked by humans, with human reviewers preferring its code analysis 63 percent of the time over their own. OpenAI plans to explore how this method can be applied in domains other than coding moving forward.

"McAleese mentions that they are beginning to incorporate this method into their RLHF chat framework. He acknowledges that the method is not without flaws, as CriticGPT can generate incorrect outputs due to hallucinations. However, he believes that this approach could enhance the precision of OpenAI’s models, including ChatGPT, by minimizing human training errors. Furthermore, he suggests this method could be pivotal in advancing AI models' intelligence, potentially enabling humans to train AI that surpasses their own capabilities. 'As the models improve incrementally, we anticipate a growing need for human assistance,' McAleese states."

The latest method is among several currently in progress aimed at enhancing big language models and amplifying their capabilities. Additionally, it contributes to the initiative to guarantee that AI maintains appropriate conduct as it advances in proficiency.

In the early days of this month, Anthropic, which competes with OpenAI and was established by former OpenAI staff, unveiled an enhanced iteration of its chatbot named Claude. This advancement was attributed to refinements in the training process and the quality of data used for training. Additionally, both Anthropic and OpenAI have introduced innovative methods for examining AI models. These techniques aim to comprehend the processes behind their responses, with the goal of reducing the likelihood of undesirable actions like deceit.

A novel approach may aid OpenAI in developing increasingly sophisticated AI systems that generate outputs more aligned with human principles and reliability, particularly if this method is applied beyond coding. OpenAI has announced it is working on its next significant AI project, clearly demonstrating its commitment to ensuring this technology behaves appropriately. This development comes after the dissolution of a key team focused on evaluating AI's long-term dangers. The team, which had been co-led by OpenAI cofounder and former board member Ilya Sutskever—who at one point challenged CEO Sam Altman's leadership before retracting and assisting in his reinstatement—has seen several of its former members openly criticize the company. They argue that OpenAI is proceeding dangerously fast in its quest to develop and market potent AI technologies.

Dylan Hadfield-Menell, an MIT professor focused on AI alignment strategies, notes that the concept of using AI to train more advanced versions has been under consideration for some time. "It's a fairly logical progression," he remarks.

Hadfield-Menell points out that the scientists behind the creation of methods for RLHF had already touched upon similar concepts a few years back. He believes it's still uncertain how widely useful and potent this approach can be. "This could result in significant improvements in specific areas, and it could also act as a precursor to more efficient feedback mechanisms in the future," he suggests.

Authored by Kelly

Authored by Jaina Grey

Authored by David

Authored by Kate Knibbs

Explore Election Period with Our WIRED Politics Lab Newsletter and Podcast

Unconvinced that breakdancing qualifies as an Olympic sport? The global champion shares your sentiments (to some extent).

Investigators unlocked a decade-old encryption to access a cryptocurrency wallet valued at $3 million.

The surprising emergence of the first-ever beauty contest judged by artificial intelligence

Ease the strain on your spine: Discover our top picks for office chairs based on our evaluations.

Caroline Haskins

Dhruv Mehrotra

Knight Will

Additional Content from WIRED

Evaluations and Instructions

© 2024 Condé Nast. Rights reserved. WIRED could receive a share of revenue from items sold through our website, thanks to our Affiliate Partnerships with retail stores. The content of this site is protected and cannot be copied, shared, transmitted, stored in a cache, or utilized in any way without explicit prior consent from Condé Nast. Advertising Options

Choose a global website


Discover more from Automobilnews News - The first AI News Portal world wide

Subscribe to get the latest posts sent to your email.

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

SUBSCRIBE FOR FREE

Advertisement
Moto GP46 mins ago

Francesco Bagnaia Poised for Victory at Misano 2 as Ducati Eyes Historic Milestones

F11 hour ago

Jos Verstappen’s Handshake Deal with Mercedes: Will Max Switch Teams in 2026?

Moto GP1 hour ago

Champion Riders Gabor Talmacsi and Giancarlo Fisichella Endorse Hungary’s Balaton Park Ahead of 2025 MotoGP Debut

F12 hours ago

Red Bull F1 Overhaul: Lambiase Promoted Amid Major Team Restructuring

Moto GP2 hours ago

Jack Miller Returns to Pramac Yamaha for 2025 MotoGP Season, Completing the Grid Line-Up

F12 hours ago

McLaren’s ‘Mini DRS’ Under FIA Scrutiny: Flexi-Wing Debate Reignited After Piastri’s Baku Triumph

Moto GP2 hours ago

**Title:** “2025 MotoGP Rider Market Shake-Up: The Biggest Losers and Missed Opportunities

F13 hours ago

Max Verstappen Criticizes FIA’s Radio Swear Ban: ‘Are We Five-Year-Olds?

Moto GP3 hours ago

Jack Miller Reflects on ‘Bleak’ Summer and Revels in Pramac Yamaha Deal for 2025 MotoGP Season

F13 hours ago

Mercedes Unveil Strategic Pit Lane Start for Hamilton in Baku Amid Anticipation of Major F1 Upgrades

Moto GP3 hours ago

Francesco Bagnaia Chooses Neutral Ground Amid Valentino Rossi and Marc Marquez Controversy

F14 hours ago

**Lewis Hamilton Condemns FIA President’s Swearing Clampdown Comments as Racially Insensitive**

Moto GP4 hours ago

Yamaha Confirms V4 Engine Development for MotoGP with Potential 2025 Debut

F14 hours ago

Resilient Hamilton Vows to ‘Give It Absolutely Everything’ After Azerbaijan Setback Ahead of Singapore GP

Moto GP4 hours ago

Fabio Quartararo Criticizes Yamaha’s Disorganized Test Team Amid Strategic Shifts and New Partnerships

F15 hours ago

New Audi F1 Contender Sparks Speculation as Bottas Stays Tight-Lipped on Future

Moto GP5 hours ago

Brad Binder Praises ‘Radical’ 2025 KTM MotoGP Prototype: ‘Quite Different’ to Current Model

F15 hours ago

Charles Leclerc Unveils Ferrari’s Internal Debate Over McLaren’s Controversial Rear Wing

Politics2 months ago

News Outlet Clears Sacked Welsh Minister in Leak Scandal Amidst Ongoing Political Turmoil

Moto GP4 months ago

Enea Bastianini’s Bold Stand Against MotoGP Penalties Sparks Debate: A Dive into the Controversial Catalan GP Decision

Sports4 months ago

Leclerc Conquers Monaco: Home Victory Breaks Personal Curse and Delivers Emotional Triumph

Moto GP4 months ago

Aleix Espargaro’s Valiant Battle in Catalunya: A Lion’s Heart Against Marc Marquez’s Precision

Moto GP4 months ago

Raul Fernandez Grapples with Rear Tyre Woes Despite Strong Performance at Catalunya MotoGP

Sports4 months ago

Verstappen Identifies Sole Positive Amidst Red Bull’s Monaco Struggles: A Weekend to Reflect and Improve

Moto GP4 months ago

Joan Mir’s Tough Ride in Catalunya: Honda’s New Engine Configuration Fails to Impress

Sports4 months ago

Leclerc Triumphs at Home: 2024 Monaco Grand Prix Round 8 Victory and Highlights

Sports4 months ago

Leclerc’s Monaco Triumph Cuts Verstappen’s Lead: F1 Championship Standings Shakeup After 2024 Monaco GP

Sports4 months ago

Perez Shaken and Surprised: Calls for Penalty After Dramatic Monaco Crash with Magnussen

Sports4 months ago

Gasly Condemns Ocon’s Aggressive Move in Monaco Clash: Team Harmony and Future Strategies at Stake

Business4 months ago

Driving Success: Mastering the Fast Lane of Vehicle Manufacturing, Automotive Sales, and Aftermarket Services

Cars & Concepts2 months ago

Chevrolet Unleashes American Powerhouse: The 2025 Corvette ZR1 with Over 1,000 HP

Business4 months ago

Shifting Gears for Success: Exploring the Future of the Automobile Industry through Vehicle Manufacturing, Sales, and Advanced Technologies

AI4 months ago

Revolutionizing the Future: How Leading AI Innovations Like DaVinci-AI.de and AI-AllCreator.com Are Redefining Industries

Business4 months ago

Driving Success in the Fast Lane: Mastering Market Trends, Technological Innovations, and Strategic Excellence in the Automobile Industry

Mobility Report4 months ago

**”SkyDrive’s Ascent: Suzuki Propels Japan’s Leading eVTOL Hope into the Global Air Mobility Arena”**

Tech4 months ago

Driving the Future: Exploring Top Innovations in Automotive Technology for Enhanced Safety, Efficiency, and Connectivity

V12 AI REVOLUTION COMMING SOON !

Get ready for a groundbreaking shift in the world of artificial intelligence as the V12 AI Revolution is on the horizon

SPORT NEWS

Business NEWS

Advertisement

POLITCS NEWS

Chatten Sie mit uns

Hallo! Wie kann ich Ihnen helfen?

Discover more from Automobilnews News - The first AI News Portal world wide

Subscribe now to keep reading and get access to the full archive.

Continue reading

×