
Enhancing AI Training: OpenAI’s New Strategy to Combine Human Insight with AI Assistance

Published June 28, 2024


Will Knight

OpenAI Advocates for AI Assistance in Training AI Models

One of the key ingredients in ChatGPT's remarkable success was the army of human trainers who guided the AI model behind the chatbot, teaching it to distinguish acceptable responses from unacceptable ones. OpenAI now says that adding more AI to the mix, to assist those human trainers, could help make AI assistants smarter and more reliable.

In developing ChatGPT, OpenAI pioneered the use of reinforcement learning from human feedback (RLHF). The technique uses ratings from human testers to fine-tune an AI model so that its output is judged to be more coherent, less objectionable, and more accurate. The testers' ratings are fed back into training, shaping how the model behaves. The approach has proven crucial both to making chatbots more useful and reliable and to preventing them from misbehaving.

“RLHF demonstrates strong performance, yet it's not without its significant drawbacks,” says Nat McAleese, a researcher at OpenAI. For one thing, human feedback can be inconsistent. For another, even skilled humans can struggle to rate highly complex outputs, such as sophisticated software code. The process can also push a model to produce output that seems convincing on the surface but is not actually accurate.

OpenAI fine-tuned its most powerful model, GPT-4, to create a specialized version that helps human trainers review code. The new model, named CriticGPT, caught bugs that humans missed, and human judges rated its critiques of code as better 63 percent of the time. OpenAI plans to explore applying the approach to domains beyond coding.

McAleese says OpenAI is beginning to incorporate the technique into its RLHF chat framework. He acknowledges that the approach is imperfect, since CriticGPT can itself make mistakes by hallucinating. Still, he believes it could make OpenAI’s models, including ChatGPT, more accurate by reducing errors in human training. He adds that it could also prove pivotal in helping AI models become much smarter, and may allow humans to train an AI that exceeds their own abilities. As the models keep improving, McAleese says, he expects that people will need more help.

The new technique is one of many now under development to improve large language models and get more capability out of them. It is also part of a broader effort to ensure that AI behaves acceptably as it becomes more capable.

Earlier this month, Anthropic, an OpenAI rival founded by former OpenAI employees, announced a more capable version of its own chatbot, Claude. The company attributed the gains to improvements in its training regimen and in the data the model is fed. Anthropic and OpenAI have also both recently touted new ways of inspecting AI models to understand how they arrive at their output, with the goal of better preventing unwanted behavior such as deception.

The new technique might help OpenAI train increasingly powerful AI models while making their output more trustworthy and better aligned with human values, especially if the company succeeds in applying it beyond code. OpenAI has said that it is training its next major AI model, and the company is evidently keen to show that it is serious about ensuring the technology behaves. This follows the dissolution of a prominent team dedicated to assessing the long-term risks posed by AI. The team was co-led by Ilya Sutskever, an OpenAI cofounder and former board member who briefly pushed CEO Sam Altman out of the company before recanting and helping him regain control. Several former members of that team have since publicly criticized the company, arguing that it is moving recklessly fast in its rush to develop and commercialize powerful AI.

Dylan Hadfield-Menell, an MIT professor who researches ways to align AI, says the idea of having AI models help train more powerful ones has been around for some time. "It's a fairly logical progression," he says.

Hadfield-Menell notes that the researchers who originally developed the techniques used for RLHF discussed related ideas several years ago. He says it remains to be seen how generally applicable and powerful the approach is. "This could result in significant improvements in specific areas, and it could also act as a precursor to more efficient feedback mechanisms in the future," he says.


© 2024 Condé Nast. All rights reserved. WIRED may earn a portion of sales from products purchased through its site as part of its Affiliate Partnerships with retailers. The material on this site may not be reproduced, distributed, transmitted, cached, or otherwise used without the prior written permission of Condé Nast.


