    Compatriot Chronicle

    OpenAI’s GPT-5.3-Codex thinks deeper and wider about coding work

    February 6, 2026

    On Thursday, OpenAI released GPT-5.3-Codex, a new model that extends its Codex coding agent beyond writing and reviewing code to performing a much wider range of work tasks. The release comes as competition continues to heat up among AI companies vying for market share in the AI-powered coding tools space.

    OpenAI says GPT-5.3-Codex combines the coding performance of GPT-5.2-Codex with the reasoning and professional-knowledge capabilities of GPT-5.2, while running 25% faster. This allows GPT-5.3-Codex to handle long-running tasks that involve research, tool use such as web search or database calls, and complex execution and planning across both general work tasks and software development.

    Codex has reached over 1 million developers, OpenAI claims. And while Anthropic’s Claude Code has also seen rapid adoption, head-to-head data comparing the two tools remains scarce. SemiAnalysis reports that 4% of GitHub public commits, or new code uploaded to repositories, are currently being authored by Claude Code, and projects that figure could reach 20% or more by the end of 2026.

    Benchmark one-upmanship

    OpenAI says GPT-5.3-Codex now has the best score of any model on SWE-Bench Pro, which evaluates real-world software engineering across four programming languages. The same is true for Terminal-Bench 2.0, which measures the terminal skills coding agents need.

    More significantly, the new model can take larger bodies of information into account while working on a task and can reason about that task for longer periods without human intervention. In testing, OpenAI says it observed GPT-5.3-Codex autonomously iterating on game development over millions of tokens using generic prompts like “fix the bug” or “improve the game.”

    Rival companies are making similar claims. Anthropic says its new Claude Opus 4.6 model, when powering Claude Code, can also comprehend larger code bases and make more thoughtful decisions about how to add new code. In a Thursday blog post, the company said Opus 4.6 achieved top scores on several industry benchmarks, including Humanity’s Last Exam, which measures complex multidisciplinary reasoning, GDPval-AA, which focuses on economically valuable knowledge work, and BrowseComp, which tests hard-to-find information search.

    Beyond coding to knowledge work

    OpenAI says GPT-5.3-Codex is built to support the full software lifecycle, including debugging, deploying, and monitoring code, as well as writing product requirement documents and conducting research. The same agentic capabilities can apply to tasks well outside software development, the company says, extending to work like creating slide decks and analyzing data in spreadsheets. (Anthropic has taken Claude Code in a similar direction, positioning it to support a broader pool of information workers across a wider range of business tasks.)

    On GDPval, an OpenAI evaluation measuring performance on well-specified knowledge-work tasks across 44 occupations, GPT-5.3-Codex matches GPT-5.2 while adding stronger coding capabilities. On OSWorld-Verified, which tests computer use in a visual desktop environment, GPT-5.3-Codex achieved 64.7% accuracy compared to 38.2% for its predecessor.

    GPT-5.3-Codex is the first model OpenAI classifies as “High capability” for cybersecurity-related tasks under its Preparedness Framework, and the first the company has directly trained to identify software vulnerabilities. OpenAI is committing $10 million in API credits to accelerate cyber defense, particularly for open source software and critical infrastructure systems.

    ChatGPT subscribers can select GPT-5.3-Codex as the model behind Codex, whether they use the coding tool through the Codex app, the IDE (integrated development environment) interface, or the command-line interface.


