Close Menu
    Facebook X (Twitter) Instagram
    TRENDING :
    • You can’t recall AI like a defective drug
    • Dollar General closed hundreds of locations after evaluating its store footprint. But there’s an upside
    • Bumble stock is up today. Whitney Wolfe Herd’s solution to ‘swipe fatigue’ might be part of the reason why
    • This new foldable phone may have upstaged Apple in the ‘zero-crease’ wars
    • The X algorithm really is trying to radicalize you—researchers just proved it
    • How silicone wristbands can help scientists monitor ‘forever chemicals’
    • The Pentagon–Anthropic clash is a warning for every enterprise AI buyer
    • Trump, London, Netanyahu, & Neocons
    Compatriot Chronicle
    • Home
    • US Politics
    • World Politics
    • Economy
    • Business
    • Headline News
    Compatriot Chronicle
    Home»Business»OpenAI’s GPT-5.3-Codex thinks deeper and wider about coding work
    Business

    OpenAI’s GPT-5.3-Codex thinks deeper and wider about coding work

    February 6, 20263 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
    Follow Us
    Google News Flipboard
    Share
    Facebook Twitter LinkedIn Pinterest Email

    On Thursday, OpenAI released GPT-5.3-Codex, a new model that extends its Codex coding agent beyond writing and reviewing code to performing a much wider range of work tasks. The release comes as competition continues to heat up among AI companies vying for market share in the AI-powered coding tools space.

    OpenAI says GPT-5.3 combines the coding performance of GPT-5.2-Codex with the reasoning and professional-knowledge capabilities of GPT-5.2, while running 25% faster. This allows GPT-5.3-Codex to handle long-running tasks that involve research, tool use such as web search or database calls, and complex execution and planning across both general work tasks and software development.

    Codex has reached over 1 million developers, OpenAI claims. And while Anthropic’s Claude Code has also seen rapid adoption, head-to-head data comparing the two tools remains scarce. SemiAnalysis reports that 4% of GitHub public commits, or new code uploaded to repositories, are currently being authored by Claude Code, and projects that figure could reach 20% or more by the end of 2026.

    Benchmark one-upsmanship

    OpenAI says GPT-5.3-Codex now has the best score of any model on SWE-Bench Pro, which evaluates real-world software engineering across four programming languages. The same is true for Terminal-Bench 2.0, which measures the terminal skills coding agents need.

    More significantly, the new model is capable of taking into account larger bodies of information while working on a task, as well as reasoning about those tasks for longer periods without human intervention. In testing, OpenAI says it observed GPT-5.3-Codex autonomously iterating on game development over millions of tokens using generic prompts like “fix the bug” or “improve the game.”

    Rival companies are making similar claims. Anthropic says its new Claude Opus 4.6 model, when powering Claude Code, can also comprehend larger code bases and make more thoughtful decisions about how to add new code. In a Thursday blog post, the company said Opus 4.6 achieved top scores on several industry benchmarks, including Humanity’s Last Exam, which measures complex multidisciplinary reasoning, GDPval-AA, which focuses on economically valuable knowledge work, and BrowseComp, which tests hard-to-find information search.

    Beyond coding to knowledge work

    OpenAI says GPT-5.3-Codex is built to support the full software lifecycle, including debugging, deploying, and monitoring code, as well as writing product requirement documents and conducting research. The same agentic capabilities can apply to tasks well outside software development, the company says, extending to work like creating slide decks and analyzing data in spreadsheets. (Anthropic has taken Claude Code in a similar direction, positioning it to support a broader pool of information workers across a wider range of business tasks.)

    On GDPval, an OpenAI evaluation measuring performance on well-specified knowledge-work tasks across 44 occupations, GPT-5.3-Codex matches GPT-5.2 while adding stronger coding capabilities. On OSWorld-Verified, which tests computer use in a visual desktop environment, GPT-5.3-Codex achieved 64.7% accuracy compared to 38.2% for its predecessor.

    GPT-5.3-Codex is the first model OpenAI classifies as “High capability” for cybersecurity-related tasks under its Preparedness Framework, and the first the company has directly trained to identify software vulnerabilities. OpenAI is committing $10 million in API credits to accelerate cyber defense, particularly for open source software and critical infrastructure systems.

    ChatGPT subscribers can use the GPT-5.3-Codex model as the brain for Codex while using the coding tool via the Codex app, the IDE (Integrated Development Environment) interface, or within the command line interface of their computer.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    You can’t recall AI like a defective drug

    March 12, 2026

    Dollar General closed hundreds of locations after evaluating its store footprint. But there’s an upside

    March 12, 2026

    Bumble stock is up today. Whitney Wolfe Herd’s solution to ‘swipe fatigue’ might be part of the reason why

    March 12, 2026
    Top News

    Why great leaders encourage people to do a career pivot

    By Staff WriterSeptember 26, 2025

    Early in my career, a boss encouraged me to leave a stable operations role for…

    Red Flag! | The Nation

    August 19, 2025

    Five organizational transformation killers

    October 28, 2025

    How one CEO’s counter-cultural movement became Yondr

    March 2, 2026
    Top Trending

    You can’t recall AI like a defective drug

    By Staff WriterMarch 12, 2026

    At a recent AI summit in New Delhi, Sam Altman warned that…

    Dollar General closed hundreds of locations after evaluating its store footprint. But there’s an upside

    By Staff WriterMarch 12, 2026

    Dollar General’s fourth-quarter and full-year 2026 earnings report shows some successes—though you…

    Bumble stock is up today. Whitney Wolfe Herd’s solution to ‘swipe fatigue’ might be part of the reason why

    By Staff WriterMarch 12, 2026

    Shares in Bumble Inc. (Nasdaq: BMBL), maker of the Bumble dating app,…

    Categories
    • Business
    • Economy
    • Headline News
    • Top News
    • US Politics
    • World Politics
    About us

    The Populist Bulletin serves as a beacon for the populist movement, which champions the interests of ordinary citizens over the agendas of the powerful and entrenched elitists. Rooted in the belief that the voices of everyday workers, families, and communities are often drowned out by powerful people and institutions, it delivers straightforward, unfiltered, compelling, relatable stories that resonate with the values of the American public.

    The Populist Bulletin was founded with a fervent commitment to inform, inspire, empower and spark meaningful conversations about the economy, business, politics, inequality, government accountability and overreach, globalization, and the preservation of American cultural heritage.

    The site offers a dynamic mix of investigative journalism, opinion editorials, and viral content that amplify populist sentiments and deliver stories that echo the concerns of everyday Americans while boldly challenging mainstream narratives that serve the privileged few.

    Top Picks

    You can’t recall AI like a defective drug

    March 12, 2026

    Dollar General closed hundreds of locations after evaluating its store footprint. But there’s an upside

    March 12, 2026

    Bumble stock is up today. Whitney Wolfe Herd’s solution to ‘swipe fatigue’ might be part of the reason why

    March 12, 2026
    Categories
    • Business
    • Economy
    • Headline News
    • Top News
    • US Politics
    • World Politics
    Copyright © 2025 Populist Bulletin. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.