Tag: Programming & Development

Nous Research drops Hermes 4 AI models that outperform ChatGPT without content restrictions

Nous Research, a secretive artificial intelligence startup that has emerged as a leading voice in the open-source AI movement, quietly released Hermes 4 on Monday, a family of large language models that the company claims can match the performance of leading proprietary systems while offering unprecedented user control and minimal content restrictions. The release represents […]

Read More

Salesforce builds ‘flight simulator’ for AI agents as 95% of enterprise pilots fail to reach production

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Salesforce is betting that rigorous testing in simulated business environments will solve one of enterprise artificial intelligence’s biggest problems: agents that work in demonstrations but fail in the messy […]

Read More

Anthropic launches Claude for Chrome in limited beta, but prompt injection attacks remain a major concern

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Anthropic has begun testing a Chrome browser extension that allows its Claude AI assistant to take control of users’ web browsers, marking the company’s entry into an increasingly crowded […]

Read More

This website lets you blind-test GPT-5 vs. GPT-4o—and the results may surprise you

When OpenAI launched GPT-5 about two weeks ago, CEO Sam Altman promised it would be the company’s “smartest, fastest, most useful model yet.” Instead, the launch triggered one of the most contentious user revolts in the brief history of consumer AI. Now, a simple blind testing tool created by an anonymous developer is revealing the […]

Read More

Developers lose focus 1,200 times a day — how MCP could change that

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Software developers spend most of their time not writing code; recent industry research found that actual coding accounts for as little as 16% of developers’ working hours, with the […]

Read More

MIT report misunderstood: Shadow AI economy booms while headlines cry failure

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now The most widely cited statistic from a new MIT report has been deeply misunderstood. While headlines trumpet that “95% of generative AI pilots at companies are failing,” the report […]

Read More

Chan Zuckerberg Initiative’s rBio uses virtual cells to train AI, bypassing lab work

The Chan Zuckerberg Initiative announced Thursday the launch of rBio, the first artificial intelligence model trained to reason about cellular biology using virtual simulations rather than requiring expensive laboratory experiments — a breakthrough that could dramatically accelerate biomedical research and drug discovery. The reasoning model, detailed in a research paper published on bioRxiv, demonstrates a […]

Read More

TikTok parent company ByteDance releases new open source Seed-OSS-36B model with 512K token context

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now The company’s Seed Team of AI researchers today released Seed-OSS-36B on AI code sharing website Hugging Face. Seed-OSS-36B is new line of open source, large language models (LLM) designed […]

Read More

CodeSignal’s new AI tutoring app Cosmo wants to be the ‘Duolingo for job skills’

CodeSignal Inc., the San Francisco-based skills assessment platform trusted by Netflix, Meta, and Capital One, launched Cosmo on Wednesday, a mobile learning application that transforms spare minutes into career-ready skills through artificial intelligence-powered micro-courses. The app represents a strategic pivot for CodeSignal, which built its reputation assessing technical talent for major corporations but always harbored […]

Read More

DeepSeek V3.1 just dropped — and it might be the most powerful open AI yet

Chinese artificial intelligence startup DeepSeek made waves across the global AI community Tuesday with the quiet release of its most ambitious model yet — a 685-billion parameter system that challenges the dominance of American AI giants while reshaping the competitive landscape through open-source accessibility. The Hangzhou-based company, backed by High-Flyer Capital Management, uploaded DeepSeek V3.1 […]

Read More

TensorZero nabs $7.3M seed to solve the messy world of enterprise LLM development

TensorZero, a startup building open-source infrastructure for large language model applications, announced Monday it has raised $7.3 million in seed funding led by FirstMark, with participation from Bessemer Venture Partners, Bedrock, DRW, Coalition, and dozens of strategic angel investors. The funding comes as the 18-month-old company experiences explosive growth in the developer community. TensorZero’s open-source […]

Read More

That ‘cheap’ open-source AI model is actually burning through your compute budget

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A comprehensive new study has revealed that open-source artificial intelligence models consume significantly more computing resources than their closed-source competitors when performing identical tasks, potentially undermining their cost advantages […]

Read More

Anthropic takes on OpenAI and Google with new Claude AI features designed for students and developers

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Anthropic is launching new “learning modes” for its Claude AI assistant that transform the chatbot from an answer-dispensing tool into a teaching companion, as major technology companies race to […]

Read More

Claude can now process entire software projects in single request, Anthropic says

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Anthropic announced Tuesday that its Claude Sonnet 4 artificial intelligence model can now process up to 1 million tokens of context in a single request — a fivefold increase […]

Read More

Study warns of security risks as ‘OS agents’ gain control of computers and phones

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Researchers have published the most comprehensive survey to date of so-called “OS Agents” — artificial intelligence systems that can autonomously control computers, mobile phones and web browsers by directly […]

Read More

Anthropic revenue tied to two customers as AI pricing war threatens margins

Anthropic’s meteoric rise to a $5 billion revenue run rate conceals a precarious dependence on just two major customers that account for nearly a quarter of the artificial intelligence company’s income, according to internal data and industry analysis that reveals both the promise and peril of the AI coding boom. The San Francisco-based maker of […]

Read More

The initial reactions to OpenAI’s landmark open source gpt-oss models are highly varied and mixed

But despite achieving technical benchmarks on par with OpenAI’s other powerful proprietary AI model offerings, the broader AI developer and user community’s initial response has so far been all over the map. If this release were a movie premiering and being graded on Rotten Tomatoes, we’d be looking at a near 50% split, based on […]

Read More

Anthropic ships automated security reviews for Claude Code as AI-generated vulnerabilities surge

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Anthropic launched automated security review capabilities for its Claude Code platform on Wednesday, introducing tools that can scan code for vulnerabilities and suggest fixes as artificial intelligence dramatically accelerates […]

Read More

Anthropic’s new Claude 4.1 dominates coding tests days before GPT-5 arrives

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Anthropic released an upgraded version of its flagship artificial intelligence model Monday, achieving new performance heights in software engineering tasks as the AI startup races to maintain its dominance […]

Read More

OpenAI returns to open source roots with new models gpt-oss-120b and gpt-oss-20b 

OpenAI is getting back to its roots as an open source AI company with today’s announcement and release of two new, open source, frontier large language models (LLMs): gpt-oss-120b and gpt-oss-20b. The former is a 120-billion parameter model as the name would suggest, capable of running on a single Nvidia H100 graphics processing unit (GPU) […]

Read More

ChatGPT rockets to 700M weekly users ahead of GPT-5 launch with reasoning superpowers

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now OpenAI’s ChatGPT will reach 700 million weekly active users this week, the company announced Monday, cementing its position as one of the fastest-adopted software products in history just as […]

Read More

Qwen-Image is a powerful, open source new AI image generator with support for embedded text in English & Chinese

After seizing the summer with a blitz of powerful, freely available new open source language and coding focused AI models that matched or in some cases bested closed-source/proprietary U.S. rivals, Alibaba’s crack “Qwen Team” of AI researchers is back again today with the release of a highly ranked new AI image generator model — also […]

Read More

Why tomorrow’s best devs won’t just code — they’ll curate, coordinate and command AI

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now As AI continues to take on more and more new competencies, junior coding, as we knew it, is rapidly becoming a thing of the past. Tasks that used to […]

Read More

OpenAI removes ChatGPT feature after private conversations leak to Google search

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now OpenAI made a rare about-face Thursday, abruptly discontinuing a feature that allowed ChatGPT users to make their conversations discoverable through Google and other search engines. The decision came within […]

Read More

Deep Cogito goes big, releasing 4 new open source hybrid reasoning models with self-improving ‘intuition’

Deep Cogito, a lesser-known AI research startup based in San Francisco founded by ex-Googlers, has released four new open-ish large language models (LLMs) that attempt something few others do: Learning how to reason more effectively over time — and get better at it on their own. The models, released as part of Cogito’s v2 family, […]

Read More

Hard-won vibe coding insights: Mailchimp’s 40% speed gain came with governance price

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Intuit Mailchimp provides email marketing and automation capabilities. It’s part of the larger Intuit organization, which has been on a steady journey with gen AI over the last several […]

Read More

Runloop lands $7M to power AI coding agents with cloud-based devboxes

Runloop, a San Francisco-based infrastructure startup, has raised $7 million in seed funding to address what its founders call the “production gap” — the critical challenge of deploying AI coding agents beyond experimental prototypes into real-world enterprise environments. The funding round, led by The General Partnership with participation from Blank Ventures, comes as the artificial […]

Read More

Nightfall launches ‘Nyx,’ an AI that automates data loss prevention at enterprise scale

Nightfall AI launched the industry’s first autonomous data loss prevention platform Wednesday, introducing an AI agent that automatically investigates security incidents and tunes policies without human intervention — a breakthrough that could reshape how enterprises protect sensitive information in an era of expanding cyber threats. The San Francisco-based startup’s new platform, called Nightfall Nyx, represents […]

Read More

AI vs. AI: Prophet Security raises $30M to replace human analysts with autonomous defenders

Prophet Security, a startup developing autonomous artificial intelligence systems for cybersecurity defense, announced Tuesday it has raised $30 million in Series A funding to accelerate what its founders describe as a fundamental shift from human-versus-human to “agent-versus-agent” warfare in cybersecurity. The Menlo Park-based company’s funding round, led by venture capital firm Accel with participation from […]

Read More

ChatGPT just got smarter: OpenAI’s Study Mode helps students learn step-by-step

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now OpenAI announced Study Mode for ChatGPT on Tuesday, a new feature that fundamentally changes how students interact with artificial intelligence by withholding direct answers in favor of Socratic questioning […]

Read More

Stack Overflow data reveals the hidden productivity tax of ‘almost right’ AI code

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now More developers than ever before are using AI tools to both assist and generate code. While enterprise AI adoption accelerates, new data from Stack Overflow’s 2025 Developer Survey exposes […]

Read More