Posts

The leadership principles behind high-performing AI engineering teams

Managing large AI teams today is less like running a traditional engineering organization and more like conducting an orchestra while the music is still being written. Leaders must balance speed, experimentation, risk, and coordination across disciplines that operate at very different tempos. Data scientists optimize for discovery, engineers for reliability and efficiency, security and legal teams for constraint, and leadership ultimately for outcomes. When AI teams are managed using the same structures and decision-making patterns as conventional software teams, friction shows up quickly. The leaders who succeed are those who intentionally redesign structure, alignment, and authority to reflect how AI systems are actually built, deployed, and evolved in practice. A critical starting point is clarity around what an AI system is optimizing for, along with the guardrails that prevent unintended tradeoffs. In practice, AI systems rarely behave uniformly. Performance often varies across u...

Apiiro’s Guardian Agent, Ai2’s Open Coding Agents, and more — Daily News Digest

Apiiro announces Guardian Agent

Guardian Agent rewrites developer prompts to make them more secure and ensure they meet current needs of the software architecture, runtime environments, organizational policies, and regulatory requirements. According to the company, because of AI, security debt is being added faster than it can be fixed, and asking developers to fix vulnerabilities after code is written is no longer sufficient. “The reality is clear: Detection will never scale at the speed of AI. Only prevention will,” the company wrote in a blog post.

Ai2 releases Open Coding Agents

Open Coding Agents are a family of open agents that utilize a training method that makes it easier for developers to build their own coding agent trained on their internal codebases. The first release is SERA (Soft-verified Efficient Repository Agent), which uses a fine-tuning method that can be specialized to any codebase. The company is also releasing SERA’s training data to help researchers study w...

OpenSilver 3.3: Blazor Components Now Run Directly Inside XAML Applications

Userware today released OpenSilver 3.3, introducing native integration between XAML and Blazor. Developers can now embed Blazor components from libraries like DevExpress, Syncfusion, MudBlazor, Radzen, and Blazorise directly inside XAML applications, with no JavaScript bridges or performance overhead. OpenSilver is an open-source framework that runs WPF-style C# and XAML applications in web browsers via WebAssembly. It supports deployment to iOS, Android, Windows, macOS, and Linux through .NET MAUI Hybrid. With version 3.3, developers can also leverage the Blazor component ecosystem while keeping XAML as their primary UI technology. “Blazor has an incredible component ecosystem. XAML has a powerful layout and binding system that developers love,” said Giovanni Albani, CEO of Userware. “With 3.3, you don’t have to choose. Use XAML where it excels, drop in Blazor components where you need them. Your ViewModels and architecture stay the same.”

How It Works

Because OpenSilver rende...

Teleport tackles agentic trust with new Agentic Identity Framework

Teleport has announced the launch of its new Agentic Identity Framework, which defines policies, practices, developer tools, and a reference architecture for securely deploying agents in production. According to the company, agentic AI introduces new security challenges, as agents invoke tools, access sensitive data, delegate tasks, and operate across environments at scale, all without human involvement. Teleport says current identity, access, and security models weren’t designed for non-deterministic systems, and that attempts so far at deploying agentic systems have led to identity fragmentation, secrets sprawl, limited visibility, and systemic risk. The Agentic Identity Framework attempts to solve these issues by establishing an identity layer that is secured cryptographically with a hardware root of trust. It enables zero trust authentication, zero standing privileges, and real-time visibility into identity behavior. “A unified identity layer is a prerequisite to deploying AI with...

Anthropic makes tools within Claude interactive

Anthropic has announced that users will now be able to directly interact with certain tools within Claude. Claude already had the ability to connect to tools and take action on a user’s behalf in those tools, so what’s new today is the ability for the user to actually go in and interact with those tools directly in the Claude window. The tools and capabilities that are now supported include:

Amplitude: Build analytics charts, explore trends, and adjust parameters
Asana: Create projects, tasks, and timelines from chats
Box: Search for files, preview documents inline, and extract insights and ask questions about content
Canva: Create presentation outlines and customize branding and designs
Clay: Research companies, find contact info, get information like company size and funding, and draft personalized outreach
Figma: Turn text and images into flow charts, Gantt charts, and other visual diagrams
Hex: Ask data questions and get answers containing interactive charts, tables, and ...

This week in AI updates: GitHub Copilot SDK, Claude’s new constitution, and more (January 23, 2026)

GitHub Copilot SDK now in technical preview

The SDK allows developers to embed agentic capabilities into their applications using the same execution loop used by the GitHub Copilot CLI. The SDK repository includes setup instructions, starter examples, and SDK references for all of the supported languages. GitHub recommends starting by defining a single task, such as updating files or running a command, and letting Copilot plan and execute steps while the application supplies domain-specific tools and constraints.

Anthropic drafts new constitution for Claude models

The constitution is Anthropic’s vision for Claude’s values and behavior. The main sections in this updated version include specifications related to helpfulness, ethics, safety, nature, and guidelines for how to handle specific issues, like medical advice or cybersecurity requests. “The constitution is a crucial part of our model training process, and its content directly shapes Claude’s behavior. Training models is a ...

New Relic adds monitoring for ChatGPT apps

New Relic customers will now be able to monitor their custom ChatGPT apps to ensure they’re delivering the intended performance, reliability, and user experience. “Bringing business services into the natural flow of a ChatGPT conversation is a powerful, intuitive, and revenue-generating strategy,” said Brian Emerson, chief product officer of New Relic. “But once your carefully crafted application instantiates inside ChatGPT, it traditionally enters a black box where standard browser monitoring tools can fail.” The company went on to explain that when an app is rendered in a conversation, developers can’t see things like layout shifts or broken buttons. Additionally, security headers, content security policies, iframe sandbox rules, and limitations on client-side storage can hide important performance and user experience data. New Relic’s answer to this problem is to send in an agent that can collect and analyze data. It can track PageViews, PageViewtimings, and AjaxRequests, provi...

Testlio launches new AI-powered QA analysis solution

Testlio has announced the release of a new AI-driven QA analysis solution called LeoInsights. The new platform is powered by the company’s intelligence layer LeoAI Engine, which was trained on 13 years of testing data, 2.6+ million test cases, and 600,000+ devices. It can provide executive summaries featuring key changes, emerging risks, and critical issues, simplifying multiple QA reports into one that can be shared with leaders. LeoInsights also offers a value calculator that quantifies efficiency gains, cost savings, and quality impact, helping QA teams better demonstrate their value to leadership. The calculator can aggregate data across workspaces, do scenario modeling with adjustable inputs, and generate PDFs that can be shared with executives for budgeting and investment discussions. The tool can also provide alerts when unusual trends and anomalies are spotted, helping QA teams discover risks that they might not have otherwise noticed. It also provides app review and senti...

Codenotary updates its free SBOM scanning tool with capabilities that better support AI apps

Codenotary is adding new capabilities to its SBOM.sh service, which provides free analysis of software bills of materials (SBOMs). According to the company, the updates were made in consideration of AI applications, and the tool now treats datasets as software supply chain artifacts. “Traditional SBOM tools were built for an earlier era – focusing primarily on source code to improve visibility into the software supply chain,” said Moshe Bar, CEO and co-founder of Codenotary. “Security teams are swimming in SBOMs, but they’re not getting the actionable clarity they need — especially as AI transforms software with AI applications are built on datasets which are entirely ignored by traditional SBOMs.” It now provides documentation of dataset sources, licensing terms, and governance controls, which helps organizations be more audit-ready. SBOM.sh also now captures lineage metadata, such as base-model origins, fine-tuning history, version identifiers, and update pathways. Additionally...

AI lacks theory of mind – why that matters

A lot of interest has been garnered by large language models (LLMs) and their abilities, but there’s one ability that remains solely human. We don’t share it with mammals or machines. That ability is called “theory of mind,” and it’s the mind-reading ability that allows us to coordinate and collaborate with others. Mind reading sounds like the power of some superhero or supervillain. However, the truth is that even babies do it. We learn to predict what others are thinking and how they’ll react. Babies learn this skill around three years of age. They begin to recognize what others do – and don’t – know. While comic books make the superpower sound like reading every thought and every memory, our everyday human power of mind reading is limited to awareness, lack of awareness, and simple prediction. As adults, this power allows us to do things like joint cooperation (animals can only use parallel cooperation) – and has allowed us to become the d...

GitLab’s Duo Agent Platform is now generally available

GitLab has made its Duo Agent Platform generally available, providing development teams with agentic AI automation that has access to an organization’s full context, standards, and guardrails. The GA release includes Agentic Chat, providing context-aware assistance throughout the GitLab platform. Agentic Chat builds on the previously released Duo Chat, brings in context from issues, merge requests, pipelines, security findings, and more, and can perform actions on a developer’s behalf. For example, in the Web UI, Agentic Chat can create issues, epics, and merge requests, highlight key findings, and create actionable guidance based on organizational context. Additionally, in the IDE, it can generate code, configurations, and infrastructure-as-code, as well as fix bugs, generate tests, and produce documentation. Agentic Chat can also help developers understand, configure, or troubleshoot CI/CD pipelines or create new ones, and on the security front, it can exp...

ScyllaDB Releases Integrated Vector Search: 1B Vectors with 2ms P99s and 250K QPS Throughput

ScyllaDB today announced the general availability of its new Vector Search capability, which is integrated into ScyllaDB X Cloud. This high-performance vector search supports the industry’s largest models with low TCO. ScyllaDB is commonly used for real-time AI workloads such as latency-sensitive machine learning, predictive analytics, and fraud detection. It is trusted by high-growth companies such as Tripadvisor, ShareChat, and Freshworks to power large-scale latency-sensitive feature stores. As ScyllaDB’s customers began adopting vector search, many found standalone vector databases to be overly complex and costly at scale. In response, ScyllaDB added Vector Search to its ScyllaDB Cloud offering. ScyllaDB Vector Search is built on ScyllaDB’s shard-per-core architecture with a Rust-based extension that leverages USearch, the industry-standard ANN search library. The architecture separates storage and indexing responsibilities while keeping the system unified from the user’s perspe...

MetalBear launches mirrord for CI to improve testing process for cloud native apps

MetalBear is launching a new tool that allows development teams to run CI tests against Kubernetes environments without needing to deploy code to them or spin up test environments. According to MetalBear, testing cloud native applications can be difficult because a change made to a single service requires other services to be tested to see how the change behaves. This is typically accomplished by spinning up new cloud environments or using local Kubernetes tools. Spinning up new environments, however, can take 20-30 minutes, increase cloud costs, and add ongoing maintenance, and local tools have their own drawbacks because local clusters don’t always behave like real ones. Mirrord for CI aims to address these concerns by securely connecting a runner to an existing Kubernetes cluster, and then running a test suite with real services, dependencies, and traffic, enabling development teams to test against real conditions. “Your code, i.e. the microservice in the branch you want to merge, runs in t...

Report: Companies with technical debt unlikely to see benefits from AI adoption

Organizations that have modernized their applications are three times more likely to see a clear ROI on their AI investments compared to those that haven’t, according to a new survey from Cloudflare. The 2026 App Innovation Report found that 93% of leaders believe that updating their software was “the single most important factor in boosting their AI capabilities.” Organizations that have fallen behind on their modernization efforts report being 85% less confident in their infrastructure. Those who fall into that camp often only modernize reactively after a security breach happens. Additionally, companies that align security with modernization are four times more likely to reach advanced AI maturity. “If you aren’t modernizing your business to embrace AI and prevent the next wave of cyberattacks, you aren’t just standing still, you’re rapidly falling behind. The winners of this era of the Internet will ultimately be defined by their infrastructure,” said Matthew Prince, CEO and co...

Chainguard adds 10 new projects to EmeritOSS program for prolonging the life of open source tools

Chainguard is adding 10 new open source projects to EmeritOSS, its program for supporting mature open source projects that don’t require continuous upkeep or whose maintainers need to step away. “EmeritOSS exists for the projects that have earned their stripes. They’ve shipped, scaled, and supported real systems, and while their maintainers may be ready to step back, the software itself still has plenty of life left. EmeritOSS provides continuity-focused stewardship for mature projects by maintaining public, non-competitive forks, addressing security issues through dependency updates and releases, and clearly documenting support boundaries,” Chainguard wrote in a blog post. EmeritOSS first launched in December with three starting projects: Kaniko, Kubeapps, and ingress-nginx. The 10 new projects that are being added span object storage, monitoring, data processing, backup integrations, and observability. They include: MinIO Prometheus PushProx Cassandra Exporter Prometheus ...