Skip to main content

Posts

March 2025: All AI updates from the past month

Software companies are constantly trying to add more and more AI features to their platforms, and AI companies are constantly releasing new models and features.  Here are all the major AI updates we covered in the month of March. Google releases reasoning model Gemini 2.5, its “most intelligent AI model” yet Gemini 2.0 Flash Thinking was the company’s first reasoning model, and Gemini 2.5 builds on that with a better base model and improved post-training. In its announcement, Google revealed that all of its future AI models will have reasoning capabilities built in. The first Gemini 2.5 model is Gemini 2.5 Pro Experimental, and it leads in LMArena benchmarks over other reasoning models like OpenAI o3-mini, Claude 3.5 Sonnet, and DeepSeek R1. “Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy. In the field of AI, a system’s capacity for “reasoning” refers to more than j...

Mar 28, 2025: AI updates from the past week — Gemini 2.5, OpenAI 4o image generation, new reasoning agents from Microsoft, and more

Software companies are constantly trying to add more and more AI features to their platforms, and AI companies are constantly releasing new models and features. It can be hard to keep up with it all, so we’ve written this roundup to share several notable updates around AI that software developers should know about.  Google releases reasoning model Gemini 2.5, its “most intelligent AI model” yet Gemini 2.0 Flash Thinking was the company’s first reasoning model, and Gemini 2.5 builds on that with a better base model and improved post-training. In its announcement, Google revealed that all of its future AI models will have reasoning capabilities built in. The first Gemini 2.5 model is Gemini 2.5 Pro Experimental, and it leads in LMArena benchmarks over other reasoning models like OpenAI o3-mini, Claude 3.5 Sonnet, and DeepSeek R1. “Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved ...

Akamai launches new platform for AI inference at the edge

Akamai has announced the launch of Akamai Cloud Inference, a new solution that provides tools for developers to build and run AI applications at the edge. According to Akamai, bringing data workloads closer to end users with this tool can result in 3x better throughput and reduce latency up to 2.5x. “Training an LLM is like creating a map, requiring you to gather data, analyze terrain, and plot routes,” said Adam Karon, chief operating officer and general manager of the Cloud Technology Group at Akamai. “It’s slow and resource-intensive, but once built, it’s highly useful. AI inference is like using a GPS, instantly applying that knowledge, recalculating in real time, and adapting to changes to get you where you need to go. Inference is the next frontier for AI.” Akamai Cloud Inference offers a variety of compute types, from classic CPUs to GPUs to tailored ASIC VPUs. It offers integrations with Nvidia’s AI ecosystem, leveraging technologies such as Triton, TAO Toolkit, TensorRT, a...

Android team unveils upcoming enhancements to make Google Play safer

The Android development team is announcing several upcoming updates that will make it easier for app developers to secure their applications. According to the team, security is a priority, and over the last few years it has made several improvements to how security and privacy are managed in Google Play, and recent updates over the last few years have included enhanced tools to protect against fraud, pre-review checks to fix policy and compliance issues earlier in the development life cycle, and advanced AI-powered threat detection capabilities. Building on those, some of the upcoming enhancements the Android team will be making this year include: More pre-review checks New ways to help developers understand Google Play policies, better navigation in the Policy Center, and new features in Console and Android Studio to allow developers to fix issues before app submission.   Updates to the Play Integrity API that will better enable developers to deal with emerging threats ...

Google releases reasoning model Gemini 2.5, its “most intelligent AI model” yet

Google has announced the release of Gemini 2.5, which is a new reasoning model that the company claims is its “most intelligent AI model” yet. “Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy. In the field of AI, a system’s capacity for “reasoning” refers to more than just classification and prediction. It refers to its ability to analyze information, draw logical conclusions, incorporate context and nuance, and make informed decisions,” Koray Kavukcuoglu, CTO of Google DeepMind, wrote in a blog post .  Gemini 2.0 Flash Thinking was the company’s first reasoning model, and Gemini 2.5 builds on that with a better base model and improved post-training. In its announcement, Google revealed that all of its future AI models will have reasoning capabilities built in. RELATED CONTENT: Mar 21, 2025: AI updates from the past week — Anthropic web search, Gemini Canvas, new OpenA...

Instabug launches new observability features to connect business outcomes with app performance, user experience

The mobile observability company Instabug has announced new features that will help developers better monitor the user experience of their applications. The new features released today include: Frustration-Free Sessions , consolidating multiple frustration signals, like crashes, slow launches, and network failures, into a single metric to give development teams a clear way to measure and improve user experience Business Impact Dashboard , which connects app performance to business outcomes so companies can make data-driven decisions Prioritized Issues List , which ranks issues based on how they impact user frustration and business metrics “For years, mobile teams have relied on incomplete metrics that fail to capture the full user experience,” said Kenny Johnston, chief product officer at Instabug. “With the launch of Frustration-Free Sessions, Business Impact Dashboard, and Prioritized Issues List, we are giving teams the capabilities they need to bridge the gap between visib...

BrowserStack adds Private Devices offering to enabling testing across variety of secured devices

The testing company BrowserStack has announced a new offering to help organizations get access to different devices to test their applications on. The new offering, Private Devices , provides access to real devices that are secured in data centers, which enables organizations to test on those devices without needing to compromise on security or performance. “With Private Devices, we’re addressing the critical needs of enterprise customers who require both advanced security and testing flexibility. With this launch, we’re giving large enterprises the control and resources they need for secure, flexible, and efficient testing,” said Nakul Aggarwal, CTO of BrowserStack. Devices have guaranteed availability, meaning organizations won’t need to queue their tests.  Private Devices also offer persistent device setups that retain apps, accounts, and settings between sessions.  The devices have advanced functionality, such as settings access, native apps, and iCloud functionalit...

How to build a multi-agent orchestrator using Flink and Kafka

Just like some problems are too big for one person to solve, some tasks are too complex for a single AI agent. Instead, the best approach is to decompose problems into smaller, specialized units, where multiple agents work together as a team. This is the foundation of multi-agent systems. Networks of agents, each with specific roles, collaborating to solve larger problems. When building multi-agent systems, you need a way to coordinate how agents interact. If every agent talks to every other agent directly, things quickly become a tangled mess, making it hard to scale, and hard to debug. That’s where the orchestrator pattern comes in. Instead of agents making ad-hoc decisions about where to send messages, a central orchestrator acts as the parent node, deciding which agent should handle a given task based on context. The orchestrator takes in messages, interprets them, and routes them to the right agent at the right time. This makes the system dynamic, adaptable, and scalable. Thin...

Kagent: Bringing agentic AI to cloud native

Oh no! Your application is unreachable, buried under multiple connection hops. How do you pinpoint the broken link? How do you generate an alert or bug report from Prometheus when certain conditions are met?  You need to roll out a new version of your application. How do you execute a progressive rollout using Argo Rollouts? How do you safely enable zero trust network security when your application scales beyond a single cluster or cloud?  With so many projects in the cloud native ecosystem, how do you figure out which ones are right for your needs and layer them together with proper configuration management? Sound familiar? We hear these questions all the time from platform and DevOps engineers working with Cloud Native Computing Foundation (CNCF) projects like Kubernetes, Envoy, Istio, Prometheus, and Argo.  So why not build AI agents to tackle common challenges and support  engineers and customers? Why not create a catalog of AI agents for the cloud native eco...

Mar 21, 2025: AI updates from the past week — Anthropic web search, Gemini Canvas, new OpenAI audio models, and more

Software companies are constantly trying to add more and more AI features to their platforms, and AI companies are constantly releasing new models and features. It can be hard to keep up with it all, so we’ve written this roundup to share eight notable updates around AI that software developers should know about.  Anthropic’s Claude can now search the web when generating responses Anthropic has announced that Claude can now search the Internet, allowing it to generate more up-to-date and relevant responses. For instance, a developer who is getting an error updating a dependency in TypeScript 5.5 could ask Claude if there were any breaking changes between version 5.4 and 5.5 and also ask for recommended fixes. Claude will respond with direct citations of its web sources, allowing users to fact check the information. Google launches Canvas to enable easier collaboration with Gemini Google is making it easier for developers to collaborate with Gemini with the launch of Canva...