News
Generative AI LLM Official BlogMistral AI's Le Chat Enterprise and Mistral OCR 25.05 model are available on Google Cloud - Announcing Mistral AI’s Le Chat Enterprise and Mistral OCR 25.05 model are available on Google Cloud.
Generative AI LLM Official BlogAnnouncing Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI - Today, we're announcing the expansion of our Model Garden collection with the addition of two new models from Anthropic: Claude Opus 4 and Claude Sonnet 4.
BigQuery Data Analytics Official Blog StreamingFast, approximate analytics at scale: Apache DataSketches available in BigQuery - Apache DataSketches in BigQuery enable approximate analytics with minimal memory or computational overhead, and with a single pass through the data.
BigQuery Data Analytics Official BlogIntroducing AI.GENERATE_TABLE: creating structured data from gen AI models in BigQuery - AI.GENERATE_TABLE() in BigQuery lets you convert unstructured data into a structured table based on the provided prompt and table schema.
Official Blog TelecommunicationsThe AI-driven telecom: A new era of network transformation - Learn the current landscape of transformation needs for CSP’s and how Google Cloud is the right partner for AI-driven network transformation.
Gemini Official BlogGemini 2.5 Flash and Pro expand on Vertex AI to drive more sophisticated and secure AI innovation - Today, at Google I/O 2025, we're announcing that Gemini 2.5 Flash is now generally available on Vertex AI and Google AI Studio and Gemini 2.5 Pro will be available soon.
Official Blog Vertex AIExpanding Vertex AI with the next wave of generative AI media models - The next wave of generative media models on Vertex AI: Veo 3, Imagen 4, and Lyria 2. Take a look into each model and ways you can get started today.
Machine Learning Official BlogGoogle AI Edge Portal: On-device machine learning testing at scale - Announcing AI Edge Portal in private preview, a new way to benchmark your LiteRT models on real physical devices so you can find the best configuration to deploy ML models in your app.
Generative AI Java Official BlogGoogle Cloud and Spring AI 1.0 - Spring AI 1.0 simplifies AI integration into Java and Spring Boot applications, supporting various AI models like image, transcription, and chat. It leverages familiar Spring abstractions and offers retrieval augmented generation (RAG) for enhanced accuracy.
AI Hypercomputer LLM Official BlogIntroducing the next generation of AI inference, powered by llm-d - We’re making inference easier and more cost-effective with llm-d, an open-source, Kubernetes-native, distributed and disaggregated inference platform.
Articles, Tutorials
Infrastructure, Networking, Security, Kubernetes
Confidential Computing Official BlogHow Confidential Computing lays the foundation for trusted AI - Confidential Computing has redefined how organizations can securely process their most sensitive data in the cloud. Here’s what’s new.
CISO Official Blog SecurityCloud CISO Perspectives: How Google Cloud’s security team helps build securely - Peek behind the curtain to see how Google Cloud approaches security engineering, and how we take secure by design from mindset to production.
Google Cloud Platform Official BlogAdvancing sovereignty, choice, and security in the cloud for our customers - We’re introducing the next phase of our digital sovereignty journey, and how our customers can achieve greater control, choice, and security in the cloud.
BigQuery FinOps KubernetesOptimizing GKE Cost Visibility with Kubecost and BigQuery Integration - The article provides a guide on mastering GKE cost analysis using Kubecost and GCP billing data exported to BigQuery. It outlines how to enable cost allocation for GKE clusters, set up billing export to BigQuery, install and configure Kubecost, and integrate GCP for out-of-cluster resource attribution, enabling detailed Kubernetes spend monitoring and optimization.
DevOps Generative AI Kuberneteskubectl-ai: AI-Driven Kubernetes Management Solution - kubectl-ai, an open-source tool from Google Cloud, simplifies Kubernetes management by translating natural language commands into `kubectl` commands, increasing productivity and troubleshooting.
App Development, Serverless, Databases, DevOps
AlloyDB Databases Official BlogA deep dive into AlloyDB’s vector search enhancements - Recent innovations in AlloyDB AI’s ScaNN index improve performance and quality of search over structured and unstructured data.
Cloud Run Official BlogAI deployment made easy: Deploy your app to Cloud Run from AI Studio or MCP-compatible AI agents - Deploy Gemma or your app to Cloud Run from AI Studio or any MCP client.
AI Official Blog Partners SAPSAP & Google Cloud: Enabling faster value and smarter innovation for business excellence - At SAP Sapphire, we announced a variety of ways to access and analyze SAP data with Google tools and AI, plus infrastructure and security updates.
Cloud Run NodeJSOptimizing Memory Utilization in Node.js: A Guide for Cloud Environments - The article discusses optimizing Node.js memory utilization in Google Cloud Run to prevent autoscaling failures.
AlloyDB PaywallSeamless Transition: Minimizing Downtime in PostgreSQL to AlloyDB Migrations - The article discusses a strategy for migrating PostgreSQL databases, specifically from Amazon RDS, to Google Cloud's AlloyDB with minimal downtime.
AlloyDB DataplexDataplex AlloyDB Integration - This article provides a solution for automating the discovery of AlloyDB metadata into Dataplex, since Dataplex lacks direct integration.
Cloud Identity Aware Proxy Cloud Run SecurityUsing Google Identity-Aware Proxy (IAP) with Cloud Run — Without a Load Balancer! - Enforce user authentication to your Cloud Run service, without needing a load balancer!
Firebase NodeJSDeploy Each NestJS Module as a Separate Firebase Function. NestFire - NestFire is an npm package that simplifies deploying NestJS modules as separate Firebase Functions. It addresses the challenges of structuring and maintaining Firebase Functions by leveraging NestJS's modular architecture.
Generative AI Monitoring PythonSee Inside Your GenAI: Mastering Tracing & Observability in GCP for GenAI FastAPI Apps - The “Why”: Challenges of Observability in GenAI Applications.
Big Data, Analytics, ML&AI
Machine Learning Official Blog PyTorchTrain AI for less: Improve ML Goodput with elastic training and optimized checkpointing - Elastic training improves ML Goodput of AI training by minimizing disruptive events, while optimizing checkpoints can reduce workload interruptions.
AI GCP Experience Official BlogOviva builds AI meal logging app with Google Cloud - Oviva uses Google Cloud's Gemini and Vertex AI to build an AI meal logging app, providing personalized diet feedback and simplifying user experience.
BigQueryData to Value Tools - An Open-Source Google BigQuery Toolkit for Analytics Engineers.
BigQuery Data Science PythonGenerate Structured Output with Gemini in BigFrames - Generate Structured Output with Gemini in BigFrames, an open source Python library offered by Google.
ADK BigQuery Vertex AIAgent Development Kit — NL2SQL with Bigquery - The article introduces the Agent Development Kit (ADK), an open-source framework for developing and deploying AI Agents, and demonstrates its use in implementing NL2SQL with Bigquery.
Machine Learning Vertex AICombining Googles Live API with a User Interface - From Local Python script to Full Application: Integrating Frontend, Backend, and WebSockets.
ADK Generative AI PaywallCreating a Jira Agent in Google ADK - Using Google ADK to create Jira Agent.
ADK AI ColabBuilding a Multi-Agent Weather AI Assistant with Google ADK - This article details how to define agents, manage session state, implement safety callbacks, and build a team of agents that delegate tasks, demonstrated through the creation of a weather AI assistant.
BigQuery FinOps GeminiAnalyze Gemini Code Assist usage with BigQuery - Analyze AI coding assistance adoption and impact for your team with Gemini Code Assist metadata logs and BigQuery.
Various
AI Machine Learning PaywallThe Project that got me a handshake with Google Cloud’s CEO - Abish Pius recounts how participating in the Google Cloud MLB Hackathon and building a project with Gemini models led to meeting Google Cloud's CEO, Thomas Kurian, at Google Cloud Next 2025.
Slides, Videos, Audio
Security Podcast - #226 AI Supply Chain Security: Old Lessons, New Poisons, and Agentic Dreams.
GCP Bytes Podcast - #17 - In the episode we discuss; Network Certification, Gen AI Certification, MS Security Vulnerabilities, Broadcom Cease and Desist, Micron21, Cyber Insurance, Google Bonuses, Cloud Run and IAP, Power Deal, Rapid Storage, Android Security Updates, Chrome Security, Google IO.
Releases
Gemini - Gemini Code Assist now uses Gemini 2.5. Gemini Cloud Assist now supports asking prompts about your Cloud Monitoring alerts. Gemini Cloud Assist now supports asking prompts about vulnerabilities detected by Artifact Analysis. Gemini Cloud Assist can now test the organization policies that you generate with Gemini Cloud Assist directly in the Cloud Assist chat.
Google Kubernetes Engine - (2025-R20) Version updates GKE cluster versions have been updated. In the Google Cloud console, the GKE security posture dashboard now uses Security Command Center to show the top threats that affect your GKE workloads. The May 13, 2025 issue in which GKE Autopilot clusters failed to update the cgroup_mode field is fixed in all GKE versions. In GKE version 1.32.3-gke.1927002 and later, GKE uses a container-optimized compute platform for the general-purpose Autopilot compute class.
GKE new features - In the Google Cloud console, the GKE security posture dashboard now uses Security Command Center to show the top threats that affect your GKE workloads. In GKE version 1.32.3-gke.1927002 and later, GKE uses a container-optimized compute platform for the general-purpose Autopilot compute class.
Load Balancing - To take advantage of the new features of the global external Application Load Balancer, you can now migrate your classic Application Load Balancer resources to the global external Application Load Balancer infrastructure.
Looker - The following features have been added to Studio in Looker, which is available in preview: Some Looker permissions now apply to Studio in Looker reports. Studio in Looker reports now support some download and export capabilities.
Migration Center - Preview: Added support for discovery of Azure virtual machine (VM) instances and uploading the collected information to Migration Center.
NetApp - Google Cloud NetApp Volumes now supports volume replication for large capacity volumes. The backup vault now allows users to specify a minimum retention period for backups, which prevents the backup deletion before the specified number of days. The Flex service level of Google Cloud NetApp Volumes that supports the independent provisioning of capacity and performance with zonal pools in selected regions is now generally available.
Cloud Interconnect - Cross-Site Interconnect (Preview) support is available in the following colocation facilities: Taipei, Taiwan For more information, see the Locations table and Global Locations.
Security Command Center - In the Google Cloud console, the Google Kubernetes Engine (GKE) security posture dashboard shows the top threats and software vulnerabilities that affect your GKE workloads.
Service Mesh - 1.25.x. 1.25.2-asm.3 is now available for in-cluster Cloud Service Mesh. 1.24.x. 1.24.5-asm.3 is now available for in-cluster Cloud Service Mesh. 1.23.x. 1.23.6-asm.3 is now available for in-cluster Cloud Service Mesh. 1.22.x. In-cluster Cloud Service Mesh 1.22 is no longer supported.
Cloud SQL MySQL - You can now create an instance with both private services access and Private Service Connect enabled.
Cloud SQL Postgres - The rollout of the following minor versions, extension versions, and plugin versions is complete: Minor versions 13.20 is upgraded to 13.21. You can now create an instance with both private services access and Private Service Connect enabled.
Cloud SQL SQL Server - Cloud SQL for SQL Server now extends query insights and index advisor support to read replicas. You can now create an instance with both private services access and Private Service Connect enabled.
Cloud TPU - Public preview: You can request Cloud TPUs using future reservations in calendar mode. Public preview: You can enable reservation sharing for Cloud TPU.
VMware Engine - All new VMware Engine private clouds now deploy with the following versions: VMware vSphere version 8.0 update 3 NSX-T 4.2.1.2 Existing private clouds will be upgraded starting June 2025.
VPC Service Controls - Preview stage support for the following integration: Model Armor. Preview stage support for the following integration: Storage batch operations.
Virtual Private Cloud - Service producers can publish services that are hosted on cross-region internal Application Load Balancers.
AlloyDB - AlloyDB for PostgreSQL supports the pg_ivm extension, which provides incremental view maintenance for materialized views. AlloyDB AI query engine (Preview) lets you combine natural language with SQL using operators like ai.if, ai.rank, and ai.generate.
Anthos clusters on VMware - Google Distributed Cloud (software only) for VMware 1.31.500-gke.68 is now available for download. The following functional changes were made in 1.31.500-gke.68: Upgraded etcd to v3.4.33. The following issues were fixed in 1.31.500-gke.68: Fixed vulnerabilities listed in Vulnerability fixes.
Apigee API Hub - Apigee API hub is now available in the following regions: europe-west10 (Berlin) us-east5 (Columbus) us-south1 (Dallas) me-central2 (Dammam) asia-south2 (Delhi) me-central1 (Doha) europe-north1 (Finland) europe-west3 (Frankfurt) asia-east2 (Hong Kong) asia-southeast2 (Jakarta) africa-south1 (Johannesburg) us-west4 (Las Vegas) us-west2 (Los Angeles) europe-southwest1 (Madrid) australia-southeast2 (Melbourne) europe-west8 (Milan) northamerica-northeast1 (Montréal) europe-west4 (Netherlands) asia-northeast2 (Osaka) us-west3 (Salt Lake City) southamerica-west1 (Santiago) asia-northeast3 (Seoul) us-east1 (South Carolina) asia-east1 (Taiwan) me-west1 (Tel Aviv) asia-northeast1 (Tokyo) northamerica-northeast2 (Toronto) europe-west12 (Turin) europe-central2 (Warsaw) europe-west6 (Zürich) For more information, see API hub locations.
Apigee Advanced API Security - On May 20, 2025 we released a new version of Advanced API Security Abuse Detection. Advanced API Security Abuse Detection incident reports now include the ability to view raw data With this new functionality, you can view raw data underlying an incident report, including client IP address, API proxy, developer app, and other attributes.
Cloud Architecture Center - File storage on Compute Engine: Added information about Google Cloud Managed Lustre and DDN Infinia. Parallel file systems for HPC workloads: Added information about Google Cloud Managed Lustre and DDN Infinia.
Assured Workloads for Goverment - The Canada Protected B control package is now generally available. v1. The names for some Assured Workloads control packages are changing.
Backup and DR Service - Backup and DR Service 11.0.15.226 is now available to update your backup/recovery appliance. There is a new committed use discount (CUD) for customers using Backup and DR Service to protect Oracle databases into a backup vault. Backup and DR Service now supports backup and restore of Db2 databases using persistent disk snapshots. These issues have been fixed: An issue in which multiple snapshot/Direct OnVault jobs became stuck in an unresponsive state after attempting to connect to vCenter with an openssl command. Vulnerabilities CVE-2024-42301, CVE-2024-42284, and CVE-2024-41092 have been fixed at kernel version 4.18.0-553.33.1.el8_10. This release introduces enhanced logging and alerting capabilities for backup/recovery appliances, enabling proactive monitoring of their health and status. Introducing preview of a simplified one-step procedure for changing backup plans assigned to your Compute Engine VMs.
BigQuery - When you migrate Teradata data to BigQuery using the BigQuery Data Transfer Service, you can now specify the outputs of the BigQuery translation engine to use as schema mapping. You can use custom constraints with Organization Policy to provide more granular control over specific fields for some BigQuery resources. Starting September 15 2025, the bigquery.datasets.getIamPolicy IAM permission is required to view a dataset's access controls and to query the INFORMATION_SCHEMA.OBJECT_PRIVILEGES view. When you Set up Gemini in BigQuery you are now prompted to grant the BigQuery Studio User and BigQuery Studio Admin roles. You can select multiple columns and perform data preparation tasks on them, including dropping columns. You are now able to set access controls on routines. You can now perform supervised tuning on a BigQuery ML remote model based on a Vertex AI gemini-2.0-flash-001 or gemini-2.0-flash-lite-001 model. Continuous queries let you build long-lived, continuously processing SQL statements that can analyze, process, and perform machine learning (ML) inference on incoming data in BigQuery in real time. Spanner now supports cross regional federated queries from BigQuery which allow BigQuery users to query Spanner tables from regions other than their BigQuery region.
Capacity Planner - Preview: You can view and export usage and forecast data of the machine types and TPUs in your project, folder, or organization.
CDN - Cloud CDN supports content targeting, which helps you cache and deliver assets that are customized for your end-user contexts.
Chronicle Security Operations - Environment load balancing The environment load balancing feature offers improved stability and fair resource sharing in multi-tenant environments. The following parser documentation is now available. Self-Service Deprovisioning for Google SecOps You can now deprovision your Google SecOps tenant and associated data directly. Simplified provisioning and onboarding The process for customer self provisioning and onboarding has been streamlined, significantly reducing the time required to onboard to Google SecOps.
Compute Engine - Generally available: Resource-based committed use discounts (CUDs) are available for M4 machine types that come with 6 TB of memory. Public preview: The general-purpose C4D machine series offers Local SSD (-lssd) machine types with up to 12 TiB of Titanium SSD. Preview: You can use future reservation requests in calendar mode to reserve capacity for creating VMs with TPUs attached.
Database Migration Service - Database Migration Service now supports MySQL minor version 8.0.42 for homogeneous MySQL migrations.
Dataplex - Previously, Dataplex data profile scans were limited to 300 columns per BigQuery table. You can now run data profile scans on all 10,000 columns in a BigQuery table.
Dataproc Serverless - New Dataproc Serverless for Spark runtime versions: 1.1.104 1.2.48 2.2.48.
Dataproc - Dataproc now supports the creation of zero-scale clusters, available in preview.
Cloud Deploy - Cloud Deploy now uses Skaffold 2.16 as the default Skaffold version, as of May 23, 2025, for all target types.
Google Distributed Cloud Edge - This is a minor release of Google Distributed Cloud connected (version 1.9.0). The following new functionality has been introduced in this release of Google Distributed Cloud connected: Workload network traffic tagging on GDCc servers. The following changes to existing functionality have been introduced in this release of Google Distributed Cloud connected: IPv4/IPv6 dual-stack networking GA. Security mitigations for the following vulnerabilities have been implemented in this release of Google Distributed Cloud connected: OS layer security mitigations: for a list of vulnerabilities, check the release page. The following Google Distributed Cloud connected components have been updated: GDC software-only has been updated from version 1.29.800-gke.111 to version 1.30.400-gke.136. The following documentation updates have been implemented in this release of Google Distributed Cloud connected: Survivability mode. The following issues have been resolved in this release of Google Distributed Cloud connected: Reallocating a GPU resource from a VM to a container no longer causes an initialization error. This release of Google Distributed Cloud connected contains the following known issues: Storage is not freed immediately upon cluster deletion.