Welcome to above the clouds
GCP – Optimizing image generation pipelines on Google Cloud: A practical guide
Generative AI diffusion models such as Stable Diffusion and Flux produce stunning visuals, empowering creators across various verticals with impressive image generation capabilities. However, generating high-quality images through sophisticated pipelines can be computationally demanding, even with powerful hardware like GPUs and TPUs, impacting both costs and time-to-result. The key challenge lies in optimizing the entire […]
GCP – 8 steps to ensuring a smooth Spanner go-live
As a developer, there’s a lot to think about when you’re getting ready to launch an application. There’s the availability of the underlying database, of course, which stores application state, and determines how fast and you can recover if your application or web servers go down. Thankfully, if you’re running on Spanner, its 99.999% availability […]
AWS – AWS CodePipeline adds native Amazon EKS deployment support
AWS CodePipeline introduces a new action to deploy to Amazon Elastic Kubernetes Service (Amazon EKS). This action enables you to easily deploy your container applications to your EKS clusters, including those in private VPCs. Previously, if you wanted to deploy to a EKS cluster within a private network, you had to initialize and maintain a […]
AWS – AWS announces Backup Payment Methods for invoices
Today, AWS announces the introduction of Backup Payment Methods for AWS invoices in all commercial AWS Regions. This feature enables customers to set up alternate payment methods that will be automatically charged for their invoices if the primary payment method fails. This will help customers make timely invoice payments without the need for manual intervention […]
AWS – Amazon EC2 G6e instances now available in Stockholm region
Starting today, the Amazon EC2 G6e instances powered by NVIDIA L40S Tensor Core GPUs is now available in Europe (Stockholm) region. G6e instances can be used for a wide range of machine learning and spatial computing use cases. Customers can use G6e instances to deploy large language models (LLMs) with up to 13B parameters and […]
AWS – Amazon Elastic Beanstalk now supports Windows Server 2025 and Windows Server Core 2025 environments
AWS Elastic Beanstalk now enables customers to deploy applications on Windows Server 2025 and Windows Server Core 2025 environments. These environments come pre-configured with .NET Framework 4.8.1 and .NET 8.0, providing developers with the latest Long Term Support (LTS) version of .NET alongside the established .NET Framework Windows Server 2025 and Windows Server Core 2025 […]
AWS – Amazon Bedrock now available in Asia Pacific (Hyderabad) and Asia Pacific (Osaka) regions
Beginning today, customers can use Amazon Bedrock in the Asia Pacific (Hyderabad) and Asia Pacific (Osaka) regions to easily build and scale generative AI applications using a variety of foundation models (FMs) as well as powerful tools to build generative AI applications. Amazon Bedrock is a fully managed service that offers a choice of high-performing […]
GCP – Unlock Inference-as-a-Service with Cloud Run and Vertex AI
It’s no secret that large language models (LLMs) and generative AI have become a key part of the application landscape. But most foundational LLMs are consumed as a service, meaning they’re hosted and served by a third party and accessed via APIs. Ultimately, this reliance on external APIs creates bottlenecks for developers. There are many […]
GCP – An SRE’s guide to optimizing ML systems with MLOps pipelines
Picture this: you’re an Site Reliability Engineer (SRE) responsible for the systems that power your company’s machine learning (ML) services. What do you do to ensure you have a reliable ML service, how do you know you’re doing it well, and how can you build strong systems to support these services? As artificial intelligence (AI) […]
GCP – Announcing quantum-safe digital signatures in Cloud KMS
The continued advancement of experimental quantum computing has raised concerns about the security of many of the world’s widely-used public-key cryptography systems. Crucially, there exists the potential for sufficiently large, cryptographically-relevant quantum computers to break these algorithms. This potential highlights the need for developers to build and implement quantum-resistant cryptography now. Fortunately, post-quantum cryptography (PQC) […]
