Welcome to above the clouds

AWS – Amazon Connect can now generate forecast for workloads with as little as one contact
Amazon Connect can now generate forecasts for smaller workloads, with as little as one contact, making it easier for contact center managers to predict demand. This eliminates the need for you to manually adjust historical data to meet minimum data requirements. By reducing minimum data requirements, you can now enable managers to generate forecasts for […]

AWS – Amazon SageMaker JumpStart is now available in the AWS GovCloud (US-West and US-East) Regions
Amazon SageMaker JumpStart is now available in the AWS GovCloud (US) Regions. Public sector customers can easily deploy and fine-tune open-weight models through the SageMaker Python SDK. Amazon SageMaker JumpStart is a machine learning (ML) hub that offers hundreds of pre-trained models and built-in algorithms to help you quickly get started with ML. Customers can […]

AWS – Amazon Route 53 Resolver endpoints now support DNS-over-HTTPS (DoH) with Server Name Indication (SNI) validation
Starting today, you can provide Server Name Indication (SNI) with Route 53 Resolver endpoints for DNS-over-HTTPS (DoH), allowing you to specify the target server hostname for DNS query requests from your outbound endpoints to DoH servers that require SNI for TLS validation. DoH on Amazon Route 53 Resolver endpoints allows you to encrypt DNS queries […]

AWS – AWS CodePipeline introduces new general purpose compute action
AWS CodePipeline introduces the Commands action that enables you to easily run shell commands as part of your pipeline execution. With the Commands action, you will have access to a secure compute environment backed by CodeBuild to run AWS CLI, third-party tools, or any shell commands. The Commands action runs CodeBuild managed on-demand EC2 compute, […]

AWS – Amazon Connect Contact Lens supports new read-only permissions for reports and dashboards
Amazon Connect Contact Lens now allows users to save and publish reports and dashboards as read-only. By publishing a report as read-only, only the user who created the report or dashboard can edit the report, while still making it visible for others to view or create a copy. For example, a contact center manager can […]

GCP – Three steps in mapping out your modern platform strategy
As AI adoption speeds up, one thing is becoming clear: the developer platforms that got you this far won’t get you to the next stage. While yesterday’s platforms were awesome, let’s face it, they weren’t built for today’s AI-infused application development and deployment. And organizations are quickly realizing they need to update their platform strategies […]

GCP – Parallelstore is now GA, fueling the next generation of AI and HPC workloads
Organizations use artificial intelligence (AI) and high-performance computing (HPC) applications to process massive datasets, run complex simulations, and train generative models with billions of parameters for diverse use cases such as LLMs, genomic analysis, quantitative analysis, or real-time sports analytics. These workloads place big performance demands on their storage systems, requiring high throughput and I/O […]

GCP – When to use supervised fine-tuning for Gemini
Have you ever wished you could get a foundation model to respond in a particular style, exhibit domain-specific expertise, or excel at a specific task? While foundation models like Gemini demonstrate remarkable capabilities out-of-the-box, there can be a gap between their general knowledge and the nuanced understanding required for specific applications. Supervised Fine-Tuning (SFT) emerges […]

GCP – You can now sign Microsoft Windows artifacts with keys protected by Cloud HSM
To build trust in the software world, developers need to be able to digitally sign their code and attest that the software their customers are downloading is legitimate and hasn’t been maliciously altered. Keys used to sign code are the cryptographic equivalent of crown jewels for many organizations, and protecting them is of utmost importance. […]
GCP – An advanced LlamaIndex RAG implementation on Google Cloud
Introduction Retrieval Augmented Generation (RAG) is revolutionizing how we build Large Language Model (LLM)-powered applications, but unlike tabular machine learning where XGBoost reigns supreme, there’s no single “go-to” solution for RAG. Developers need efficient ways to experiment with different retrieval techniques and evaluate their performance. This post provides a practical guide to rapidly prototyping and […]