Welcome to above the clouds

GCP – Retailers unwrap a successful 2021 holiday season with Google’s Black Friday/Cyber Monday program
Over the last five years, the November-December holiday shopping season has accounted for an average of 19 percent of annual retail sales, driven in large part by the surge in shopping over the five-day Thanksgiving holiday weekend. In 2021, nearly 180 million Americans shopped between Thanksgiving Day and Cyber Monday, generating online sales of $8.9B […]

GCP – GCP Controls to leverage for Data Pipeline in Regulated Industries
Many companies, both in Digital Native and traditional regulated industries, such as Finance, Healthcare and Telecom, use their data and cloud technologies to solve complex problems, enable rapid ML experimentation, and bring new products to market. However, before using Cloud for data workloads, many in regulated industries weigh risk vs reward. The risks are mainly […]

GCP – Explaining machine learning models to business users using BigQueryML and Looker
Organizations increasingly turn to AI to transform work processes, but this rapid adoption of models has amplified the need for explainable AI. Explaining AI helps us understand how and why models make predictions. For example, a financial institution might wish to use an AI model to automatically flag credit card transactions for fraudulent activity. While […]

GCP – Deploying and operating cloud-based 5G networks
Communication services providers (CSPs) are experiencing a period of disruption. Overall revenue growth is decelerating and is projected to remain below 1 percent per year, following a trend that started even before the pandemic.1 At the same time, driven by the pandemic, data consumption in 2020 increased by 30 percent relative to 2019, with some […]

GCP – A video guide to reactive programming with Google Maps Platform
The Google Maps Platform Android SDK supports extensions for reactive programming, which helps you write code to handle asynchronous operations. Write reactive and responsive mapping applications with Google Maps Platform In mobile apps, asynchronous events can happen at any point in time: user touch events, waiting for network calls to complete, or receiving push notifications, […]

GCP – Google Cloud expands higher education credits to 8 countries in Africa
To better support equity and opportunity in higher education across the globe, we are proud to announce the expansion of Google Cloud research, teaching, and learning credits to eight countries in Africa: Egypt, Ghana, Kenya, Morocco, Namibia, Nigeria, Senegal, South Africa, and Tunisia. With the recent additions of five countries in Latin America, Asia, and […]

GCP – PyTorch/XLA: Performance debugging on Cloud TPU VM: Part III
This article is the final in the three part series to explore the performance debugging ecosystem of PyTorch/XLA on Google Cloud TPU VM. In the first part, we introduced the key concept to reason about the training performance using PyTorch/XLA profiler and ended with an interesting performance bottleneck we encountered in the Multi-Head-Attention (MHA) implementation […]

AWS – Amazon SageMaker Pipelines now supports concurrency control
Amazon SageMaker Pipelines, a fully managed service that allows customers to define and orchestrate their model building steps as workflows, now allows customers to set concurrency limits on the number of steps which can be executed in parallel. Read More for the details.

AWS – Amazon SageMaker Pipelines now offers native EMR integration for large scale data processing
Amazon SageMaker Pipelines is a fully-managed service that allows customers to define and orchestrate their model building steps as workflows. Today, we are happy to introduce a new step type that allows machine learning engineers to run data processing applications using open source frameworks such as Apache Spark, Presto, and Hive on Amazon EMR clusters. […]
AWS – Amazon EMR now supports Apache Iceberg, a highly performant, concurrent, ACID-compliant table format for data lakes
We are excited to announce that Amazon EMR 6.5.0 now includes Apache Iceberg version 0.12. Apache Iceberg is an open table format for large data sets in Amazon S3 and provides fast query performance over large tables, atomic commits, concurrent writes, and SQL-compatible table evolution. With the current release, you can use Apache Spark 3.1.2 […]