Figure 1 shows the overall architecture presented in this blog post. We cover the components and their connections in the context of two common workflows of the MLOps system.
Components
Vertex AI is at the heart of this system, and it leverages Vertex Managed Datasets, AutoML, Predictions, and Pipelines. We can create and manage a dataset as it grows using Vertex Managed Datasets. Vertex AutoML selects the best model without requiring much modeling expertise on your part. Vertex Predictions creates an endpoint (a REST API) with which the client communicates.
This is a simple, fully managed, yet fairly complete end-to-end MLOps workflow: it moves from a dataset to a trained model that gets deployed. The workflow can be written programmatically with Vertex Pipelines, which outputs a specification for the ML pipeline, allowing you to re-run the pipeline whenever and wherever you want (a compilation sketch follows below). You specify when and how to trigger the pipeline using Cloud Functions and Cloud Storage.
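As a minimal sketch, compiling a pipeline definition into a reusable spec might look like the following; the component, pipeline name, and output path are illustrative, not taken from the project.

```python
# A minimal sketch, assuming the kfp SDK's v2 namespace; the component,
# pipeline name, and output path are illustrative.
from kfp.v2 import compiler, dsl

@dsl.component
def log_source(gcs_source: str):
    # placeholder for real steps such as dataset import and AutoML training
    print(f"Training data located at: {gcs_source}")

@dsl.pipeline(name="automl-training-pipeline")
def pipeline(gcs_source: str):
    log_source(gcs_source=gcs_source)

# The compiled JSON spec is what Vertex Pipelines executes, so it can be
# stored and re-run whenever or wherever needed.
compiler.Compiler().compile(
    pipeline_func=pipeline,
    package_path="pipeline_spec.json",
)
```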
Cloud Functions is a serverless way to deploy your code in Google Cloud. In this particular project, it triggers the pipeline by listening for changes at the specified Cloud Storage location: when a new dataset is added (for example, under a new SPAN number), the pipeline is triggered to train on the new data and deploy a new model.
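As a rough sketch, a first-generation, Python-runtime Cloud Function listening to the bucket could submit the compiled spec to Vertex Pipelines like this; the project, region, and spec path are placeholders.

```python
# A sketch of a GCS-triggered Cloud Function that submits the compiled spec
# to Vertex Pipelines. Project, region, and the spec path are placeholders.
from google.cloud import aiplatform

PROJECT_ID = "your-project"                              # placeholder
REGION = "us-central1"                                   # placeholder
PIPELINE_SPEC = "gs://your-bucket/pipeline_spec.json"    # placeholder

def trigger_pipeline(event, context):
    """Runs whenever a new object is finalized in the watched bucket."""
    gcs_path = f"gs://{event['bucket']}/{event['name']}"
    aiplatform.init(project=PROJECT_ID, location=REGION)
    job = aiplatform.PipelineJob(
        display_name="automl-training-pipeline",
        template_path=PIPELINE_SPEC,
        parameter_values={"gcs_source": gcs_path},
    )
    job.submit()  # returns immediately; the pipeline runs asynchronously
```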
Workflow
This MLOps system prepares the dataset with either Vertex Dataset's built-in user interface (UI) or any external tool, based on your preference. You upload the prepared dataset into the designated Google Cloud Storage (GCS) bucket under a new folder named SPAN-NUMBER (a scripted upload is sketched below). Cloud Functions then detects the change in the GCS bucket and triggers the Vertex Pipeline to run the jobs from AutoML training through endpoint deployment.
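If you prefer to script the upload, a minimal sketch with the google-cloud-storage client might look like this; the bucket name and local folder are assumptions.

```python
# An illustrative scripted upload; the bucket name and local folder are
# assumptions.
import pathlib
from google.cloud import storage

client = storage.Client()
bucket = client.bucket("your-dataset-bucket")  # the watched GCS bucket

local_dir = pathlib.Path("span-2")  # a newly prepared data span
for path in local_dir.rglob("*"):
    if path.is_file():
        # mirror the local layout under the span-2/ prefix in the bucket
        blob = bucket.blob(f"span-2/{path.relative_to(local_dir)}")
        blob.upload_from_filename(str(path))
```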
Inside the Vertex Pipeline, the first step checks whether a Vertex Dataset was created in a previous run. If the dataset is new, the pipeline creates a new Vertex Dataset by importing the data from the GCS location and emits the corresponding artifact. Otherwise, it imports the additional data into the existing Vertex Dataset and emits an artifact.
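The existence check could be a lightweight custom KFP component along these lines; the display-name filter and the string return convention are assumptions, not the project's exact code.

```python
# A sketch of the existence check as a lightweight KFP component; the
# display-name filter and the string return convention are assumptions.
from kfp.v2.dsl import component

@component(packages_to_install=["google-cloud-aiplatform"])
def check_dataset_exists(project: str, region: str, display_name: str) -> str:
    from google.cloud import aiplatform

    aiplatform.init(project=project, location=region)
    datasets = aiplatform.ImageDataset.list(
        filter=f'display_name="{display_name}"'
    )
    return "true" if datasets else "false"
```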
When the Vertex Pipeline recognizes the dataset as a new one, it trains a new AutoML model and deploys it by creating a new endpoint. If the dataset isn't new, the pipeline tries to retrieve the model ID from Vertex Model and uses it to decide whether a brand-new AutoML model or an updated one is needed: if no model has been created yet, the second branch trains one from scratch; otherwise it trains on top of the previous model. In both cases, the training component emits the model artifact as well.
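The model lookup might be sketched like this; the project, region, and display-name filter are assumptions.

```python
# A hedged sketch of the model lookup; the display-name filter is an
# assumption.
from google.cloud import aiplatform

aiplatform.init(project="your-project", location="us-central1")  # placeholders
models = aiplatform.Model.list(filter='display_name="cifar10-automl"')
# No match means no model has been created yet, so train from scratch;
# otherwise reuse the ID to train on top of the previous model.
model_id = models[0].name if models else None
```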
Directory structure that reflects different distributions
In this project, I have created two subsets of the CIFAR-10 dataset, SPAN-1 and SPAN-2. A more general version of this project, which shows how to build training and batch evaluation pipelines, can be found here. Those pipelines can be set up to cooperate so they evaluate the currently deployed model and trigger the retraining process.
ML Pipeline with Kubeflow Pipelines (KFP)
We chose Kubeflow Pipelines to orchestrate the pipeline, and there are a few things I would like to highlight. First, it's good to know how to create branches with conditional statements in KFP. Second, you need to explore the AutoML API specification to fully leverage its capabilities, such as training a model based on a previously trained one. Last, you need a way to emit artifacts for Vertex Dataset and Vertex Model that Vertex AI can recognize (one approach is sketched below). Let's go through these one by one.
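On that last point, one way to emit an artifact that Vertex AI can resolve back to a resource is to set its resourceName metadata, which is the convention the Google Cloud pipeline components use; the component below is an illustrative sketch, and the URI format should be treated as an assumption.

```python
# An illustrative component that emits a Vertex-recognizable artifact by
# setting the resourceName metadata key; the URI format mirrors what the
# Google Cloud pipeline components produce but is an assumption here.
from kfp.v2.dsl import Artifact, Output, component

@component
def import_existing_dataset(
    project: str, region: str, dataset_id: str, dataset: Output[Artifact]
):
    # Vertex AI resolves the artifact back to the dataset via resourceName.
    resource_name = f"projects/{project}/locations/{region}/datasets/{dataset_id}"
    dataset.metadata["resourceName"] = resource_name
    dataset.uri = f"https://{region}-aiplatform.googleapis.com/v1/{resource_name}"
```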
Branching strategy
In this project, there are two main branches, with two sub-branches inside the second main branch. The main branches split the pipeline based on whether an existing Vertex Dataset is present. The sub-branches apply within the second main branch, which is selected when there is an existing Vertex Dataset: it looks up the list of models and decides whether to train an AutoML model from scratch or from a previously trained one.
ML pipelines written in KFP can include conditions using the special kfp.dsl.Condition syntax. For instance, we can sketch the branches as follows (component names and condition values here are illustrative):
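```python
# Component names (check_dataset_exists from the earlier sketch, plus a
# hypothetical check_model_exists) and the condition values are assumptions;
# elided bodies would hold the dataset, training, and deployment ops.
from kfp.v2 import dsl

@dsl.pipeline(name="automl-branching-pipeline")
def pipeline(project: str, region: str, gcs_source: str):
    dataset_check = check_dataset_exists(
        project=project, region=region, display_name="cifar10"
    )

    # First main branch: no Vertex Dataset yet, so create one, train an
    # AutoML model, and deploy it to a new endpoint.
    with dsl.Condition(dataset_check.output == "false", name="new-dataset"):
        ...

    # Second main branch: the dataset exists; import the new span and
    # decide how to train based on whether a model already exists.
    with dsl.Condition(dataset_check.output == "true", name="existing-dataset"):
        model_check = check_model_exists(project=project, region=region)

        with dsl.Condition(model_check.output == "false", name="train-from-scratch"):
            ...

        with dsl.Condition(model_check.output == "true", name="train-on-previous"):
            ...
```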