Azure – General Availability: Standard network features support in US Gov regions
Standard network features for Azure NetApp Files are now available in the US Gov regions.
Read More for the details.
Default and individual user/group quotas allow you to stay in control of capacity consumption on volumes; the feature is now Generally Available (GA).
Read More for the details.
Azure App Configuration Kubernetes Provider lets you centrally configure the applications running in Kubernetes clusters. The latest version offers workload identity authentication in the provider and dynamic refresh of the configuration data.
Read More for the details.
Today, we are announcing the general availability of Amazon DocumentDB (with MongoDB compatibility) I/O-Optimized, a new storage configuration for database clusters that provides improved price performance and predictable pricing for customers with I/O-intensive applications. Amazon DocumentDB I/O-Optimized offers improved performance, increasing write throughput and reducing latency for customers’ most demanding workloads. With Amazon DocumentDB I/O-Optimized, there are zero charges for read and write I/O operations—you only pay for your database instances and storage usage, making it easy to predict your database spend up front. Amazon DocumentDB I/O-Optimized offers up to 40% cost savings for I/O-intensive applications where I/O charges exceed 25% of the total Amazon DocumentDB database spend.
Read More for the details.
Amazon CloudFront announces the general availability of CloudFront KeyValueStore, a global, low-latency, key-value datastore. KeyValueStore allows you to retrieve key-value data from within CloudFront Functions, making functions more customizable by allowing independent data updates. The key-value data is available across all CloudFront edge locations and provides a highly efficient, in-memory key-value store with fast reads from within CloudFront Functions. With KeyValueStore, you can now implement lookup use cases such as feature flags, A/B testing, and storing environment variables with low latency.
Read More for the details.
Amazon QuickSight is excited to announce the launch of the SPICE capacity auto-purchase feature, offering an improved solution for the automatic management of SPICE capacity. Previously, customers had to manually purchase SPICE capacity, and insufficient capacity could lead to data ingestion failures, hindering the intended use of QuickSight. Now, customers can opt in to capacity auto-purchase with just one click. This new capability eliminates the need to estimate usage and manually purchase SPICE capacity each time; instead, customers can seamlessly ingest data and use SPICE worry-free, as QuickSight will automatically acquire the capacity needed to meet their usage requirements.
Read More for the details.
Amazon S3 server access logging now supports automatic date-based partitioning for log delivery. Amazon S3 server access logging provides detailed records for requests made to your S3 buckets including object size, total time, turn-around time, HTTP referer, and more. Now, with date-based partitioning, Amazon S3 automatically generates either event time or delivery time prefixes when delivering access logs to your destination bucket, which allows services like Amazon Athena, Amazon EMR, and Amazon Redshift Spectrum to improve performance and reduce cost when querying logs.
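As a sketch, date-based partitioning is chosen when configuring logging on the source bucket. The bucket names below are placeholders, and the actual boto3 call is shown commented out because it requires AWS credentials and existing buckets:

```python
# Server access logging configuration with event-time date-based
# partitioning. "TargetObjectKeyFormat" is the new piece; bucket names
# are hypothetical.
logging_status = {
    "LoggingEnabled": {
        "TargetBucket": "my-log-bucket",   # placeholder destination bucket
        "TargetPrefix": "access-logs/",
        "TargetObjectKeyFormat": {
            # "EventTime" partitions by when the request occurred;
            # "DeliveryTime" partitions by when the log was delivered.
            "PartitionedPrefix": {"PartitionDateSource": "EventTime"}
        },
    }
}

# To apply (requires credentials):
# import boto3
# boto3.client("s3").put_bucket_logging(
#     Bucket="my-source-bucket", BucketLoggingStatus=logging_status)

print(logging_status["LoggingEnabled"]["TargetObjectKeyFormat"])
```

With event-time prefixes, Athena or Redshift Spectrum queries can prune partitions by date instead of scanning the whole log prefix.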
Read More for the details.
Today, AWS Amplify announces a public preview of its code-first developer experience (Gen 2), enabling developers to build full-stack apps using TypeScript. Gen 2 shifts to a code-first approach that allows developers to express app requirements – data model, business logic, authorization rules – in TypeScript. The necessary cloud infrastructure is automatically deployed based on the app code, without explicit infrastructure definitions.
Read More for the details.
Amazon DocumentDB (with MongoDB compatibility) now integrates with Amazon SageMaker Canvas to enable no-code machine learning (ML) with data stored in Amazon DocumentDB. Customers can now build ML models for regression and forecasting needs and use foundation models for content summarization and generation with data stored in Amazon DocumentDB, without writing a single line of code. The new integration removes the undifferentiated heavy lifting when customers connect to and access data in Amazon DocumentDB, and accelerates ML development with a no-code experience.
Read More for the details.
Today, we are excited to announce that Apache Flink is now generally available for Amazon EMR on EKS. With Apache Flink for Amazon EMR on EKS, customers can transform and analyze streaming data in real time with Apache Flink, an open-source framework for stateful computations over data streams. Amazon EMR on EKS is a deployment option for Amazon EMR that makes it easy for customers to run their big data applications and data lake analytics workloads on EKS. Customers already using Amazon EKS can run their Apache Flink application along with other types of applications on the same Amazon EKS cluster, helping improve resource utilization and simplify infrastructure management.
Read More for the details.
Today, Amazon Elastic Block Store (EBS) announced that io2 Block Express volumes are available on all EC2 instances built on the Nitro system. All new io2 volumes used with EC2 Nitro instances will automatically benefit from the latest generation of EBS storage server architecture designed to deliver consistent sub-millisecond latency and 99.999% durability. With a single io2 Block Express volume, customers can achieve 256,000 IOPS, 4GB/s of throughput, and storage capacity of 64 TiB. io2 Block Express has the lowest p99.9 I/O latency and the best outlier latency control among major cloud providers, making it the ideal choice for the most I/O-intensive, mission-critical deployments of SAP HANA, Oracle, Microsoft SQL Server, and IBM DB2.
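As a rough sketch, the maximum figures above map onto an EC2 `create_volume` request like the following; the availability zone is a placeholder, and the call itself is commented out since it requires AWS credentials:

```python
# Parameters for a maximally provisioned io2 Block Express volume, using the
# limits quoted in the announcement: 256,000 IOPS and 64 TiB capacity.
volume_params = {
    "VolumeType": "io2",
    "Size": 64 * 1024,                 # 64 TiB, expressed in GiB
    "Iops": 256_000,                   # maximum IOPS for a single io2 volume
    "AvailabilityZone": "us-east-1a",  # hypothetical AZ
}

# To create the volume (requires credentials):
# import boto3
# boto3.client("ec2").create_volume(**volume_params)

print(volume_params["Size"], volume_params["Iops"])
```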
Read More for the details.
AWS Lake Formation now allows customers to apply permissions on subfields of their nested tables using data filters. Permissions can be granted on more granular fields, such as particular columns inside structs. Permissions on nested fields give customers more fine-grained control to better match their business needs, with greater flexibility in how they structure their data.
Read More for the details.
Amazon Verified Permissions now provides customers with a new visual schema editor, in addition to the existing JSON editor, in the Verified Permissions console. Customers can now visualize the relationships between the entities used to model principals, resources and actions.
Read More for the details.
Software as a Service (SaaS) is the delivery method of choice for software vendors looking to serve a turnkey and reliable product experience to their end customers. There are many considerations a company must take into account when building a SaaS offering, one of which is the framework you will use to run your SaaS application. Since modern software development uses containers, a natural choice for running modern SaaS platforms is Kubernetes, the popular container orchestrator. In this post, we will go over the fundamentals of deciding which architecture to choose when building a SaaS platform on Google Kubernetes Engine (GKE).
GKE is a managed, production-ready environment for deploying containerized applications. It is based on Kubernetes, an open-source system for automating the deployment, scaling, and management of containerized applications, which was donated to the CNCF by Google, still the project's primary contributor.
GKE offers a number of benefits for SaaS applications, including:
Globally available IP addresses that can be set up to route to one or more clusters depending upon the location of the inbound request. This enables advanced DR and application routing configurations (to learn more, read about multi-cluster ingress and Google Cloud’s Global Load Balancer).
Cost optimization: GKE provides cost optimization insights to help you align your infrastructure spend with your usage.
Scalability: GKE can easily scale your applications up or down to meet demand. Current scale limits are 15,000 nodes per cluster, which leads the industry.
Advanced storage options help you access data reliably, securely, and with high performance.
When choosing a SaaS architecture, you must first think about your isolation requirements and the nature of your SaaS application. There is a trade-off between cost and degree of isolation, at the level of the Kubernetes namespace, node, and cluster; the cost increases with each. In the following sections, we outline architectures based on each in more detail, including their pros and cons. In addition to all the methods mentioned below, you can increase security on the host system by using GKE Sandbox. The main GKE security overview page also covers network security considerations.
One way to host a SaaS application is to set up single-ingress routing to a Kubernetes namespace with a copy of the SaaS application. The ingress router would have intelligence to serve data unique to the authenticated user. This setup is common for SaaS applications that don’t need to isolate users beyond the application’s software layer. This design is often only possible for applications that control tenancy via the software layer of the main SaaS application. CPU, memory, and disk/storage are scaled as the application grows with the default autoscaler without concern for which user is driving the most usage. Storage can be connected via persistent volume claims that are unique to each pod.
Pros:
Cluster and nodes are managed as a single and uniform resource
Cons:
The same underlying server is used for multiple tenants, and CPU spikes or networking events caused by one tenant (“noisy neighbors”) may affect other tenants.
The same cluster control plane is used for multiple tenants, which means that any upgrades to the cluster apply to all tenants at the same time.
User data is only isolated at the application layer, meaning problems in the application could expose one user’s data to another user.
In this pattern, you set up single-ingress routing using the host path to route to an appropriate namespace, where there’s a copy of the application that’s dedicated to a given customer. This setup is common for customers who require highly-efficient resource isolation for their customers. Each namespace can be given a CPU and memory allocation and share excess capacity during spikes. Storage can be connected via persistent volume claims unique to each pod.
Pros:
Tenants can share resources in an isolated environment to boost efficiency and increase security.
Cluster and nodes are managed as a single and uniform resource.
Cons:
The same underlying server is used for multiple tenants, where CPU spikes or networking events caused by one tenant may affect others.
The same cluster control plane is used for multiple tenants, which means that any cluster upgrades apply to all tenants at the same time.
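As a sketch of the per-namespace allocation described above, a Kubernetes ResourceQuota can cap each tenant namespace's CPU and memory while still allowing bursting within limits. The namespace name and figures here are illustrative:

```yaml
apiVersion: v1
kind: ResourceQuota
metadata:
  name: tenant-quota
  namespace: customer-a          # one namespace per tenant
spec:
  hard:
    requests.cpu: "4"            # guaranteed CPU allocation for this tenant
    requests.memory: 8Gi
    limits.cpu: "8"              # ceiling usable during spikes
    limits.memory: 16Gi
```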
Like the pattern above, here you set up single-ingress routing using the host path to route to the appropriate namespace, which contains a copy of the application dedicated to a given tenant. However, the containers running the application are pinned to specific nodes using labels. This provides the application with node-level isolation in addition to namespace isolation. This type of deployment is used when applications are very resource-intensive.
Pros:
Tenants have dedicated resources in an isolated environment.
The cluster and nodes are managed as a single and uniform resource.
Cons:
Each tenant gets their own node and will consume infrastructure resources regardless of whether the tenant is using the application.
The same cluster control plane is used for multiple tenants, which means that any upgrades to the cluster apply to all tenants at the same time.
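One way to implement the node pinning described in this pattern is a `nodeSelector` matching labels applied to the tenant's nodes. The names, labels, and image below are hypothetical:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: saas-app
  namespace: customer-a
spec:
  replicas: 2
  selector:
    matchLabels:
      app: saas-app
  template:
    metadata:
      labels:
        app: saas-app
    spec:
      nodeSelector:
        tenant: customer-a   # schedule only onto nodes labeled for this tenant
      containers:
      - name: app
        image: example.com/saas-app:latest   # hypothetical image
```

For stricter enforcement, node taints with matching tolerations can also keep other tenants' pods off these nodes.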
The final pattern is to use a single unique ingress route to access each cluster, in which a specific version of the application lives that is dedicated to a given customer. This type of deployment is used when applications are very resource-intensive and also require the strictest security standards.
Pros:
Tenants have dedicated resources in completely isolated environments with a dedicated cluster control plane.
Cons:
Each tenant gets their own cluster and will consume infrastructure resources regardless of whether they are using the application.
Clusters have to be updated independently, which can add a substantial operational burden.
Once you’ve selected your baseline architecture, runtime storage is the next piece you’ll need to add to your setup. There are many different options for container-based storage for your application, and the best one will depend on your application’s specific requirements. Latency and persistence is perhaps the most important thing to consider in this decision. You can use the table below to help guide you on the different available options.
Local SSD (ephemeral storage):
Minimum capacity: 375 GB per disk
Use local SSDs if your applications:
Download and process data, such as for AI or machine learning, analytics, batch processing, local caching, and in-memory databases.
Have specialized storage needs and need raw block access on high-performance local storage.
Alternatively, you may want to run specialized data applications and operate local SSDs as a node-level cache for your Pods. You can use this approach to drive better performance for in-memory database applications such as Aerospike or Redis.
Persistent storage disk:
Minimum capacity: 10 GB
Use Persistent Disk storage if your clusters require access to high-performance, highly available, durable block storage. A Persistent Disk volume is typically attached to a single Pod. This storage option supports the ReadWriteOnce access mode. GKE provides support for configuring Persistent Disk volumes with a range of latency and performance options. These disks are best used for general workloads or databases that require performant persistent block storage. Additionally, they can be set up with regional redundancy.
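In practice, a Persistent Disk volume is requested through a PersistentVolumeClaim; a minimal sketch follows. The storage class name is an assumption and should match one available in your cluster:

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: app-data
spec:
  accessModes:
    - ReadWriteOnce              # Persistent Disk supports single-writer access
  storageClassName: premium-rwo  # assumption: an SSD-backed GKE storage class
  resources:
    requests:
      storage: 10Gi              # the minimum Persistent Disk capacity
```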
Object storage:
You can also connect Cloud Storage buckets to your applications. You can even mount storage buckets to your applications to interact with the remote files locally using the FUSE CSI driver. This is a great option if you have pipelines that hydrate your storage outside of your core application.
Filestore (NFS):
Minimum capacity: 1 TB
Filestore provides centralized NFS storage to which you can connect multiple pods for read/write operations. This is a good option when multiple services in your cluster need read/write access to a central storage facility.
First-party services:
Finally, many GKE users connect their SaaS applications to hosted services such as Cloud Spanner, Cloud SQL, or Memorystore for Redis. This is a great option if you don’t need to self-manage your data services and can take advantage of Google Cloud managed services to lower your operational burden.
“Building Weaviate Cloud Services (WCS) on Google GKE enables us to scale our AI-native vector database elastically to meet our customers’ needs.”
– Etienne Dilocker, Co-Founder & CTO, Weaviate
“By building EDB BigAnimal™ on Google GKE, we were able to leverage EDB’s existing Kubernetes expertise to deliver an intelligent managed database platform. GKE lets us embed automation in the platform with our operators while supporting a wide range of compute and storage options for our customers.”
– Benjamin Anderson, SVP of Technology and Strategy, EnterpriseDB
“Our next-generation serverless offering is tailored to meet our customers’ varied needs across their use cases for Observability, Security, and Search, with zero administration overhead. For Google Cloud, we’ve chosen GKE, which enables us to utilize the latest compute and storage options in Google Cloud and achieve the most optimal combination of cost, performance, and ease of management.”
– Steve Kearns, Vice President, Product Management, Elastic
If your organization is considering delivering its SaaS applications on GKE, you have a lot to think about: the benefits you hope to gain from doing so, the pros and cons of the various architectures, and your myriad storage options. You may also want to familiarize yourself with GKE, GKE security, and how to distribute and monetize your SaaS app from within Google Cloud. For more, check out the following resources:
Google Kubernetes Engine
GKE security overview
GKE Sandbox
Build Apps for the GCP Marketplace
Read More for the details.
Wouldn’t it be wonderful if you were always in the know about how your mission-critical workload was doing on Google Cloud? Perhaps you are a compliance officer who is in charge of regulatory compliance for your applications, or a cloud administrator who cares deeply about application observability and ensuring that applications run smoothly in the cloud.
At Google Cloud, we built Regional Persistent Disk with mission-critical workloads in mind — to provide high availability by using synchronous replication of writes (RPO = 0) across two Google Cloud availability zones. As we continue to innovate on high availability, we are excited to introduce two new capabilities: the Regional Persistent Disk Replication State Dashboard, and the Replica State Cloud Monitoring metric, which can help you monitor and gain insight into your Regional Persistent Disk’s replication state.
Monitoring for high-availability compliance audits: Regional Persistent Disks are used extensively as primary storage for mission-critical workloads (such as MySQL, SQL Server, and Elasticsearch) with strict high-availability compliance goals tied to them. Compliance audits are conducted regularly to verify that the workload and its underlying infrastructure meet the standards of availability and resilience required by these goals. Without the ability to periodically monitor the current and historical replication state of Regional Persistent Disks, Google Cloud users have found it challenging to accurately prove that their applications and workloads are compliant with their goals.
Monitoring for high-availability compliance maintainability: In addition to periodically monitoring replication state for compliance reporting, Google Cloud users also want to continuously maintain high-availability and replication standards for their application and its data stored in Regional Persistent Disks. Maintaining replication standards can be challenging, as users must constantly inspect replication state to ensure that standards are being met. A proactive alerting mechanism that triggers when the replication state changes to one that impacts compliance greatly streamlines compliance maintainability.
Let’s take a deep dive into how Regional Persistent Disks enable these benefits for your mission critical workloads.
Before diving deeper into how Regional Persistent Disk enables these benefits, it is useful to take a quick pit stop to understand the different replication states and why they matter. Regional Persistent Disk synchronously replicates data to two replicas in different Google Cloud zones in the same region. Depending on the state of the individual replicas, your regional Persistent Disk volume can be in one of the following replication states:
Fully replicated: Replicas in both zones are available and are fully replicating at RPO=0. In this state, users experience no data loss if a zonal issue requires their virtual machine to be failed over to the other replicated zone. For organizations with a strict high-availability compliance goal, this is the best replication state to be in.
Degraded: One of the replicas is offline and data is not being replicated between the two replicas. In this state, availability of the disk will likely be affected if a zonal error occurs on the remaining replica. The disk will typically not be in this state for very long, as the Persistent Disk platform will be actively self-healing to get back to a fully replicated state as soon as possible. To avoid potential exposure to data unavailability due to a failure of the remaining replica, it is best to enable snapshots or Persistent Disk Asynchronous Replication.
Catching up: Regional Persistent Disks go from being in a degraded state, to catching up, and finally to fully replicated if the replication can be self-healed. This state is a useful precursor to inform you that the disk is most likely working towards being fully replicated.
For more details on replication states, please see the replication state public documentation.
The current and historical Regional Persistent Disk replication status can be observed from the Regional Persistent Disk Replication State Dashboard, located in the Google Cloud console for all attached Regional Persistent Disks:
Figure 1: The Regional Persistent Disk Replication State Dashboard in the Google Cloud console
With this dashboard, you can see the replication state of both replicas of a Regional Persistent Disk. A value of 1 indicates that a replica is fully synced with the replica in the other zone, while a value of 0 indicates replication is not in sync with the other replica. Both replicas need to be in sync for the Regional Persistent Disk to be fully replicated. This dashboard allows you to easily and quickly view the current and historical replication status of a Regional Persistent Disk for high-availability compliance audits and reports.
For more complex auditing, dive deeper into a more detailed view of each replica’s state by creating your own custom monitoring view using Cloud Monitoring’s Metrics Explorer and the Cloud Monitoring metric called “Regional Disk Replica State”. This metric records replication state information for each replica in 60-day time windows, allowing you to gain insights such as:
How long has my Regional Persistent Disk been in a degraded replication state? Consequently, the metric can be used to inform users how long they have been compliant with their high-availability compliance goals.
Which replica is out of sync, causing my Regional Persistent Disk to be in a degraded state? This information is useful for troubleshooting and remediating replication issues with the disk.
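The metric can also be read programmatically via the Cloud Monitoring API. The metric type string and disk name below are assumptions for illustration; check the Regional Disk Replica State documentation for the exact identifier. The API call is commented out since it requires credentials:

```python
# Build a Cloud Monitoring time-series filter for the replica state metric.
# Both the metric type and the device name are hypothetical placeholders.
metric_type = "compute.googleapis.com/instance/disk/replica_state"  # assumption
filter_str = (
    f'metric.type = "{metric_type}" '
    'AND metric.labels.device_name = "my-regional-disk"'  # hypothetical disk
)

# To query (requires google-cloud-monitoring and credentials):
# from google.cloud import monitoring_v3
# client = monitoring_v3.MetricServiceClient()
# series = client.list_time_series(
#     name="projects/my-project",
#     filter=filter_str,
#     interval=some_time_interval,  # a monitoring_v3.TimeInterval
#     view=monitoring_v3.ListTimeSeriesRequest.TimeSeriesView.FULL,
# )

print(filter_str)
```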
Figure 2: Diving deeper into each replica using the Metrics Explorer for a single Regional Persistent Disk and a custom date range
For more information about this metric, see the Google Cloud Regional Disk Replica State documentation.
Since the Regional Disk Replica state is a Cloud Monitoring metric, this metric can be incorporated into many other monitoring and observability capabilities offered by Cloud Monitoring for more extensive audits:
Create dashboards with other official Cloud Monitoring metrics for comprehensive views in combination with other product metrics like compute and networking.
Customize your audits with more flexible metric analysis using the Cloud Monitoring Metrics Explorer.
Export the metric to external monitoring tools like Grafana and Prometheus to integrate Regional Persistent Disk replication status into your organization’s observability tools.
Figure 3: Setting up a proactive alerting policy using the Regional Disk Replica State metric
An additional benefit of the Regional Disk Replica State metric is that it can be integrated into Cloud Monitoring alerting policies. This allows for flexible and proactive alerting through a diverse set of alerting channels, including SMS, Slack, and PagerDuty, to meet your organization’s infrastructure alerting needs.
You can set up alerts for changes in Regional Persistent Disk replication status, such as when a disk is in an unreplicated state, or when individual replicas switch state, making it easy to track and maintain compliance of high availability goals. For more details on enabling an alerting policy with the Regional Disk Replica State metric, see the Cloud Monitoring Alert Policy documentation.
If you’re interested in leveraging the Regional Persistent Disk Replication State Dashboard and the Regional Disk Replica State metric to gain insight into your Regional Persistent Disk high-availability compliance, the dashboard and metric are available today for all projects with attached Regional Persistent Disks. For more information on how to get started, please visit the Regional Disk Replication Monitoring documentation page.
Read More for the details.
Editor’s note: In today’s guest post we hear from two digital events agencies, Gramercy Tech, a New York-based digital agency specializing in digital experiences, and Vatom, a Los Angeles-based company focusing on Web3 software, on their experience working with Google Cloud’s Cloud SQL to build and scale a virtual world.
Regardless of where your organization resides in relation to technology, now is not the time to stand still. But in times of uncertainty, it’s easy to do so. In our world of events and digital experiences, we are in an age of transformation, rethinking everything about how we serve attendees. To inspire us on this journey, we’ve leaned into the idea of building beyond: Rather than thinking about building digital experiences, we lean into the beyond — what does it mean to truly build beyond what was previously thought to be possible?
Recently, Google Cloud helped us answer this question while we were preparing to host a global, digital event. As event organizers, we are always looking for ways to make digital events more engaging and immersive for attendees, especially in the post-pandemic era. To do so, we created The Beyondverse, which combines Gramercy Tech’s Eventfinity platform with Vatom’s Web3 capabilities to create a virtual conference where attendees can listen, learn, network, and participate all from their own homes, in a seamless fashion.
Eventfinity is a cloud-based platform that allows event organizers to create and manage virtual events of all sizes. With Eventfinity, organizers can create custom registration pages, manage speaker and exhibitor information, and deliver live and on-demand content. Eventfinity’s technology stack leverages Google Cloud’s serverless technologies and multi-regional deployments to efficiently scale Cloud Run across the globe. The Beyondverse, meanwhile, is hosted on the Vatom Web3 enterprise platform. Inside The Beyondverse, attendees can interact with objects, engage in conversations with other attendees, attend virtual events, and explore virtual spaces that replicate the look and feel of real-world events. Vatom’s infrastructure is based on a collection of microservices fronted either by a public API or an app, and is delivered via Google Cloud CDN, fronted with Cloud Armor.
Underlying all this is Cloud SQL, Google Cloud’s managed database service for MySQL, PostgreSQL, and SQL Server. With Cloud SQL, we hosted our databases and microservices on a shared VPC, connecting over 15,000 employees and partners simultaneously worldwide in The Beyondverse, with extremely low latency and zero downtime. Cloud SQL also allowed us to establish read replication globally, seamlessly integrate APIs, and enable a secure and encrypted experience for attendees. We kept core functions in our platform speedy by using Redis and Memcached, which we deployed using Memorystore.
Attendees could walk around the Beyondverse for a fully immersive digital experience
This was in marked contrast to our previous cloud-based database. By migrating to Cloud SQL, we’ve been able to scale rapidly to meet the demands of our growing business. It allows us to stay focused on our core business while ensuring compliance with essential requirements such as high availability, reliability, disaster recovery, and security. Building and maintaining the platform was a seamless and managed process with Cloud SQL.
This collaboration with Google Cloud is a testament to the power of immersive experiences and the growing demand for them. We look forward to continuing to innovate and create new digital adventures with Google Cloud.
In addition to Gramercy Tech and Vatom, more than 95% of Google Cloud’s top 100 customers use Cloud SQL to run their businesses. For more information on how Google Cloud’s managed relational database service for MySQL, PostgreSQL, and SQL Server can help you, visit https://cloud.google.com/sql.
Read More for the details.
Now you can retain backups for up to 10 years by defining your own backup and retention policy for Azure Database for PostgreSQL - Flexible Server.
Read More for the details.
Bringing the power of AI to Azure Logic Apps for the first time, this workflow assistant answers any questions you have about Azure Logic Apps from directly within the designer. Currently in public preview, this chat interface provides access to Azure Logic Apps documentation and best practices without requiring you to navigate documentation or search online forums.
Read More for the details.
We are excited to introduce a new Azure service that provides a unified experience to help customers effectively manage and monitor their integration resources. This service comprises two core components, the Azure Integration Environment and Business Process Tracking, both currently in public preview.
Read More for the details.
We are pleased to announce the general availability of .NET Framework custom code for Azure Logic Apps (Standard). This capability allows customers to extend their low-code solutions with the power of custom code.
Read More for the details.