Welcome to above the clouds
GCP – Fireworks.ai: Lighting up gen AI through a more efficient inference engine
Enterprises across industries are investing in AI technologies to move faster, be more productive, and give their customers the products and services that they need. But moving AI from prototype to production isn’t easy. That’s why we created Fireworks AI. The story of Fireworks AI started seven years ago at Meta AI, where a group […]
GCP – Faster food: How Gemini helps restaurants thrive through multimodal visual analysis
Businesses across all industries are turning to AI for a clear view of their operations in real-time. Whether it’s a busy factory floor, a crowded retail space, or a bustling restaurant kitchen, the ability to monitor your work environment helps businesses be more proactive and ultimately, more efficient. Gemini 1.5 Pro’s multimodal and long context […]
GCP – Veo and Imagen 3: Announcing new video and image generation models on Vertex AI
Generative AI is leading to real business growth and transformation. Among enterprise companies with gen AI in production, 86% report an increase in revenue1, with an estimated 6% growth. That’s why Google is investing in its AI technology with new models like Veo, our most advanced video generation model, and Imagen 3, our highest quality […]
AWS – Deploy GROW with SAP on AWS from AWS Marketplace
GROW with SAP on AWS is now available for subscription from AWS Marketplace. As a complete offering of solutions, best practices, adoption acceleration services, community and learning, GROW with SAP helps any size organization adopt cloud enterprise resource planning (ERP) with speed, predictability, and continuous innovation. GROW with SAP on AWS can be implemented in […]
AWS – Introducing latency-optimized inference for foundation models in Amazon Bedrock
Latency-optimized inference for foundation models in Amazon Bedrock now available in public preview, delivering faster response times and improved responsiveness for AI applications. Currently, these new inference options support Anthropic’s Claude 3.5 Haiku model and Meta’s Llama 3.1 405B and 70B models offering reduced latency compared to standard models without compromising accuracy. As verified by […]
AWS – VPC Lattice now includes TCP support with VPC Resources
With the launch of VPC Resources for Amazon VPC Lattice, you can now access all of your application dependencies through a VPC Lattice service network. You’re able to connect to your application dependencies hosted in different VPCs, accounts, and on-premises using additional protocols, including TLS, HTTP, HTTPS, and now TCP. This new feature expands upon […]
AWS – Amazon EC2 P5en instances, optimized for generative AI and HPC, are generally available
Today, AWS announces the general availability of Amazon Elastic Compute Cloud (Amazon EC2) P5en instances, powered by the latest NVIDIA H200 Tensor Core GPUs. These instances deliver the highest performance in Amazon EC2 for deep learning and high performance computing (HPC) applications. You can use Amazon EC2 P5en instances for training and deploying increasingly complex […]
GCP – (Re)Introducing IBM Power for Google Cloud
Back in January of 2020, we announced the availability of IBM Power Systems for Google Cloud. But while the pandemic accelerated cloud computing adoption, many large enterprises still faced challenges with critical workloads such as those often found on the Enterprise IBM Power platform. At the beginning of 2022, we partnered with Converge Technology Solutions, […]
GCP – PayPal’s Real-Time Revolution: Migrating to Google Cloud for Streaming Analytics
At PayPal, revolutionizing commerce globally has been a core mission for over 25 years. We create innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, empowering consumers and businesses in approximately 200 markets. Ensuring the availability of services offered to both merchants and consumers is paramount. PayPal’s journey with Dataflow has […]
GCP – Vertex AI grounding: More reliable models, fewer hallucinations
At the Gemini for Work event in September, we showcased how generative AI is transforming the way enterprises work. Across all the customer innovation we saw at the event, one thing was clear – if last year was about gen AI exploration and experimentation, this year is about achieving real-world impact. Gen AI has the […]