2024 03 06

GCP – Build generative AI applications with similarity search in Cloud SQL for MySQL

Generative AI is transforming application development across industries as developers build brand new user experiences that weren’t possible before. We’re already seeing customers like Linear build amazing new AI-powered applications with Google Cloud databases. Recently, we announced that you can now also use Cloud SQL for MySQL to perform similarity searches by indexing and searching for vector embeddings generated by your favorite large language model (LLM). Cloud SQL now allows you to store vectors in the same Cloud SQL for MySQL instance you’re already using, and then search against your vector store using either an exact nearest neighbor (KNN) or approximate nearest neighbor (ANN) search.

Vector search in Cloud SQL for MySQL is built on Google’s open-source ScaNN libraries, which support multiple ANN index types: Tree-AH, Tree-SQ, and Brute Force with autotuning. Cloud SQL supports multiple distance measures, such as cosine, L2, and dot_product. Combining your vector store with your operational data allows you to create more meaningful and relevant experiences by leveraging vector search augmented with real-time data. Let’s dig in to how you can use this capability, currently in preview! Fill out our preview form here to request access.

Let’s imagine you’re building a new website for a library system that helps library patrons pick out e-books that they might enjoy. Users will tell you an e-book they like, and then your website finds other e-books that they might also enjoy. You’ve just upgraded the backend of your website to Cloud SQL for MySQL’s new Enterprise Plus edition, because your library serves customers in a large city who use library services around-the-clock, and you wanted to make sure your website is taking advantage of Enterprise Plus edition’s 99.99% SLA for high availability and up to 3x higher read throughput.

The journey begins by enabling a new MySQL flag called cloudsql_vector. In order to use similarity search, you’ll need to turn your data into embeddings (vectors) and store these vectors in the catalog table. The next sections are easy-to-follow steps that guide you through:

Getting embeddingsStoring and indexing those embeddingsPerforming similarity searches

Let’s assume that your catalog table is called library_catalog and includes details like book titles, descriptions, and copies available. You want to update your library’s catalog to include vector embeddings for all items currently in circulation. You can add a new column to store these vectors — let’s call the column item_embedding, with vector data type. To do this, you would update your table like this.

code_block
<ListValue: [StructValue([(‘code’, ‘ALTER TABLE library_catalog ADD COLUMN item_embedding VECTOR(3)rnUSING VARBINARY;’), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f48244b80>)])]>

Next, generate a vector embedding for each item currently in your catalog. Use your favorite LLM for this — for example, you could use Vertex AI’s pre-trained text embeddings model to create embeddings based off of the item’s description in your catalog. The below example uses the textembedding-gecko@001 model.

code_block
<ListValue: [StructValue([(‘code’, ‘from vertexai.language_models import TextEmbeddingModelrnrnrndef text_embedding() -> list:rn “””Text embedding with a Large Language Model.”””rn model = TextEmbeddingModel.from_pretrained(“textembedding-gecko@001”)rn embeddings = model.get_embeddings([“What is life?”])rn for embedding in embeddings:rn vector = embedding.valuesrn print(f”Length of Embedding Vector: {len(vector)}”)rn return vectorrnrnrnif __name__ == “__main__”:rn text_embedding()’), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f48244460>)])]>

Then, update that item’s row to store its new vector embedding.

code_block
<ListValue: [StructValue([(‘code’, “// Replace ‘[x,y,z]’ with the vector embedding returned by the rn// model api call from the above python examplernUPDATE library_catalog rnSET item_embedding=string_to_vector(‘[x,y,z]’) rnWHERE id=1;”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f48244fa0>)])]>

Once you’ve updated all of the items in your catalog, you can add an index to your catalog to make it easy to perform similarity searches against items in the catalog. We support three different index types:

Tree-SQ: This is the default algorithm as it requires reduced memory and supports persistence across restarts minimizing operational toil. It also features slightly better recall due to reduced compression compared to Tree-AH, but with some additional compute cost.Tree-AH: Ideal for applications that prioritize speed and compute efficiency at the cost of additional memory with slight operational overhead.Brute-Force: Suitable when the distance measure needs to be precise at the cost of speed.

You could also perform brute force searches on your vectors to get the closest nearest neighbor (KNN) by not adding an index. By default, we’ll create a Tree-SQ index.

code_block
<ListValue: [StructValue([(‘code’, “CALL mysql.create_vector_index(rn’vectorIndex’, ‘library_catalog’, ‘item_embedding’,rn’table_size=<num of rows>’)”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f482446a0>)])]>

Now, when a customer comes to the library website, they get suggestions of books similar to their favorite book, combined with a filter on what items are currently available for check out. For example, your website could run this query to get books similar to one that the customer says they like:

code_block
<ListValue: [StructValue([(‘code’, “// Replace ‘[x,y,z]’ with the vector embedding for the rn// query / search terms returned by the model api call rn// from the above python examplernSELECT book_title, num_available rnFROM library_catalog rnWHERE NEAREST(item_embedding) TO (string_to_vector(‘[x,y,z]’));”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f482448b0>)])]>

If you wanted to get the exact nearest items in your catalog, you could instead search for:

code_block
<ListValue: [StructValue([(‘code’, “SELECT book_title, num_available, dist, rn vector_distance(item_embedding, string_to_vector(‘[x,y,z]’)) rnFROM books rnWHERE num_available > 0 rnORDER BY dist rnLIMIT 10;”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f482443d0>)])]>

And this is just the start. Watch the video below to see this in action – you can follow this example, or customize for retail, healthcare, financial industries, and more. Interested in trying the preview? Fill out this form and we’ll be in touch!

GCP – Build generative AI applications with similarity search in Cloud SQL for MySQL

Google Cloud databases stand ready to power your gen AI apps with new capabilities

Related Posts

AWS – Amazon VPC Route Server now available in new regions

GCP – Palo Alto Networks automates customer intelligence document creation with agentic design

GCP – Vibe querying: Write SQL queries faster with Comments to SQL in BigQuery