This page describes AlloyDB AI vector search strategies and explains
when to use each strategy. By default, AlloyDB uses k-nearest neighbors
(KNN) search to find vectors that are similar to a query. When you create
a vector index, AlloyDB AI instead uses a search strategy called
Approximate Nearest Neighbor (ANN), which typically returns results faster
than KNN but might not return the exact closest vectors. When you select a
vector index, you need to balance query latency and recall.
Recall measures how effectively a search retrieves all relevant items for
a given query. For example, imagine you have 100 embeddings, each one
representing an entity in your database. You query your embeddings with a
target vector and limit it to 10 results. A KNN vector search finds the 10
exact closest vectors using a brute force calculation method, which
results in 100% recall. AlloyDB AI uses this method by default
if no vector search index is created or chosen.
When you create a vector index in AlloyDB for PostgreSQL, searches that use
the index typically use ANN, which might partition vectors according to
similarity to make retrieval faster. As a result, the 10 vectors that ANN
returns in the earlier example might not be exactly the 10 vectors that are
closest in distance. If only 8 of the 10 returned vectors are among the 10
closest to your query vector, then your recall is 80%.
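The recall arithmetic in this example can be sketched in a few lines of Python. This is an illustration only, not AlloyDB code: the helper names `exact_knn` and `recall_at_k` are hypothetical, the vectors are random, and the "approximate" result is simulated by swapping out two true neighbors rather than produced by a real ANN index.

```python
import math
import random


def exact_knn(query, vectors, k):
    """Brute-force KNN: rank every vector by Euclidean distance to the query."""
    ranked = sorted(range(len(vectors)), key=lambda i: math.dist(query, vectors[i]))
    return ranked[:k]


def recall_at_k(exact_ids, returned_ids):
    """Fraction of the true nearest neighbors that the search returned."""
    exact = set(exact_ids)
    return len(exact & set(returned_ids)) / len(exact)


random.seed(0)
# 100 embeddings of 8 dimensions each (sizes chosen only for illustration).
vectors = [[random.random() for _ in range(8)] for _ in range(100)]
query = [random.random() for _ in range(8)]

exact = exact_knn(query, vectors, k=10)

# Simulated approximate result (assumption): keep 8 true neighbors and
# replace the other 2 with vectors that are not in the exact top 10.
outside = [i for i in range(len(vectors)) if i not in exact][:2]
approx = exact[:8] + outside

print(recall_at_k(exact, exact))   # 1.0, brute force returns every true neighbor
print(recall_at_k(exact, approx))  # 0.8, only 8 of the 10 true neighbors returned
```

In practice, you would obtain the exact set from a KNN query without an index and the returned set from the same query with the index in use, and then compare the two sets of row identifiers in the same way.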
Query latency measures how quickly search results are returned. For
example, latency is the time that elapses between submitting a query and
receiving the matching vectors.
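One way to observe latency from the client side is to time the search call. The following is a minimal sketch under stated assumptions: `brute_force_search` is a hypothetical in-memory stand-in for a real query, and `measure_latency` is an illustrative helper, not part of any AlloyDB client. Against a real database you would time the round trip of the query instead.

```python
import math
import random
import time


def brute_force_search(query, vectors, k):
    """Stand-in search (assumption): exact KNN over an in-memory list."""
    ranked = sorted(range(len(vectors)), key=lambda i: math.dist(query, vectors[i]))
    return ranked[:k]


def measure_latency(search, *args, repeats=5):
    """Return the median wall-clock time in seconds over `repeats` runs."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        search(*args)
        timings.append(time.perf_counter() - start)
    timings.sort()
    return timings[len(timings) // 2]


random.seed(0)
vectors = [[random.random() for _ in range(8)] for _ in range(1000)]
query = [random.random() for _ in range(8)]

latency = measure_latency(brute_force_search, query, vectors, 10)
print(f"median latency: {latency * 1000:.2f} ms")
```

Taking the median of several runs, rather than a single measurement, reduces the influence of one-off delays such as a cold cache.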
Choose your search strategy
When you perform a vector search in AlloyDB, choose one of the following
search strategies:
| Search strategy | Description | Use cases |
|---|---|---|
| K-nearest neighbors (KNN) | An algorithm that finds the k nearest neighboring data points to a given query data point. When you perform a vector search without creating an index, a KNN search is performed by default. | Your application is very sensitive to accuracy and you need the exact closest matches. You have fewer than 100,000 vectors. |
| Approximate Nearest Neighbors (ANN) | An algorithm that finds approximately the closest data points. ANN divides your existing data points into small groups based on similarity. | Your application requires low latency. You have more than 100,000 vectors. |
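The guidance above can be read as a simple rule of thumb. The sketch below encodes it in a hypothetical helper, `choose_strategy`. Two choices in it are assumptions rather than documented behavior: a requirement for exact matches takes priority over dataset size, and a dataset of exactly 100,000 vectors is treated as small.

```python
VECTOR_COUNT_THRESHOLD = 100_000  # threshold taken from the guidance above


def choose_strategy(num_vectors, needs_exact_matches=False, needs_low_latency=False):
    """Suggest "KNN" or "ANN" from the rule-of-thumb criteria."""
    if needs_exact_matches:
        # Assumption: exactness wins over size, because only KNN gives 100% recall.
        return "KNN"
    if needs_low_latency or num_vectors > VECTOR_COUNT_THRESHOLD:
        return "ANN"
    return "KNN"


print(choose_strategy(50_000))                               # KNN
print(choose_strategy(2_000_000))                            # ANN
print(choose_strategy(2_000_000, needs_exact_matches=True))  # KNN
print(choose_strategy(50_000, needs_low_latency=True))       # ANN
```

Treat the result as a starting point only. The right choice depends on measuring recall and latency for your own data and queries.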
Google recommends that you create a vector index to optimize performance on your
vector search queries. For more information about how the ANN index is used for
similarity searches, see Create indexes using ScaNN.
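To accelerate filtered KNN searches, use the columnar engine: add your
embedding column, and other columns related to your query, to the column
store, either manually or by using auto-columnarization.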