
From the course: Fundamentals of AI Engineering: Principles and Practical Applications


Scaling strategies (approximate nearest neighbor, or ANN)


- [Instructor] Welcome back, everyone. Today we're going to tackle one of the most important challenges that AI engineers face in production: how to scale their vector databases. To get started, open up chapter five and navigate to 05_04.ipynb. As always, make sure your venv in the upper right-hand corner is set to the .venv virtual environment. As your AI applications grow, you'll likely move from handling tens, to thousands, to millions, and potentially even billions of vectors. The techniques that we'll cover today are essential for making that transition successfully. Before we dive in, let's establish three primary factors we need to consider when scaling vector databases. First, speed versus accuracy: understanding when to prioritize one versus the other. Second, resource limitations: how do we work with the memory, CPU, and storage constraints that we have in our operating environment? Third, horizontal scaling: how do we redistribute the workload across…
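The speed-versus-accuracy factor can be made concrete with a small sketch. The code below is not taken from the course notebook; it is a toy, standard-library-only illustration that assumes a simple inverted-file-style index (randomly sampled centroids, one bucket per centroid). The names `build_index`, `approx_search`, `n_lists`, and `n_probe` are hypothetical choices for this example. It compares an exact brute-force search against an approximate search that scans only the buckets closest to the query, and reports how many vectors were scanned and what fraction of the true neighbors was recovered (recall).

```python
import random


def sq_dist(a, b):
    # Squared Euclidean distance between two equal-length vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))


def exact_search(data, query, k):
    # Brute force: compare the query against every stored vector.
    ranked = sorted(range(len(data)), key=lambda i: sq_dist(data[i], query))
    return ranked[:k]


def build_index(data, n_lists, rng):
    # Toy coarse index (an assumption for illustration): sample centroids
    # from the data, then bucket every vector under its nearest centroid.
    centroids = rng.sample(data, n_lists)
    buckets = [[] for _ in range(n_lists)]
    for i, vec in enumerate(data):
        nearest = min(range(n_lists), key=lambda c: sq_dist(centroids[c], vec))
        buckets[nearest].append(i)
    return centroids, buckets


def approx_search(data, index, query, k, n_probe):
    # Scan only the n_probe buckets whose centroids are closest to the query.
    centroids, buckets = index
    order = sorted(range(len(centroids)), key=lambda c: sq_dist(centroids[c], query))
    candidates = [i for c in order[:n_probe] for i in buckets[c]]
    ranked = sorted(candidates, key=lambda i: sq_dist(data[i], query))
    return ranked[:k], len(candidates)


def recall(approx_ids, exact_ids):
    # Fraction of the true nearest neighbors that the approximate search found.
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)


rng = random.Random(0)
data = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(500)]
query = [rng.gauss(0, 1) for _ in range(8)]

index = build_index(data, n_lists=10, rng=rng)
truth = exact_search(data, query, k=5)

results = {}
for n_probe in (1, 3, 10):
    ids, scanned = approx_search(data, index, query, k=5, n_probe=n_probe)
    results[n_probe] = (scanned, recall(ids, truth))
    print(f"n_probe={n_probe}: scanned {scanned} of {len(data)} vectors, "
          f"recall={results[n_probe][1]:.2f}")
```

Probing fewer buckets means fewer distance computations but a higher chance of missing true neighbors; probing every bucket scans the whole dataset and reproduces the exact result. Production ANN libraries use more sophisticated structures, but they expose a similar knob.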
