📝 Guest Post: Democratizing Vector Databases: Empowering Access & Equality*
Was this email forwarded to you? Sign up here In this guest post, Yujian Tang, Developer Advocate at Zilliz uncovers the true meaning behind democratizing a vector database and its profound implications to promote accessibility, equality, and inclusivity. The 21st century is all about the democratization of technology. The internet boom enabled large-scale collaboration leading to open source becoming a typical software adoption pattern. As the pace and scale of technological innovation grow, we must work to make it more accessible. As a software engineer, democratization of technology means making it as widely available as possible. It means using what I know to make creating, adopting, and understanding technological advances easier for others. Here at Zilliz, we have always been about accelerating the adoption of vector databases, not just about increasing adoption of the open source project Milvus. In this article, I’m going to cover:
What Does Democratizing Vector Databases Mean for Devs?Whenever I hear “democratize” in the context of “democratizing XYZ technology,” I think of expanding access to that technology. So when it comes to vector databases, I think of expanding access to vector databases. Traditionally, vector databases have only been available to software developers at enterprises. Milvus began the process of democratizing vector databases when it became an open-source Linux Foundation project. It was one of the first vector databases available to developers through being an open-source project. As the project has grown, more and more developers have been able to use, learn about, and contribute to vector databases. Pillars of Technology DemocratizationDemocratization of technology comes with specific challenges — especially the democratization of complex tools like vector databases. There are three pillars to look at when it comes to democratizing technology. They are education, increasing accessibility, and evangelism. Education on Vector Databases and Related ToolingEducation is the most crucial topic that many companies often get wrong. Education is about educating on your specific product, the technology at large, and related tools. That’s why here at Zilliz, we create content about many things, not just Milvus. The content we write reflects our desire to accelerate the adoption of vector databases through education. Therefore, we have content about essential concepts like Hierarchical Navigable Small Worlds (HNSW), scalar and product quantization, and inverted file indices. In addition to providing resources for understanding the concepts behind vector databases, we must provide resources for related tooling. For example, the popularity of large language models (LLMs) has ushered in a range of new tools. Some new tools that have come to the forefront include LlamaIndex, Auto-GPT, and LangChain. Additionally, in alignment with our goal to provide educational resources for the community, many of our content pieces go out to third parties, such as TheSequence, The New Stack, and some Medium publications. Increasing Accessibility to Vector DatabasesWhile providing education about technology is excellent, it’s not helpful unless you offer ways to access it. In our case, open-sourcing Milvus was the first step to increasing access to vector databases. Moving beyond simply being open source, the Milvus project has also pursued other avenues of increasing accessibility. Milvus is also available through Docker images with templates for Docker Compose and Helm. In addition, we recently made it available through In addition to Milvus, Zilliz has worked to increase accessibility as well. Initially, Zilliz provided $400 in free credits and now offers a free tier that allows up to half a million vectors! That’s enough for pretty much any developer. With Zilliz Cloud's free tier, almost any developer can get started with vector databases — for free. Technology EvangelismThe last pillar to address in democratizing technology is to evangelize it. What use is making technology available and providing education about it if you don’t tell people why it’s useful? In terms of accelerating adoption, education explains the how, and increasing the accessibility accounts for the what — evangelizing provides the why. We do evangelism mainly through content that shows the power of vector databases. You can see this through some of the educational material I provided above. We also give talks about vector databases. Some virtual and some in-person talks. I recently gave a talk in Seattle about the use of vector databases as a solution to solving data problems with LLMs. Summing Up: Democratizing Vector DatabasesDemocratizing vector databases is critical because vector databases solve many problems in unstructured data. They were previously only available to developers at large companies due to the sheer complexity and scale of such a project. However, the popularity of LLMs has thrust the idea of vector databases into the mainstream and given rise to countless use cases that they didn’t have before. This makes democratization even more critical. At Zilliz, we approached democratization with three pillars — education, accessibility, and evangelism. Education is the most crucial part of these pillars for us. For developers, educational material provides the “how” to use vector databases and complementary tools. Additionally, we’ve always worked to increase accessibility and continue to do so. Open-sourcing the software was the first step. Other steps to increase accessibility include providing templates and images for containerization. We recently released Milvus Lite, a vector database that can run directly in your Jupyter Notebook. Finally, we engage in technical evangelism to spread the word about the use and power of vector databases. We do this through providing webinars, speaking at community events, and being present on social media. Zilliz continues to make efforts to democratize vector databases and this is exciting because of how important they are. Vector databases are critical for solving data problems in LLMs and are the best existing solution for things like reverse image search, semantic text search, and product recommendations. I’m personally excited to be part of a team who’s helping to grow the vector database space and look forward to all the amazing things being built for and by the community! *This post was written by Yujian Tang, Developer Advocate at Zilliz, exclusively for TheSequence. We thank Zilliz for their ongoing support of TheSequence.You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Yann LeCun's Vision Starts Materializing
Tuesday, June 20, 2023
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
📝 Guest Post: Achieving real enterprise outcomes with GPT-You, not GPT-X*
Tuesday, June 20, 2023
Introducing Snorkel's Foundation Model Data Platform
Edge 299: A Taxonomy to Understand Tool-Augmented Language Models
Tuesday, June 13, 2023
What are the different ways to augment LLMs with tools.
📝 Guest Post: Enhancing ChatGPT's Efficiency – The Power of LangChain and Milvus*
Monday, June 12, 2023
In this guest post, the Zilliz team lists the challenges of using ChatGPT and explores how to enhance the intelligence and efficiency of ChatGPT to overcome the obstacles of hallucinations. While
Edge 297: Tool-Augmented Language Models
Monday, June 12, 2023
Can LLMs master knowledge tools?
You Might Also Like
Retro Recomendo: Gift Ideas
Sunday, November 24, 2024
Recomendo - issue #438 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Kotlin Weekly #434
Sunday, November 24, 2024
ISSUE #434 24th of November 2024 Hi Kotliners! Next week is the last one to send a paper proposal for the KotlinConf. We hope to see you there next year. Announcements State of Kotlin Scripting 2024
Weekend Reading — More time to write
Sunday, November 24, 2024
More Time to Write A fully functional clock that ticks backwards, giving you more time to write. Tech Stuff Martijn Faassen (FWIW I don't know how to use any debugger other than console.log) People
🕹️ Retro Consoles Worth Collecting While You Still Can — Is Last Year's Flagship Phone Worth Your Money?
Saturday, November 23, 2024
Also: Best Outdoor Smart Plugs, and More! How-To Geek Logo November 23, 2024 Did You Know After the "flair" that servers wore—buttons and other adornments—was made the butt of a joke in the
JSK Daily for Nov 23, 2024
Saturday, November 23, 2024
JSK Daily for Nov 23, 2024 View this email in your browser A community curated daily e-mail of JavaScript news React E-Commerce App for Digital Products: Part 4 (Creating the Home Page) This component
Not Ready For The Camera 📸
Saturday, November 23, 2024
What (and who) video-based social media leaves out. Here's a version for your browser. Hunting for the end of the long tail • November 23, 2024 Not Ready For The Camera Why hasn't video
Daily Coding Problem: Problem #1617 [Easy]
Saturday, November 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Microsoft. You are given an string representing the initial conditions of some dominoes.
Ranked | The Tallest and Shortest Countries, by Average Height 📏
Saturday, November 23, 2024
These two maps compare the world's tallest countries, and the world's shortest countries, by average height. View Online | Subscribe | Download Our App TIME IS RUNNING OUT There's just 3
⚙️ Your own Personal AI Agent, for Everything
Saturday, November 23, 2024
November 23, 2024 | Read Online Subscribe | Advertise Good Morning. Welcome to this special edition of The Deep View, brought to you in collaboration with Convergence. Imagine if you had a digital
Educational Byte: Are Privacy Coins Like Monero and Zcash Legal?
Saturday, November 23, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 23, 2024? The HackerNoon