Getting Started with Vector Search in SQL Server 2025 Using Ollama

Ollama SQL FastStart streamlines the deployment of SQL Server 2025 with integrated AI capabilities through a comprehensive Docker-based solution. This project delivers a production-ready environment combining SQL Server 2025, Ollama’s large language model services, and NGINX with full SSL support—all preconfigured to work together seamlessly.

I built this project to eliminate the complex configuration hurdles that typically slow down AI integration projects. Whether you’re a database professional wanting to explore SQL Server 2025’s new vector search capabilities or a developer looking to build AI-powered applications on familiar infrastructure, this solution provides everything you need in a single docker-compose file. The entire stack—including the complex certificate trust chain between SQL Server and the Ollama API—is automatically configured, allowing you to focus on building data driven AI applications rather than infrastructure setup.

You can find the code for this project on Github.

Note: SQL Server 2025 RC0 is vector operations are now functional on Macs with M-series CPUs via Rosetta 2

Architecture Overview

The project consists of several Docker containers working together:

SQL Server 2025: Running the latest release with vector and REST capabilities enabled
Ollama: An open-source model serving platform that generates text embeddings
NGINX with SSL: Acts as a secure proxy between SQL Server and Ollama
Automation containers: For certificate generation, model pulling, and SQL Server configuration
Data Persistence: Persistent storage for SQL Server and Ollama models.

The docker compose implementation process will:

Generate SSL certificates for secure communication
Start SQL Server 2025
Launch Ollama and pull the nomic-embed-text model
Configure SQL Server to trust the certificates
Create an external model connection to Ollama’s secure TLS endpoint in SQL Server

Docker Compose Services

Everything is tied together with Docker Compose for easy deployment. The architecture ensures secure communication between components while keeping everything containerized.

Service	Description
`config`	Generates self-signed SSL certificates for NGINX
`ollama`	Runs the Ollama service for managing models
`model-puller`	Automatically pulls a model from Ollama after the service is healthy
`nginx`	Acts as a reverse proxy with SSL termination for secure communication
`sql1`	SQL Server instance with preconfigured settings for Vector, REST and SSL support
`sql-tools`	Utility container for running SQL scripts and configuring the database

Getting Started

1. Clone the repository:

git clone https://github.com/nocentino/ollama-sql-faststart.git
cd ollama-sql-faststart

2. Build and start the services:

docker compose up --detach

3. Verify that all services are running with `docker ps`

Three containers are running successfully:

Container	Port	Purpose	Status
SQL Server	`1433`	Connect with SQL clients	Healthy
NGINX Reverse Proxy	`443`	Secure Ollama API access	Healthy
Ollama Model Server	`11434`	Direct Ollama API access	Healthy

All services are operational and ready for vector database demos.

Once up and running, you’ll have access to:

SQL Server on port 1433
Ollama on port 11434
NGINX enabling TLS Access to Ollama’s API on port 443

Working with Vector Embeddings in SQL Server

The SQL script vector-demos.sql demonstrates vector search capabilities in SQL Server 2025 with Ollama integration:

Connect using your favorite SQL Server tooling, SSMS, or VSCode to localhost, 1433. The default username is sa and the default password is S0methingS@Str0ng!

Demo Steps

The demo script is broken up into 6 steps walking you through how to setup SQL Server 2025 database for AI enabled similarity search.

1. Database Setup

Restores AdventureWorks2025 database from backup

2. Create a connection to an external mode

Create and test an External Model pointing to our local Ollama Container over HTTPS

3. Vector Storage

Adds VECTOR(768) column to Product table for embeddings
Adds text chunk column for storing descriptive text

4. Embedding Generation

Processes each product record
Creates text descriptions by combining product attributes
Generates vector embeddings using Ollama and the new to SQL Server 2025 function (AI_GENERATE_EMBEDDINGS)

5. Basic Vector Search

Demonstrates semantic queries with cosine distance calculation

6. Advanced Vector Search

Enables required trace flags
Creates a vector index using the DiskANN algorithm
Performs efficient Approximate Nearest Neighbor (ANN) search

The script showcases a complete vector search implementation from setup to optimized semantic/similarity search queries.

Conclusion

SQL Server 2025’s vector capabilities represent a significant step forward for integrating AI into traditional database workloads. The ability to perform semantic search directly within SQL Server opens up a whole new world of possibilities for your applications.

This project makes it easy to explore these features in an easy to configure, containerized environment. If you’re interested in vector search, semantic analysis, or keeping up with the latest database technologies, I encourage you to clone this repo and experiment!