
Redis Vector Library

The AI-native Redis Java client


Home | Documentation | Recipes

Introduction

Redis Vector Library (RedisVL) is the ultimate Java client designed for AI-native applications harnessing the power of Redis.

RedisVL is your go-to client for:

  • Lightning-fast information retrieval & vector similarity search
  • Real-time RAG pipelines
  • LLM semantic caching
  • Hybrid search combining vectors and metadata

💪 Getting Started

Installation

Add RedisVL to your Java (17+) project using Maven or Gradle:

Maven:

<dependency>
    <groupId>com.redis</groupId>
    <artifactId>redisvl</artifactId>
    <version>0.12.0</version>
</dependency>

Gradle:

implementation 'com.redis:redisvl:0.12.0'
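
Or, with the Gradle Kotlin DSL:

implementation("com.redis:redisvl:0.12.0")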

Setting up Redis

Choose from multiple Redis deployment options:

  1. Redis Cloud: Managed cloud database (free tier available)

  2. Redis on Docker: Docker image for development

    docker run -d --name redis-stack -p 6379:6379 -p 8001:8001 redis/redis-stack:latest
  3. Redis Enterprise: Commercial, self-hosted database

  4. Azure Managed Redis: Fully managed Redis Enterprise on Azure

Enhance your experience and observability with the free Redis Insight GUI.
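
Once Redis is running, you can sanity-check connectivity from Java. A minimal sketch using the Jedis client that RedisVL builds on (the URL assumes the local Docker setup above):

import redis.clients.jedis.UnifiedJedis;

// Open a connection and verify that Redis responds before creating indices
try (UnifiedJedis jedis = new UnifiedJedis("redis://localhost:6379")) {
    System.out.println(jedis.ping()); // PONG
}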

Overview

🗃️ Redis Index Management

  1. Design a schema for your use case that models your dataset with built-in Redis data structures and indexable fields (e.g. text, tag, numeric, geo, and vector). Load a schema from a YAML file:

    index:
      name: user-idx
      prefix: user
      storage_type: json
    
    fields:
      - name: user
        type: tag
      - name: credit_score
        type: tag
      - name: embedding
        type: vector
        attrs:
          algorithm: flat
          dims: 4
          distance_metric: cosine
          datatype: float32

    Then load the schema in Java:

    import com.redis.vl.schema.IndexSchema;

    IndexSchema schema = IndexSchema.fromYaml("schemas/schema.yaml");

    Or load directly from a Java Map:

    Map<String, Object> schemaMap = Map.of(
        "index", Map.of(
            "name", "user-idx",
            "prefix", "user",
            "storage_type", "json"
        ),
        "fields", List.of(
            Map.of("name", "user", "type", "tag"),
            Map.of("name", "credit_score", "type", "tag"),
            Map.of(
                "name", "embedding",
                "type", "vector",
                "attrs", Map.of(
                    "algorithm", "flat",
                    "datatype", "float32",
                    "dims", 4,
                    "distance_metric", "cosine"
                )
            )
        )
    );
    
    IndexSchema schema = IndexSchema.fromMap(schemaMap);
  2. Create a SearchIndex instance from your schema to perform admin and search operations on your index in Redis:

    import redis.clients.jedis.UnifiedJedis;
    import com.redis.vl.index.SearchIndex;
    
    // Define the index
    UnifiedJedis jedis = new UnifiedJedis("redis://localhost:6379");
    SearchIndex index = new SearchIndex(schema, jedis);
    
    // Create the index in Redis
    index.create(true); // overwrite if exists
  3. Load and fetch data to/from your Redis instance:

    Map<String, Object> data = Map.of(
        "user", "john",
        "credit_score", "high",
        "embedding", new float[]{0.23f, 0.49f, -0.18f, 0.95f}
    );
    
    // load list of maps, specify the "id" field
    index.load(List.of(data), "user");
    
    // fetch by "id"
    Map<String, Object> john = index.fetch("john");

🔍 Retrieval

Define queries and perform advanced searches over your indices, combining vectors, metadata filters, and more.

  • VectorQuery - Flexible vector queries with customizable filters enabling semantic search:

    import com.redis.vl.query.VectorQuery;
    
    VectorQuery query = VectorQuery.builder()
        .vector(new float[]{0.16f, -0.34f, 0.98f, 0.23f})
        .field("embedding")
        .numResults(3)
        .build();
    
    // run the vector search query against the embedding field
    List<Map<String, Object>> results = index.query(query);

    Incorporate complex metadata filters on your queries:

    import com.redis.vl.query.filter.FilterQuery;
    
    // define a tag match filter
    FilterQuery tagFilter = FilterQuery.tag("user", "john");
    
    // update query definition
    VectorQuery filteredQuery = query.toBuilder()
        .withPreFilter(tagFilter.build())
        .build();
    
    // execute query
    List<Map<String, Object>> results = index.query(filteredQuery);
  • HybridQuery - Combines text and vector search with weighted scoring:

    import com.redis.vl.query.HybridQuery;
    import com.redis.vl.query.Filter;
    
    // Hybrid search: text + vector with alpha weighting
    HybridQuery hybridQuery = HybridQuery.builder()
        .text("machine learning algorithms")
        .textFieldName("description")
        .vector(queryVector)
        .vectorFieldName("embedding")
        .filterExpression(Filter.tag("category", "AI"))
        .alpha(0.7f)  // 70% vector, 30% text
        .numResults(10)
        .build();
    
    List<Map<String, Object>> results = index.query(hybridQuery);
    // Results scored by: alpha * vector_similarity + (1-alpha) * text_score
  • VectorRangeQuery - Vector search within a defined distance range, paired with customizable filters (see the sketch after this list)

  • FilterQuery - Standard search using filters and full-text search

  • CountQuery - Count the number of indexed records matching given attributes
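
A hedged sketch of a range query, assuming a builder API analogous to VectorQuery above (the class and method names here are illustrative, not confirmed against the released API):

import com.redis.vl.query.VectorRangeQuery;

// Return every result within a distance threshold rather than a fixed top-K
VectorRangeQuery rangeQuery = VectorRangeQuery.builder()
    .vector(new float[]{0.16f, -0.34f, 0.98f, 0.23f})
    .field("embedding")
    .distanceThreshold(0.2f) // hypothetical parameter: max allowed cosine distance
    .build();

List<Map<String, Object>> nearby = index.query(rangeQuery);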

Read more about building advanced Redis queries.

🔧 Utilities

Vectorizers

Integrate with popular embedding providers to greatly simplify the process of vectorizing unstructured data for your index and queries:

  • LangChain4J - Integration with LangChain4J embedding models

    • OpenAI
    • Cohere
    • HuggingFace
    • Ollama
    • Vertex AI
    • Azure OpenAI
    • Mistral AI
    • Voyage AI
  • ONNX Models - Local embedding models via ONNX Runtime

    • Sentence Transformers (all-MiniLM-L6-v2, etc.)
    • Custom ONNX models
  • DJL (Deep Java Library) - Face recognition and embeddings

import com.redis.vl.utils.vectorize.LangChain4JVectorizer;
import dev.langchain4j.model.cohere.CohereEmbeddingModel;

// Using LangChain4J with Cohere
CohereEmbeddingModel cohereModel = CohereEmbeddingModel.withApiKey(apiKey);
LangChain4JVectorizer vectorizer = new LangChain4JVectorizer(cohereModel);

float[] embedding = vectorizer.embed("What is the capital city of France?");

List<float[]> embeddings = vectorizer.embedBatch(
    List.of(
        "my document chunk content",
        "my other document chunk content"
    )
);

Alternatively, run embeddings locally via ONNX Runtime:

import com.redis.vl.utils.vectorize.SentenceTransformersVectorizer;

// Using local ONNX model
SentenceTransformersVectorizer vectorizer = new SentenceTransformersVectorizer("sentence-transformers/all-mpnet-base-v2");

float[] embedding = vectorizer.embed("What is the capital city of France?");

Learn more about using vectorizers in your embedding workflows.

Rerankers

Integrate with popular reranking providers to improve the relevancy of the initial search results from Redis:

  • Cohere Reranker - Using Cohere's reranking API
  • HuggingFace Cross-Encoder - Local reranking with cross-encoder models

import com.redis.vl.utils.rerank.CohereReranker;

CohereReranker reranker = new CohereReranker(apiKey);

List<Map<String, Object>> reranked = reranker.rerank(
    "What is the capital of France?",
    results,
    topK
);
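
For local reranking, a hedged sketch assuming a cross-encoder wrapper with an interface analogous to CohereReranker (the class name below is illustrative, not confirmed):

import com.redis.vl.utils.rerank.HuggingFaceCrossEncoderReranker; // hypothetical class name

// Rerank locally with a cross-encoder model; no API key required
HuggingFaceCrossEncoderReranker localReranker =
    new HuggingFaceCrossEncoderReranker("cross-encoder/ms-marco-MiniLM-L-6-v2");

List<Map<String, Object>> reranked = localReranker.rerank(
    "What is the capital of France?",
    results,
    topK
);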

💫 Extensions

RedisVL ships with extensions that implement best practices and design patterns for working with LLM memory and agents.

LLM Semantic Caching

Increase application throughput and reduce the cost of using LLM models in production by leveraging previously generated knowledge with the SemanticCache.

import com.redis.vl.extensions.cache.llm.SemanticCache;
import com.redis.vl.extensions.cache.llm.CacheConfiguration;
import com.redis.vl.extensions.cache.llm.CacheResult; // package assumed to match SemanticCache

// init cache with TTL and semantic distance threshold
CacheConfiguration config = CacheConfiguration.builder()
    .name("llmcache")
    .ttl(360L)
    .distanceThreshold(0.1)
    .build();

SemanticCache cache = new SemanticCache(config, jedis);

// store user queries and LLM responses in the semantic cache
cache.store(
    "What is the capital city of France?",
    "Paris"
);

// quickly check the cache with a slightly different prompt (before invoking an LLM)
CacheResult result = cache.check("What is France's capital city?");
if (result.isHit()) {
    System.out.println(result.getResponse());
    // Output: Paris
}

Learn more about semantic caching for LLMs.

LLM Semantic Routing

Build fast decision models that run directly in Redis and route user queries to the nearest "route" or "topic".

import com.redis.vl.extensions.router.SemanticRouter;
import com.redis.vl.extensions.router.Route;
import com.redis.vl.extensions.router.RouteMatch; // package assumed to match SemanticRouter
import com.redis.vl.utils.vectorize.SentenceTransformersVectorizer;

SentenceTransformersVectorizer vectorizer = new SentenceTransformersVectorizer("sentence-transformers/all-mpnet-base-v2");

List<Route> routes = List.of(
    Route.builder()
        .name("greeting")
        .references(List.of("hello", "hi"))
        .metadata(Map.of("type", "greeting"))
        .distanceThreshold(0.3)
        .build(),
    Route.builder()
        .name("farewell")
        .references(List.of("bye", "goodbye"))
        .metadata(Map.of("type", "farewell"))
        .distanceThreshold(0.3)
        .build()
);

// build semantic router from routes
SemanticRouter router = SemanticRouter.builder()
    .name("topic-router")
    .vectorizer(vectorizer)
    .routes(routes)
    .jedis(jedis)
    .build();

RouteMatch match = router.route("Hi, good morning");
System.out.println(match.getName()); // Output: greeting
System.out.println(match.getDistance()); // Output: 0.273891836405

Learn more about semantic routing.

🧪 Experimental: VCR Test System

RedisVL includes an experimental VCR (Video Cassette Recorder) test system for recording and replaying LLM/embedding API calls. This enables:

  • Deterministic tests - Replay recorded responses for consistent results
  • Cost reduction - Avoid repeated API calls during test runs
  • Speed improvement - Local Redis playback is faster than API calls
  • Offline testing - Run tests without network access or API keys

Quick Start with JUnit 5

The simplest way to use VCR is with the declarative annotations:

import com.redis.vl.test.vcr.VCRMode;
import com.redis.vl.test.vcr.VCRModel;
import com.redis.vl.test.vcr.VCRTest;

import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.model.chat.ChatLanguageModel;
import dev.langchain4j.model.embedding.EmbeddingModel;
import dev.langchain4j.model.output.Response;
import org.junit.jupiter.api.Test;

import static org.junit.jupiter.api.Assertions.assertNotNull;

@VCRTest(mode = VCRMode.PLAYBACK_OR_RECORD)
class MyLLMTest {

    // Models are automatically wrapped by VCR
    @VCRModel(modelName = "text-embedding-3-small")
    private EmbeddingModel embeddingModel = createEmbeddingModel();

    @VCRModel
    private ChatLanguageModel chatModel = createChatModel();

    @Test
    void testEmbedding() {
        // First run: Records API response to Redis
        // Subsequent runs: Replays from Redis cassette
        Response<Embedding> response = embeddingModel.embed("What is Redis?");
        assertNotNull(response.content());
    }

    @Test
    void testChat() {
        String response = chatModel.generate("Explain Redis in one sentence.");
        assertNotNull(response);
    }
}

VCR Modes

Mode                 Description                                         API Key Required
PLAYBACK             Only use recorded cassettes; fails if missing.      No
PLAYBACK_OR_RECORD   Use cassette if available, record if not.           Only for first run
RECORD               Always call the real API and record the response.   Yes
OFF                  Bypass VCR; always call the real API.               Yes

Environment Variable Override

Override the VCR mode at runtime without changing code:

# Record new cassettes
VCR_MODE=RECORD OPENAI_API_KEY=your-key ./gradlew test

# Playback only (CI/CD, no API key needed)
VCR_MODE=PLAYBACK ./gradlew test

LangChain4J Integration

import com.redis.vl.test.vcr.VCREmbeddingModel;
import com.redis.vl.test.vcr.VCRChatModel;
import com.redis.vl.test.vcr.VCRMode;

// Wrap any LangChain4J EmbeddingModel
VCREmbeddingModel vcrEmbedding = new VCREmbeddingModel(openAiEmbeddingModel);
vcrEmbedding.setMode(VCRMode.PLAYBACK_OR_RECORD);
Response<Embedding> response = vcrEmbedding.embed("What is Redis?");

// Wrap any LangChain4J ChatLanguageModel
VCRChatModel vcrChat = new VCRChatModel(openAiChatModel);
vcrChat.setMode(VCRMode.PLAYBACK_OR_RECORD);
String response = vcrChat.generate("What is Redis?");

Spring AI Integration

import com.redis.vl.test.vcr.VCRSpringAIEmbeddingModel;
import com.redis.vl.test.vcr.VCRSpringAIChatModel;
import com.redis.vl.test.vcr.VCRMode;

// Wrap any Spring AI EmbeddingModel
VCRSpringAIEmbeddingModel vcrEmbedding = new VCRSpringAIEmbeddingModel(openAiEmbeddingModel);
vcrEmbedding.setMode(VCRMode.PLAYBACK_OR_RECORD);
EmbeddingResponse response = vcrEmbedding.embedForResponse(List.of("What is Redis?"));

// Wrap any Spring AI ChatModel
VCRSpringAIChatModel vcrChat = new VCRSpringAIChatModel(openAiChatModel);
vcrChat.setMode(VCRMode.PLAYBACK_OR_RECORD);
String response = vcrChat.call("What is Redis?");

How It Works

  1. Container Management: VCR starts a Redis Stack container with persistence
  2. Model Wrapping: @VCRModel fields are wrapped with VCR proxies
  3. Cassette Storage: Responses stored as Redis JSON documents
  4. Persistence: Data saved to src/test/resources/vcr-data/ via Redis AOF/RDB
  5. Playback: Subsequent runs load cassettes from persistent storage

Demo Projects

Complete working examples are available in the demos directory.

Run the demos without an API key (uses pre-recorded cassettes):

./gradlew :demos:langchain4j-vcr:test
./gradlew :demos:spring-ai-vcr:test

Learn more about VCR testing.

🚀 Why RedisVL?

In the age of GenAI, vector databases and LLMs are transforming information retrieval systems. With emerging and popular frameworks like LangChain4J and Spring AI, innovation is rapid. Yet, many organizations face the challenge of delivering AI solutions quickly and at scale.

Enter Redis – a cornerstone of the NoSQL world, renowned for its versatile data structures and processing engines. Redis excels in real-time workloads like caching, session management, and search. It's also a powerhouse as a vector database for RAG, an LLM cache, and a chat session memory store for conversational AI.

The Redis Vector Library bridges the gap between the AI-native developer ecosystem and Redis's robust capabilities. With a lightweight, elegant, and intuitive interface, RedisVL makes it easy to leverage Redis's power. Built on the Jedis client, RedisVL transforms Redis's features into a grammar perfectly aligned with the needs of today's AI/ML Engineers and Data Scientists.

📚 Examples and Notebooks

Check out the notebooks directory for interactive Jupyter notebook examples.

Running the Notebooks

The notebooks use Java kernel support via JJava. To run them:

  1. Start the notebook environment:

    cd notebooks
    docker compose up -d
  2. Access Jupyter Lab: Navigate to http://localhost:8888/

  3. Stop when done:

    docker compose down

See the notebooks README for more details.

😁 Helpful Links

For additional help, see the Documentation and Recipes linked above.

🫱🏼‍🫲🏽 Contributing

Please help us by contributing PRs, opening GitHub issues for bugs or new feature ideas, improving documentation, or increasing test coverage. Read more about how to contribute!

Requirements

  • Java 17+
  • Redis Stack 7.2+ or Redis with the RediSearch module

License

This project is licensed under the MIT License - see the LICENSE file for details.

🚧 Maintenance

This project is supported by Redis, Inc. on a good-faith effort basis. To report bugs, request features, or receive assistance, please file an issue.