Using django_haystack_opensearch

This guide covers how to use django-haystack-opensearch to add search functionality to your Django applications using OpenSearch.

Basic Search

The most basic search operation is to query the index for documents matching a search term:

from haystack.query import SearchQuerySet

# Simple text search
results = SearchQuerySet().filter(content="django")

# Auto-query (automatically parses the query string)
results = SearchQuerySet().auto_query("django haystack")

# Get all results
all_results = SearchQuerySet().all()

# Iterate over results
for result in results:
    print(result.object)  # The Django model instance
    print(result.score)   # Relevance score

Filtering

You can filter search results using various filter types:

Content Filtering

Filter on the main content field:

# Filter by content
results = SearchQuerySet().filter(content="python")

Other Filter Types

# Contains (wildcard search)
results = SearchQuerySet().filter(content__contains="django")

# Starts with
results = SearchQuerySet().filter(content__startswith="django")

# Ends with
results = SearchQuerySet().filter(content__endswith="search")

# Exact match (for non-facet fields)
results = SearchQuerySet().filter(content__exact="django haystack")

# Greater than / Less than (for numeric/date fields)
results = SearchQuerySet().filter(created__gte="2023-01-01")
results = SearchQuerySet().filter(price__lt=100)

# In (multiple values)
results = SearchQuerySet().filter(id__in=[1, 2, 3])

# Range
results = SearchQuerySet().filter(price__range=[10, 50])
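
When filter values come from optional form inputs, it can help to assemble the lookups in one place before passing them to filter(). The following is a minimal sketch, not part of the library: build_filters and its parameter names are hypothetical, and the price, created and id fields are the same illustrative fields used above.

```python
def build_filters(min_price=None, max_price=None, created_since=None, ids=None):
    """Translate optional form values into Haystack-style lookup kwargs.

    Hypothetical helper: the field names (price, created, id) mirror the
    examples in this guide and must exist on your own SearchIndex.
    """
    filters = {}
    if min_price is not None and max_price is not None:
        # Both bounds given: use a single range lookup
        filters["price__range"] = [min_price, max_price]
    elif min_price is not None:
        filters["price__gte"] = min_price
    elif max_price is not None:
        filters["price__lte"] = max_price
    if created_since is not None:
        filters["created__gte"] = created_since
    if ids:
        filters["id__in"] = list(ids)
    return filters


filters = build_filters(min_price=10, max_price=50, created_since="2023-01-01")
```

The resulting dictionary can then be unpacked into a query, for example SearchQuerySet().filter(**filters). Lookups passed together this way are combined with AND.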

Faceting

Faceting allows you to get counts of documents grouped by field values. This is useful for building filter interfaces.
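
Facets are requested with facet() and read back with facet_counts(), as in the Combining Features example later in this guide. The sketch below shows one way to turn those counts into an ordered list of filter options. It assumes the result follows the shape Haystack documents for facet_counts(): a dictionary whose "fields" key maps each field name to a list of (value, count) tuples. The facet_options helper and the sample data are hypothetical.

```python
def facet_options(facet_counts, field):
    """Return [(value, count), ...] for one facet field, most frequent first."""
    if not facet_counts:
        # facet_counts() can be empty when no facets were requested
        return []
    pairs = facet_counts.get("fields", {}).get(field, [])
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


# Hand-written sample in the assumed facet_counts() shape (not real output)
sample_counts = {
    "fields": {"category": [("python", 3), ("django", 7)]},
    "dates": {},
    "queries": {},
}

options = facet_options(sample_counts, "category")
for value, count in options:
    print(f"{value} ({count})")
```

In a view you would pass sqs.facet_counts() instead of the sample dictionary and render each option as a link or checkbox.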

Highlighting

Highlight search terms in results:

from haystack.query import SearchQuerySet

sqs = SearchQuerySet().filter(content="django").highlight()
results = list(sqs)

for result in results:
    if hasattr(result, "highlighted"):
        print(result.highlighted)  # HTML with <em> tags around matches

Spelling Suggestions

Get spelling suggestions for queries:

from haystack.query import SearchQuerySet

sqs = SearchQuerySet().filter(content="djangoo")  # Misspelled
results = list(sqs)

# Get spelling suggestion
suggestion = sqs.spelling_suggestion()
if suggestion:
    print(f"Did you mean: {suggestion}")
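
A suggestion is only worth showing when it differs from what the user typed. The following is a small sketch of that check. The helper name suggestion_for is hypothetical, and it relies only on the spelling_suggestion() call shown above, so any object that provides that method will work.

```python
def suggestion_for(sqs, query_text):
    """Return a "did you mean" string, or None when there is nothing useful to show."""
    suggestion = sqs.spelling_suggestion()
    if not suggestion:
        return None
    # Ignore suggestions that only differ by case or surrounding whitespace
    if suggestion.strip().lower() == query_text.strip().lower():
        return None
    return suggestion
```

Call it as suggestion_for(sqs, "djangoo") after building the query, and render the returned value only when it is not None.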

How it Works

By default, the OpenSearch backend uses the main document field (the one with document=True) to generate spelling suggestions.

For better results, you can provide a dedicated spelling field in your SearchIndex. This is useful if you want to use a different analyzer for spelling than for your main content (e.g., a non-stemmed analyzer).

To use a dedicated spelling field, simply name it _spelling:

from haystack import indexes


class MyIndex(indexes.SearchIndex, indexes.Indexable):
    text = indexes.CharField(document=True, use_template=True)
    # Dedicated spelling field
    _spelling = indexes.CharField(model_attr="my_content_field")

The backend will automatically detect this field and use it for suggestions.

More Like This

Find documents similar to a given document:

from haystack import connections
from myapp.models import Article

# Get a document
article = Article.objects.get(pk=1)

# Find similar documents
backend = connections["default"].get_backend()
similar = backend.more_like_this(article)

similar_results = similar["results"]
for result in similar_results:
    print(result.object)

You can also add additional query constraints:

similar = backend.more_like_this(
    article,
    additional_query_string="category:technology"
)

File Content Extraction

The OpenSearch backend provides a utility method to extract text and metadata from binary files (like PDF, DOCX, etc.) using OpenSearch’s ingest-attachment plugin.

from haystack import connections

backend = connections["default"].get_backend()

with open("document.pdf", "rb") as f:
    result = backend.extract_file_contents(f)

if result:
    print(result["contents"])  # Extracted text
    print(result["metadata"])  # File metadata (author, title, etc.)

This method is particularly useful when you want to index the contents of uploaded files into your search index.

Note: This feature requires the ingest-attachment plugin to be installed and enabled on your OpenSearch node. See OpenSearch Plugins for more details.
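
When indexing uploaded files, you usually want a plain string even if extraction fails. The sketch below wraps the call shown above in a small helper. The name extract_text is hypothetical, and the helper assumes only what the example above shows: extract_file_contents() accepts an open binary file and returns either a falsy value or a dictionary with a "contents" key.

```python
def extract_text(backend, path):
    """Return the extracted text for the file at path, or "" when extraction fails."""
    with open(path, "rb") as f:
        result = backend.extract_file_contents(f)
    if not result:
        # Extraction failed or the plugin returned nothing
        return ""
    return result.get("contents") or ""
```

One possible use is inside a prepare_<fieldname> method on your SearchIndex, passing connections["default"].get_backend() as the backend and the path of the uploaded file.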

Spatial Search

Search by geographic location:

Within a Bounding Box

from haystack.query import SearchQuerySet
from django.contrib.gis.geos import Point

# Define bounding box corners
point1 = Point(-122.5, 37.7)  # Southwest corner
point2 = Point(-122.3, 37.9)  # Northeast corner

# Search within bounding box
sqs = SearchQuerySet().within("location", point1, point2)

Within a Distance

from haystack.query import SearchQuerySet
from django.contrib.gis.geos import Point
from django.contrib.gis.measure import D

# Define center point
center = Point(-122.4, 37.8)

# Search within 10km
sqs = SearchQuerySet().dwithin("location", center, D(km=10))

# Sort by distance
sqs = sqs.distance("location", center).order_by("distance")

for result in sqs:
    print(f"{result.object}: {result.distance}")

Sorting

Sort results by field values:

from haystack.query import SearchQuerySet

# Sort ascending
sqs = SearchQuerySet().order_by("created")

# Sort descending
sqs = SearchQuerySet().order_by("-created")

# Multiple sort fields
sqs = SearchQuerySet().order_by("-score", "created")

Pagination

Paginate search results:

from haystack.query import SearchQuerySet
from django.core.paginator import Paginator

sqs = SearchQuerySet().filter(content="django")

# Using Django's paginator
paginator = Paginator(sqs, 20)  # 20 results per page
page = paginator.page(1)

for result in page:
    print(result.object)

# Or use slicing
page1 = sqs[0:20]
page2 = sqs[20:40]
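
If you paginate by slicing, the slice bounds follow directly from the page number and page size. This is a minimal sketch of that arithmetic. The helper name page_slice is hypothetical, and pages are assumed to be numbered from 1.

```python
def page_slice(page_number, per_page=20):
    """Return (start, end) slice bounds for a 1-based page number."""
    if page_number < 1:
        raise ValueError("page_number must be 1 or greater")
    start = (page_number - 1) * per_page
    return start, start + per_page


start, end = page_slice(2)  # bounds for the second page of 20 results
```

The bounds can be applied to a query as sqs[start:end].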

Model Filtering

Filter results to specific Django models:

from haystack.query import SearchQuerySet
from myapp.models import Article, BlogPost

# Filter to specific models
sqs = SearchQuerySet().models(Article)

# Multiple models
sqs = SearchQuerySet().models(Article, BlogPost)

Stored Fields

Retrieve specific stored fields (non-indexed fields):

from haystack.query import SearchQuerySet

# Fields marked with indexed=False are stored but not searchable
# They can be retrieved directly from search results
sqs = SearchQuerySet().filter(content="django")

for result in sqs:
    # Access stored fields directly as attributes; replace stored_field_name
    # with the name of a field defined on your SearchIndex
    stored_value = result.stored_field_name

Boost

Boost certain fields or queries for relevance:

from haystack.query import SearchQuerySet

# Fields can be boosted in the search index definition
# Higher boost = more relevant

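# You can also boost at query time. Note that boost() takes a search term
# (here the word "title") and a boost factor, not a field name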
sqs = SearchQuerySet().boost("title", 2.0).filter(content="django")

Advanced Usage

Combining Features

You can combine multiple features in a single query:

from haystack.query import SearchQuerySet
from myapp.models import Article

sqs = (
    SearchQuerySet()
    .models(Article)
    .filter(content="django")
    .facet("category")
    .facet("author")
    .highlight()
    .order_by("-score")
)

results = list(sqs)
facets = sqs.facet_counts()

# Process results
for result in results:
    print(f"{result.object.title}: {result.score}")
    if hasattr(result, "highlighted"):
        print(f"  {result.highlighted}")

Using Multiple Connections

If you’ve configured multiple connections, select one by passing its alias with the using argument:

from haystack import connections
from haystack.query import SearchQuerySet

# Use default connection
sqs = SearchQuerySet()

# Use specific connection
sqs = SearchQuerySet(using="readonly")

Indexing

Indexing Your Models

After defining your search indexes (see Quickstart Guide), you need to index your data:

# Rebuild the entire index
python manage.py rebuild_index

# Update the index (incremental)
python manage.py update_index

# Clear the index
python manage.py clear_index

Programmatic Indexing

You can also index programmatically:

from haystack import connections
from myapp.models import Article
from myapp.search_indexes import ArticleIndex

backend = connections["default"].get_backend()
index = ArticleIndex()

# Index a single object
article = Article.objects.get(pk=1)
backend.update(index, [article])

# Index multiple objects
articles = Article.objects.all()
backend.update(index, articles)

# Remove from index
backend.remove(article)
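
For large tables you may prefer to send objects to the backend in smaller batches rather than in one call. The following sketch shows one way to do that. The helper name update_in_batches and the default batch size are assumptions, and it relies only on the backend.update(index, objects) call shown above.

```python
from itertools import islice


def update_in_batches(backend, index, objects, batch_size=500):
    """Send objects to backend.update() in fixed-size batches; return the total sent."""
    iterator = iter(objects)
    total = 0
    while True:
        # Pull at most batch_size objects from the iterator
        batch = list(islice(iterator, batch_size))
        if not batch:
            break
        backend.update(index, batch)
        total += len(batch)
    return total
```

For example, update_in_batches(backend, index, Article.objects.iterator()) streams rows from the database instead of loading the whole queryset first.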
