Index Method and Retrieval Settings

Choose the indexing method and configure retrieval strategy to optimize search results.

Table of Contents

· [Select the Index Method](#select-the-index-method)

· [Configure the Retrieval Settings](#configure-the-retrieval-settings)

· [Q&A Mode](#qa-mode)

Select the Index Method

ClickAI provides 2 indexing methods:

High Quality

· Calls an Embedding Model to create vectors for each chunk

· Enables semantic search (meaning-based retrieval)

· Pros: Higher accuracy, better contextual understanding

· Cons: Consumes embedding tokens, longer processing time

Economical

· Uses keyword indexing (inverted index) — a standard search engine technique

· No embedding model needed, no token cost

· Pros: Fast, no embedding cost

· Cons: Keyword matching only, no contextual understanding

Criteria

High Quality

Economical

Technology

Embedding vectors + keyword

Keyword indexing (inverted index)

Accuracy

High

Medium

Token cost

Yes (embedding)

No

Processing time

Longer

Faster

Semantic search

Multilingual search

💡 TIP: If you need highly accurate chatbot responses or handle multilingual content, choose **High Quality**. If speed and cost savings are priorities, choose **Economical**.

Configure the Retrieval Settings

The Knowledge Base supports 2 core retrieval techniques:

1. Semantic Retrieval: Based on vector similarity — text chunks and queries are converted into vectors and matched via similarity scoring.

2. Keyword Matching: Using an inverted index (a standard search engine technique).

Weight Settings

This feature enables you to set custom weights for semantic priority and keyword priority:

Semantic Value = 1 (Semantic Search Only)

· Activates only semantic search mode

· Uses embedding models to search deeper — even if exact query terms don't appear in the knowledge base

· Calculates vector distances to return relevant content

· Captures meaning across different languages for accurate cross-language results

Keyword Value = 1 (Keyword Search Only)

· Activates only keyword search mode

· Performs a full-text match against the knowledge base

· Suitable when users know exact terminology

· Consumes fewer resources, ideal for quick searches within large document repositories

Custom Keyword and Semantic Weights

Beyond enabling only semantic or keyword search, ClickAI provides flexible custom weight settings:

· Continuously adjust weights of both methods

· Identify the optimal weight ratio for your business scenario

Rerank Model

· Disabled by default. When enabled, a third-party Rerank model sorts the text chunks returned to optimize results

· Helps the LLM access more precise information, improving output quality

· Before enabling, go to Settings > Model Providers to configure the Rerank model API key

⚠️ IMPORTANT: If the selected embedding model is multimodal, select a multimodal rerank model (marked with a Vision icon) as well. Otherwise, retrieved images will be excluded from reranking.

📝 NOTE: Enabling Rerank will consume tokens from the Rerank model. Refer to the associated model's pricing page for details.

TopK and Score Threshold

Setting

Description

Default

Recommended

Top K

Maximum number of most similar chunks to retrieve

3

3-5

Score Threshold

Minimum similarity score for a chunk to be retrieved

0.5

0.5-0.7

· Higher Top K → retrieves more chunks, but may include less relevant content

· Higher Score Threshold → requires higher similarity, fewer but more accurate chunks returned

Q&A Mode

Q&A Mode is a special option that lets ClickAI automatically analyze documents and generate question-answer pairs. Suitable for:

· FAQ documents

· Customer support documentation

· Documents with clear Q&A structure

📖 Previous: [Chunk Settings](./03-chunk-settings.md) · Next: [Knowledge Pipeline]

Last updated