Function reference

Vectorisers

Utilities to project text into numerical representation in a semantic vector space

vectorisers This module provides classes for creating and utilizing embedding models.
vectorisers.base This module provides classes for creating and utilizing embedding models from different services.
vectorisers.base.VectoriserBase Abstract base class for all vectorisers.
vectorisers.base.VectoriserBase.transform Transforms input text(s) into embeddings.
vectorisers.huggingface.HuggingFaceVectoriser A general wrapper class for Huggingface Transformers models to generate text embeddings.
vectorisers.ollama.OllamaVectoriser A wrapper class allowing a locally-running ollama server to generate text embeddings.
vectorisers.gcp.GcpVectoriser A class for embedding text using Google Cloud Platform’s GenAI API.

Indexers

Creation of Vector Stores for efficient similarity search and retrieval

indexers This module provides functionality for creating a VectorStore from a CSV (text) file.
indexers.main This module provides functionality for creating a VectorStore from a CSV (text) file.
indexers.VectorStore A class to model and create VectorStore objects for building and searching vector databases from CSV text files.
indexers.dataclasses This module defines dataclasses for structuring and validating input and output data for
indexers.dataclasses.VectorStoreSearchInput DataFrame-like object for forming and validating search query input data.
indexers.dataclasses.VectorStoreSearchOutput DataFrame-like object for storing and validating search results with rankings
indexers.dataclasses.VectorStoreReverseSearchInput DataFrame-like object for forming and validating reverse search query input data.
indexers.dataclasses.VectorStoreReverseSearchOutput DataFrame-like object for storing reverse search results.
indexers.dataclasses.VectorStoreEmbedInput DataFrame-like object for forming and validating text data to be embedded.
indexers.dataclasses.VectorStoreEmbedOutput DataFrame-like object for storing and validating embedded vectors and associated metadata.
indexers.VectorStore.embed Converts text (provided via a VectorStoreEmbedInput object) into vector embeddings using the Vectoriser.
indexers.VectorStore.search Searches the VectorStore using queries from a VectorStoreSearchInput object.
indexers.VectorStore.reverse_search Reverse searches the VectorStore using a VectorStoreReverseSearchInput object.
indexers.VectorStore.from_filespace Creates a VectorStore instance from stored metadata and Parquet files.

Hooks

Creation of Pre and post Processing hooks for improved search and RAG

indexers.hooks This module contains a factory for, and example pre-built hooks. Allowing users to manipulate the content of ClassifAI dataclass objects as they enter or leave the VectorStore.
indexers.hooks.hook_factory
indexers.hooks.default_hooks Submodule containing the prebuilt hooks for the service.
indexers.hooks.default_hooks.preprocessing
indexers.hooks.default_hooks.postprocessing
indexers.hooks.hook_factory.HookBase Abstract base class for all post-processing hooks requiring customisation.
indexers.hooks.default_hooks.preprocessing.CapitalisationStandardisingHook A pre-processing hook to handle upper-/lower-/sentence-/title-casing.
indexers.hooks.default_hooks.postprocessing.DeduplicationHook A post-processing hook to remove duplicate knowledgebase entries, i.e. entries with the same label.
indexers.hooks.default_hooks.postprocessing.RagHook A post-processing hook to perform Retrieval Augmented Generation.

Servers

Expose ClassifAI functionality via Fast-API endpoints

servers This module provides functionality for creating or extending a REST-API service.
servers.main This module provides functionality for creating a start a restAPI service.
servers.get_router Create and return a FastAPI.APIRouter with search endpoints.
servers.get_server Create and return a FastAPI server with search endpoints.
servers.run_server Create and run a FastAPI server with search endpoints.
servers.make_endpoints Create and register the different endpoints to your app.