The Tika Server Converter extension is used by search index managers to call an Apache Tika Server, which extracts content from documents and other file formats.
You need an Apache Tika Server to use the Tika Server Converter extension.
Nucleus has a simple built-in content converter that can extract text from HTML, Markdown and PDF content. If you don't need to extract content from other formats, you can use the built-in content converter, which requires no configuration, and you don't need the Tika Server Converter.
Depending on the built-in functionality of each search service or application, a search index manager may need to extract content from documents or other content. The most common requirement is to extract content in plain text format. For example, if you host PDF documents on your website, your search index manager may need to extract plain text from them in order to create index entries, including vectors (embeddings). Tika can extract content from a large number of common formats.
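
To illustrate what plain text extraction looks like at the HTTP level, the following is a minimal Python sketch of a call to Tika Server's `/tika` resource. It is not the code used by the Tika Server Converter extension. The endpoint `http://localhost:9998`, the function names and the choice of `application/octet-stream` as the content type are assumptions made for this example.

```python
import urllib.request

# Assumption: a Tika Server listening locally on its usual port.
TIKA_ENDPOINT = "http://localhost:9998"


def build_extract_request(endpoint: str, document: bytes) -> urllib.request.Request:
    """Build a PUT request that asks Tika Server for plain-text output."""
    return urllib.request.Request(
        url=endpoint.rstrip("/") + "/tika",
        data=document,
        method="PUT",
        headers={
            # Ask for the extracted content as plain text.
            "Accept": "text/plain",
            # Assumption: send a generic type and let Tika detect the format.
            "Content-Type": "application/octet-stream",
        },
    )


def extract_text(endpoint: str, document: bytes) -> str:
    """Send the document bytes to Tika Server and return the extracted text."""
    request = build_extract_request(endpoint, document)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")


# Example usage (requires a running Tika Server and a local file):
#   with open("example.pdf", "rb") as f:
#       print(extract_text(TIKA_ENDPOINT, f.read()))
```

The request is built separately from the network call so that the URL and headers can be inspected without a server being available.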
To configure the Tika Server Converter, log on as a system administrator or site administrator, click "Manage" and then click "Tika Server Converter".
Tika Server Endpoint | Enter the endpoint (URL) for your Tika server, including the port, which is normally 9998. |
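
Before saving the endpoint, it can be useful to confirm that a Tika Server is reachable at that URL. The sketch below queries Tika Server's `/version` resource, which returns a short version string. The function names and the example URL `http://localhost:9998` are assumptions for illustration, and this check is not part of the extension itself.

```python
import urllib.request


def version_url(endpoint: str) -> str:
    """Return the URL of the Tika Server version resource for an endpoint."""
    return endpoint.rstrip("/") + "/version"


def get_tika_version(endpoint: str) -> str:
    """Request the version string from a running Tika Server."""
    with urllib.request.urlopen(version_url(endpoint), timeout=10) as response:
        return response.read().decode("utf-8").strip()


# Example usage (requires a running Tika Server):
#   print(get_tika_version("http://localhost:9998"))
```

If the call raises a connection error, the host name or port in the endpoint is probably wrong, or the server is not running.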