What Are the Conversion Phases?
-
Crawling: In this phase, the document will be fetched using a connector (file, Exchange, SharePoint, etc.). Some metadata will already be made available by the connector, including the URI, modification date and much more.
-
Global Preconversion script (optional phase): If a global preconversion script has been set, this script will be executed in this phase. Common script tasks: Read document content, read and modify metadata, update document permissions, etc.
-
Source Preconversion script (optional phase): If a preconversion script has been set on the document source, this script will be executed in this phase. Common script tasks: Read document content, read and modify metadata, update document permission, etc.
-
Open Converter (optional phase): If the Document Type has been set to convert the document using an open converter, this script will be executed in this phase. Common script tasks: Convert the document from an unsupported format to a CES native format, read and modify metadata, update document permission, etc.
-
Native Converter: In this phase, the document content and properties will be extracted using one of the native CES converters.
-
Document Text Analysis: Many proprietary analysis technologies will be used to generate a document summary and key concepts, perform a language and title detection, and much more.
-
Source Postconversion script (optional phase): If a postconversion script has been set on the document source, this script will be executed in this phase. Common script tasks: Update the document extracted text and HTML, read and modify metadata, reject documents, etc.
-
Global Postconversion script (optional phase): If a global postconversion script has been set, this script will be executed in this phase. Common script tasks: Update the document extracted text and HTML, read and modify metadata, update document permission, etc.
-
Index the document. This phase will put the document extracted content and properties into the Coveo unified index to make it available for user queries.