Product DocsMenu

What Are the Conversion Phases?

The following lists the different phases of the conversion process:

  1. Crawling: In this phase, the document will be fetched using a connector (file, Exchange, SharePoint, etc.). Some metadata will already be made available by the connector, including the URI, modification date and much more.

  2. Global Preconversion script (optional phase): If a global preconversion script has been set, this script will be executed in this phase. Common script tasks: Read document content, read and modify metadata, update document permissions, etc.

  3. Source Preconversion script (optional phase): If a preconversion script has been set on the document source, this script will be executed in this phase. Common script tasks: Read document content, read and modify metadata, update document permission, etc.

  4. Open Converter (optional phase): If the Document Type has been set to convert the document using an open converter, this script will be executed in this phase. Common script tasks: Convert the document from an unsupported format to a CES native format, read and modify metadata, update document permission, etc.

  5. Native Converter: In this phase, the document content and properties will be extracted using one of the native CES converters.

  6. Document Text Analysis: Many proprietary analysis technologies will be used to generate a document summary and key concepts, perform a language and title detection, and much more.

  7. Source Postconversion script (optional phase): If a postconversion script has been set on the document source, this script will be executed in this phase. Common script tasks: Update the document extracted text and HTML, read and modify metadata, reject documents, etc.

  8. Global Postconversion script (optional phase): If a global postconversion script has been set, this script will be executed in this phase. Common script tasks: Update the document extracted text and HTML, read and modify metadata, update document permission, etc.

  9. Index the document. This phase will put the document extracted content and properties into the Coveo unified index to make it available for user queries.

 

People who viewed this topic also viewed