Product DocsMenu

Coveo Platform 7.0 >
Administrator Help > Coveo Enterprise Search 7.0 > Administration Tool > Configuration Tab > Converters Menu > Modifying How Documents Written in Unrecognized Languages Are Indexed

Modifying How Documents Written in Unrecognized Languages Are Indexed

When a document is written in a language other than one of the supported languages (see Supported Languages), it is attributed the Unknown language status in the index. By default, CES indexes documents written in unknown languages only if their size is inferior to 4,096 bytes. However, it is possible to modify this behavior.

To modify the indexing of documents written in unrecognized languages

  1. On the Coveo server, access the Administration Tool (see Opening the Administration Tool).

  2. Access the Converter Managers page (Configuration > Converters).

  3. In the navigation panel on the left, click Languages.

  4. In the Languages page:

    1. In the Language Detection section, select one of the following action to apply when the language of the document is registered as Unknown:

      Use indexing failure action set for the document type

      CES handles documents in unknown languages as corrupted documents whose content cannot be indexed—they are either rejected or indexed by reference depending on the Indexing Failure Action selected for this document type (see Modifying How CES Handles a Document Type).

      Reject the document

      CES does not index documents in unknown languages.

      Index

      CES indexes all documents in unknown languages.

      Index if document is smaller than X bytes

      CES indexes documents in unknown languages only if their size is inferior to the specified value in bytes.

    2. Click Apply Changes.

People who viewed this topic also viewed