What Is the Difference Between ANSI and UTF-8 URI Formats?
By default, the Web Pages connector expects that addresses are in the ANSI format, but you can select the Use UTF-8 addresses option for a given source (see Modifying Advanced Source Parameters).
Note: When the Web Pages or the SharePoint connectors download web content, they expect documents with addresses encoded in UTF-8.