Sitemap Connector
![]() |
Sitemap Connector Guide |
Connector Features Summary
Features | Supported | Additional information | |
---|---|---|---|
Sitemap version | XML, Text, RSS 2.0, and Atom 1.0 | Sitemap files and Sitemap index file must respect the Sitemap protocol (validations can however be turned off by a parameter) | |
Searchable content type |
![]() |
Web pages (URL) |
|
Content update | Incremental refresh |
![]() |
|
Full refresh |
![]() |
||
Rebuild |
![]() |
||
Document-level security |
![]() |
Permissions must be manually defined on the source [more] |
Features
The features of the Sitemap connector are:
-
Content indexing
-
The connector can retrieve and index exclusively web pages from Sitemaps:
-
Supported Sitemap file formats
-
The connector can retrieve web pages from the following Sitemap file formats (see Sitemap protocol):
-
XML (Sitemap and index)
Note: CES 7.0.7814+ (August 2015) Support sitemap files containing custom metadata (see Adding and Indexing Custom Metadata in an XML Sitemap).
-
Text
-
Syndication Feeds (Atom 1.0 and RSS 2.0)
-
-
Supported authentication schemes
-
The connector can authenticate with the following authentication schemes:
-
Basic
-
Digest
-
NTLM
-
Negotiate/Kerberos
-
Form-based CES 7.0.7914+ (October 2015)
-
-
Incremental refresh
-
Periodically queries your Sitemap for the latest items modifications (addition, edition), keeping the index content up-to-date.
Notes:
-
The Sitemap must define the optional Last Modification Date attribute (e.g., <lastmod> for XML Sitemaps, <updated> for Atom Sitemaps, <pubDate> for RSS Sitemaps) for each URL. If not, you need to perform a source full refresh to catch changes. Text Sitemaps do not contain this attribute.
-
Deleted web pages require a full refresh to be taken in account.
-
-
Pause/Resume
-
When indexing Sitemap content, the connector can be paused and resumed.