This is basically an issue with Microsoft Search Server, but the C1 Package for Composite C1 tries to tackle this problem. Quoting from http://docs.composite.net/Composite.Search.MicrosoftSearchServer :
Excluding page content from indexing
Make sure that RemapperHttpModule is specified in the Web.config on your C1 website. If not, do the following:
- Open the Web.config on your C1 website.
- Add the following configuration under
<httpModules>
and <modules>
(for IIS 7.0 or later) elements: <add name="MssRemaper" type="Composite.Search.MicrosoftSearchServer.RemapperHttpModule" />
Now, to exclude a part of a page from being indexed by Microsoft Search Server:
- Open the page in Source Editor.
- Locate the element you want to exclude from indexing.
- Add the class attribute with the "noindex"
value to the element (class="noindex"): e.g.
<div class="noindex">
- Re-index your website.
Ensuring the crawler find your pages
The exclusion of navigation elements will happen on the Composite C1 side, so doing this will remove the Search Server crawlers ability to follow those links. You can solve this by making sure you have a sitemap on your website and make Search Server index the URL to the sitemap page.