Duplicate Content - SES San Jose 2007

By chuckaikens · Wednesday, August 22nd, 2007

Here are a few good notes from Shari Thurow from Omni Interactive that were given in the Duplicate Content session, :

Linkage properties of a particular site and page help the search engines detect unique content that is distributed via syndication.

Content Evolution looks at how often the pages change, a low average page mutation is expected on an article pages.

Host Name finds out how many host names point to a single IP.

Shingle Comparison looks at every document has a unique signature measured by shingles, or groups of words.

You can use robots exclusion to exclude pages that will create duplicate content in the search engine. This is important because you want to show the best content and have fewer, higher quality pages.

A couple notes about content: Use analytics to find what is working and feed this content to the search engines. Use Copyscape and Archive.org to watch for copyright infringment and report to search engines.

If you have multiple domains that are serving the same content. Choose one brand domain and 301 redirect to the front page or a relevant sub-page on your brand domain.

If you use a subdomain, make sure the pages can only be accessed on one of the subdomains. Be sure to redirect your non-www to www if you can.

Topics: Research Engine · Tags:
 

Leave a Comment