{"id":126063,"date":"2019-04-30T16:20:44","date_gmt":"2019-04-30T16:20:44","guid":{"rendered":"https:\/\/www.searchenginewatch.com\/?p=126063"},"modified":"2020-02-11T08:56:05","modified_gmt":"2020-02-11T08:56:05","slug":"improve-seo-check-duplicate-content","status":"publish","type":"post","link":"https:\/\/searchenginewatch.com\/2019\/04\/30\/improve-seo-check-duplicate-content\/","title":{"rendered":"How to check for duplicate content to improve your site&#8217;s SEO"},"content":{"rendered":"<p><strong>Publishing original content to your website is, of course, critical for building your audience and boosting your <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/03\/podcast-seo-tips-101\/\">SEO<\/a>.<\/strong><\/p>\n<p>The benefits of unique and original <a href=\"https:\/\/searchenginewatch.com\/2019\/04\/16\/seo-writing-guide-from-keyword-to-content-brief\/\">content<\/a> are twofold:<\/p>\n<ol>\n<li>Original <a href=\"https:\/\/searchenginewatch.com\/2019\/05\/22\/nine-types-of-meta-descriptions-that-win-more-clicks\/\">content<\/a> delivers a superior user experience.<\/li>\n<li>Original <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/03\/podcast-seo-tips-101\/\">content<\/a> helps ensure that search engines aren&#8217;t forced to choose between multiple pages of yours that have the same <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/17\/the-most-common-seo-errors-research-infographics\/\">content<\/a>.<\/li>\n<\/ol>\n<p>However, when <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/16\/delete-your-pages-and-rank-higher-in-search-index-bloat-and-technical-optimization-2019\/\">content<\/a> is duplicated either accidentally or on purpose, search engines will not be duped and may penalize a site with lower search rankings accordingly. Unfortunately, many businesses often publish repeated <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/12\/how-to-get-featured-snippets-no-link-building\/\">content<\/a> without being aware that they\u2019re doing so. This is why <a href=\"https:\/\/searchenginewatch.com\/2019\/04\/15\/how-to-branded-search-audit\/\" target=\"_blank\" rel=\"noopener noreferrer\">auditing your site<\/a> with a <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/16\/delete-your-pages-and-rank-higher-in-search-index-bloat-and-technical-optimization-2019\/\">duplicate content<\/a> checker is so valuable in helping sites to recognize and replace such <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/12\/how-to-get-featured-snippets-no-link-building\/\">content<\/a> as necessary.<\/p>\n<p>This article will help you better understand what is considered duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/25\/google-ads-hacks\/\">content<\/a>, and steps you can take to make sure it doesn&#8217;t hamper your <a href=\"https:\/\/searchenginewatch.com\/2019\/04\/16\/seo-writing-guide-from-keyword-to-content-brief\/\">SEO<\/a> efforts.<\/p>\n<h2>How does Google define &#8220;duplicate content&#8221;?<\/h2>\n<p>Duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/08\/28\/guide-to-keyword-research-content-strategy\/\">content<\/a> is\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/66359?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">described by Google<\/a>\u00a0as <a href=\"https:\/\/searchenginewatch.com\/2019\/08\/30\/marketing-feedback-loop-for-seo\/\">content<\/a> &#8220;within or across domains that either completely matches other <a href=\"https:\/\/searchenginewatch.com\/2019\/08\/26\/pitch-to-top-online-publishers-survey\/\">content<\/a> or are appreciably similar&#8221;. Content fitting this description can be repeated either on more than one page within your site, or across different websites. Common places where this duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/08\/20\/content-marketing-tools-recommended-by-the-best\/\">content<\/a> might be hiding include duplicated copy across <a href=\"https:\/\/searchenginewatch.com\/2019\/09\/11\/landing-page-copy-tips\/\">landing pages<\/a> or blog posts, or harder-to-detect areas such as meta descriptions that are repeated in a webpage&#8217;s code. <a href=\"https:\/\/searchenginewatch.com\/2017\/10\/18\/duplicate-content-faq-what-is-it-and-how-should-you-deal-with-it\/\" target=\"_blank\" rel=\"noopener noreferrer\">Duplicate content<\/a> can be produced erroneously in a number of ways, from simply reposting existing <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/24\/create-seo-friendly-faq-pages\/\">content<\/a> by mistake to allowing the same page <a href=\"https:\/\/searchenginewatch.com\/2020\/01\/07\/five-simple-content-marketing-trends-for-2020\/\">content<\/a> to be accessible via multiple URLs.<\/p>\n<p>When visitors come to your page and begin reading what seems to be newly posted content only to realize they\u2019ve read it before, that experience can reduce their trust in your site and likeliness that they\u2019ll seek out your content in the future. Search engines have an equally confusing experience when faced with multiple pages with similar or identical content and often respond to the challenge by assigning lower search rankings across the board.<\/p>\n<p>At the same time, there are sites that intentionally duplicate content for malicious purposes, scraping content from other sites that don\u2019t belong to them or duplicating content known to deliver successful <a href=\"https:\/\/searchenginewatch.com\/2019\/05\/22\/nine-types-of-meta-descriptions-that-win-more-clicks\/\">SEO<\/a> in an attempt to game search engine algorithms. However, most commonly, duplicated content is simply published by mistake. There are also scenarios where republishing existing content is acceptable, such as guest blogs, syndicated content, intentional variations on the copy, and more. These techniques should only be used in tandem with best practices that help search engines understand that this content is being republished on purpose (described below).<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"fr-fic fr-dib fr-draggable aligncenter\" src=\"https:\/\/s3.amazonaws.com\/clearvoice-media\/asg_o9JWmFLRfU0s8A8U%2Fart_kdPOLhnoHQbh6ekG%2F1555023606015-1.png\" alt=\"SEO audit report that helps spot and rectify duplicate content\" width=\"974\" height=\"754\" \/><\/p>\n<p><em>Source: Alexa.com <a href=\"https:\/\/searchenginewatch.com\/2019\/05\/20\/seven-reasons-why-your-rankings-dropped-and-how-to-fix-them\/\">SEO<\/a> Audit<\/em><\/p>\n<p>An automated duplicate content checker tool can quickly and easily help you determine where such content exists on your site, even if hidden in the\u00a0site code. Such <a href=\"https:\/\/sewprod.wpenginepowered.com\/2019\/07\/15\/keyword-research-tools-free\/\">tools<\/a> should display each URL and meta description containing duplicate content so that you can methodically perform the work of addressing these issues. While the most obvious practice is to either remove repeated content or add original copy as a replacement, there are several other approaches you might find valuable.<\/p>\n<h2>How to check for duplicate content<\/h2>\n<h3>1. Using\u00a0the\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/139066?hl=en#rel-canonical-link-method\" target=\"_blank\" rel=\"noopener noreferrer\">rel=canonical &lt;link&gt; tag<\/a><\/h3>\n<p>These tags can tell search engines which specific URL should be viewed as the master copy of a page, thus solving any duplicate content confusion from the search engines\u2019 standpoint.<\/p>\n<h3>2. Using 301 redirects<\/h3>\n<p>These offer a simple and search engine-friendly method of sending visitors to the correct URL when a duplicate page needs to be removed.<\/p>\n<h3>3. Using the &#8220;<a href=\"https:\/\/searchenginewatch.com\/2019\/07\/17\/the-most-common-seo-errors-research-infographics\/\">noindex<\/a>&#8221; meta tags<\/h3>\n<p>These will simply tell search engines not to index pages, which can be advantageous in certain circumstances.<\/p>\n<h3>4. Using\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/6080550?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">Google\u2019s URL Parameters tool<\/a><\/h3>\n<p>This tool helps you tell Google not to crawl pages with specific parameters. This might be a good solution if your site uses parameters as a way to deliver content to the visitor that is mostly the same content with minor changes (i.e. headline changes, color changes, etc). This tool makes it simple to let Google know that your <a href=\"https:\/\/searchenginewatch.com\/sew\/news\/2319706\/googles-matt-cutts-a-little-duplicate-content-wont-hurt-your-rankings\" target=\"_blank\" rel=\"noopener noreferrer\">duplicated content is intentional and should not be considered for SEO purposes.<\/a><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"fr-fic fr-dib fr-draggable aligncenter\" src=\"https:\/\/s3.amazonaws.com\/clearvoice-media\/asg_o9JWmFLRfU0s8A8U%2Fart_kdPOLhnoHQbh6ekG%2F1555023352481-2.png\" alt=\"Example of resolving duplication of meta tag descriptions\" width=\"995\" height=\"714\" \/><\/p>\n<p><em>Source:\u00a0Alexa.com <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/14\/five-backlink-analysis-tools\/\">SEO<\/a> Audit<\/em><\/p>\n<p>By actively checking your site for duplicated content and addressing any issues satisfactorily, you can improve not only the search rankings of your site\u2019s pages but also make sure that your site visitors are directed to fresh content that keeps them coming back for more.<\/p>\n<p>Got any effective tips of how you deal with on-site content duplication?\u00a0Share them in the comments.<\/p>\n<p><em>Kim Kosaka is Director of Marketing at\u00a0Alexa.com.<\/em><\/p>\n<h3><strong>Further reading:<\/strong><\/h3>\n<ul>\n<li><a href=\"https:\/\/searchenginewatch.com\/2018\/11\/20\/optimize-local-business-voice-search\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to optimize your local business for voice search<\/a><\/li>\n<li><a href=\"https:\/\/searchenginewatch.com\/2019\/04\/29\/how-to-take-advantage-of-the-latest-updates-to-google-search-console\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to take advantage of the latest updates to Google Search Console<\/a><\/li>\n<li><a href=\"https:\/\/searchenginewatch.com\/white-papers\/buyers-guide-enterprise-seo-tools\/\" target=\"_blank\" rel=\"noopener noreferrer\">Buyers Guide: Enterprise SEO Tools<\/a><\/li>\n<li><a href=\"https:\/\/searchenginewatch.com\/2019\/04\/26\/venngage-turns-seo-into-sales\/\" target=\"_blank\" rel=\"noopener noreferrer\">SEO case study: How Venngage turned search into their primary lead source<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>What is considered duplicate content? What steps can you take to make sure it doesn&#8217;t hamper your SEO efforts? Pressing SEO questions answered.<\/p>\n","protected":false},"author":1092,"featured_media":126102,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8,14,5],"tags":[1104,152,27441,1149,667,37,27442,784,22,239,27443,27344],"content_type":[],"class_list":["post-126063","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-content","category-development","category-seo","tag-301-redirect","tag-alexa","tag-canonical-tags","tag-content-audit","tag-duplicate-content","tag-google","tag-meta-content","tag-noindex","tag-seo","tag-seo-audit","tag-url-parameters-tool","tag-website-audit"],"acf":{"tad_independentcommercial":false,"tad_content_format":false},"post_info":{"name":"idris.nagri@blenheimchalcot.com idris.nagri@blenheimchalcot.com","title":"","thumbnail_url":"https:\/\/searchenginewatch.com\/wp-content\/uploads\/2019\/04\/Improving-your-sites-SEO-by-checking-duplicate-content-120x90.jpg","category":"Content","timeago":"7y"},"_links":{"self":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts\/126063","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/users\/1092"}],"replies":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/comments?post=126063"}],"version-history":[{"count":0,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts\/126063\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/media\/126102"}],"wp:attachment":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/media?parent=126063"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/categories?post=126063"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/tags?post=126063"},{"taxonomy":"content_type","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/content_type?post=126063"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}