{"id":663,"date":"2017-10-18T16:17:42","date_gmt":"2017-10-18T16:17:42","guid":{"rendered":"https:\/\/www.searchenginewatch.com\/2017\/10\/18\/duplicate-content-faq-what-is-it-and-how-should-you-deal-with-it\/"},"modified":"2020-02-12T07:16:08","modified_gmt":"2020-02-12T07:16:08","slug":"duplicate-content-faq-what-is-it-and-how-should-you-deal-with-it","status":"publish","type":"post","link":"https:\/\/searchenginewatch.com\/2017\/10\/18\/duplicate-content-faq-what-is-it-and-how-should-you-deal-with-it\/","title":{"rendered":"Duplicate content FAQ: What is it, and how should you deal with it?"},"content":{"rendered":"<p><strong>There are a few questions that have been confusing the SEO industry for many years. No matter how many times Google representatives try to clear the confusion, some myths persist. <\/strong><\/p>\n<p>One such question is the widely discussed\u00a0issue of <a href=\"https:\/\/searchenginewatch.com\/2016\/10\/03\/guide-to-google-ranking-signals-part-5-duplicate-content-and-syndication\/\">duplicate content<\/a>. What is it, are you being penalized for it, and how can you avoid it?<\/p>\n<p>Let&#8217;s\u00a0try to clear up some of the confusion\u00a0by answering some frequently-asked (or frequently-wondered) questions about duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/04\/16\/seo-writing-guide-from-keyword-to-content-brief\/\">content<\/a>.<\/p>\n<h2>How\u00a0can you diagnose a duplicate content penalty?<\/h2>\n<p>It&#8217;s funny how some of the readers of this article are rolling their eyes right now reading the first subheading. But let&#8217;s deal with this myth first thing.<\/p>\n<p><a href=\"http:\/\/www.thesempost.com\/duplicate-content-penalty\/\">There\u00a0is no duplicate content penalty<\/a>. None of Google&#8217;s representatives has ever confirmed the existence of such a penalty; there were no algorithmic updates called &#8220;duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/03\/12\/social-media-how-does-it-affect-seo\/\">content<\/a>&#8220;; and there can never be such a penalty because in the overwhelming number of cases, duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/05\/22\/nine-types-of-meta-descriptions-that-win-more-clicks\/\">content<\/a> is a natural thing with no evil intent behind that. We know that, and Google knows that.<\/p>\n<p>Still, lots of SEO experts keep &#8220;diagnosing&#8221; a duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/03\/podcast-seo-tips-101\/\">content<\/a> &#8220;penalty&#8221; when they analyze every other website.<\/p>\n<p><a href=\"https:\/\/searchenginewatch.com\/2019\/07\/16\/delete-your-pages-and-rank-higher-in-search-index-bloat-and-technical-optimization-2019\/\">Duplicate content<\/a> is often mentioned in conjunction with updates like Panda and <a href=\"https:\/\/searchenginewatch.com\/2017\/09\/27\/the-last-word-on-fred-from-googles-gary-illyes\/\">Fred<\/a>, but it is used to identify bigger issues, i.e. thin or\u00a0spammy\u00a0(&#8220;spun&#8221;, auto-generated, etc.) and stolen (scraped) <a href=\"https:\/\/searchenginewatch.com\/2019\/06\/25\/google-ads-hacks\/\">content<\/a>.<\/p>\n<p>Unless you have\u00a0the latter issue, a few instances of duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/12\/how-to-get-featured-snippets-no-link-building\/\">content<\/a> throughout your site cannot cause an isolated penalty.<\/p>\n<p>Google keeps\u00a0urging website owners to focus on <a href=\"https:\/\/searchenginewatch.com\/2016\/09\/19\/guide-to-google-ranking-signals-part-3-quality-content\/\">high-quality expert content<\/a>,\u00a0which is your safest bet when it comes to avoiding having your pages flagged as a result of thin <a href=\"https:\/\/searchenginewatch.com\/2019\/07\/12\/how-to-get-featured-snippets-no-link-building\/\">content<\/a>.<\/p>\n<p>You do want to\u00a0handle your article republishing strategy carefully, because you don&#8217;t want to confuse Google when it comes to finding the actual source of the <a href=\"https:\/\/searchenginewatch.com\/2019\/08\/20\/content-marketing-tools-recommended-by-the-best\/\">content<\/a>. You don&#8217;t want to have your site pages filtered when you <a href=\"https:\/\/searchenginewatch.com\/2017\/10\/04\/still-doing-guest-blogging-keep-these-4-tips-in-mind\/\">republish your article on an authoritative blog<\/a>. But if it does happen, chances are, it will not reflect on how Google treats your overall site.<\/p>\n<p>In short, duplicate <a href=\"https:\/\/searchenginewatch.com\/2019\/09\/04\/improve-seo-using-data-science\/\">content<\/a> is a filter, not a penalty, meaning that Google has to choose one of the URLs with non-original <a href=\"https:\/\/searchenginewatch.com\/2020\/01\/07\/five-simple-content-marketing-trends-for-2020\/\">content<\/a> and filter out the rest.<\/p>\n<h2>So should I just stop worrying about internal duplicate content then?<\/h2>\n<p>In short, no. It&#8217;s like you don&#8217;t want to ignore a recurring headache:\u00a0it&#8217;s not that a headache is a disease on its own, but it may\u00a0be a symptom\u00a0of\u00a0a more serious condition, so you want to clear those out or treat them if there are any.<\/p>\n<p>Duplicate content may signal some structural issues within your site, preventing Google from understanding what they should rank and what matters most on your site. And generally, while Google is getting much better at understanding how to handle different instances of the same content within your site,\u00a0<strong>you still don&#8217;t want to ever confuse Google.<\/strong><\/p>\n<p>Internal duplicate content may signal\u00a0a lack of original content on your site too, which is another problem you&#8217;ll need to deal with.<\/p>\n<p>Google wants original content in their SERPs for obvious reasons: They don&#8217;t want their <a href=\"https:\/\/searchenginewatch.com\/2018\/12\/21\/guide-google-analytics-confusing-terms\/\">users<\/a> to land on the same content over and over again. That&#8217;s a bad <a href=\"https:\/\/searchenginewatch.com\/2016\/07\/20\/five-simple-user-experience-tweaks-for-better-conversion\/\">user experience<\/a>. So Google will have to figure out which non-unique pages they want to show to their users and which ones to hide.<\/p>\n<p>That&#8217;s where a problem can occur: The more pages on your site have original content, the more Google positions they may be able to appear at throughout different search queries.<\/p>\n<p>If you want to know whether your site has any internal duplicate content issues, try using tools like\u00a0<a href=\"https:\/\/seranking.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">SE Ranking<\/a>,\u00a0which\u00a0crawls your website and analyzes\u00a0whether there are any URLs with duplicate content Google may be confused about:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter\" src=\"http:\/\/annsmarty.com\/wp-content\/uploads\/2017\/10\/seranking.png\" alt=\"SE Ranking\" width=\"600\" height=\"113\" \/><\/p>\n<h2>How does Google choose which non-original URLs to rank and which to filter out?<\/h2>\n<p>You&#8217;d think Google would want to choose\u00a0the more authoritative post (based on various signals including <a href=\"https:\/\/searchenginewatch.com\/2016\/11\/07\/guide-to-google-ranking-factors-part-10-backlinks\/\">backlinks<\/a>), and they probably do.<\/p>\n<p>But what they also do is choose\u00a0the shorter URL when they find two more pages with identical URLs:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter\" src=\"http:\/\/annsmarty.com\/wp-content\/uploads\/2017\/10\/duplicate-content.png\" alt=\"Duplicate content\" width=\"600\" height=\"555\" \/><\/p>\n<h2>How about <a href=\"https:\/\/searchenginewatch.com\/2017\/05\/17\/international-seo-5-ways-to-scale-performance\/\">international<\/a> websites? Can translated content pose a duplicate content issue?<\/h2>\n<p>This question was addressed by\u00a0<a href=\"https:\/\/www.youtube.com\/watch?v=UDg2AGRGjLQ\">Matt Cutts back in 2011<\/a>. In short, translated content doesn&#8217;t pose any duplicate content issues even if it&#8217;s translated very closely to the original.<\/p>\n<p>There&#8217;s one word of warning though: Don&#8217;t publish automated translation using tools like Google Translate because Google is very good at identifying those. If you do so, you run into risk of having your content labeled as spammy.<\/p>\n<p>Use real translators whom you can find using platforms like\u00a0Fiverr,\u00a0Upwork\u00a0and\u00a0Preply. You can find high-quality translators and native speakers there on a low budget.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter\" src=\"http:\/\/annsmarty.com\/wp-content\/uploads\/2017\/10\/translation.png\" alt=\"Translation\" width=\"600\" height=\"322\" \/><\/p>\n<p style=\"text-align: center;\"><em>Look for native speakers in your target language who can also understand your base language<\/em><\/p>\n<p>You are\u00a0<a href=\"http:\/\/support.google.com\/webmasters\/bin\/answer.py?hl=en&amp;answer=189077\">also advised<\/a>\u00a0to use the hreflang attribute to point Google to the actual language you are using on a regional version of your website.<\/p>\n<h2>How about different versions of the website across different localized domains?<\/h2>\n<p>This\u00a0can be tricky, because it&#8217;s not easy to come up with completely different content when putting up two different websites with the same products for the US and the UK, for example. But you still don&#8217;t want Google to choose.<\/p>\n<p>Two workarounds:<\/p>\n<ul>\n<li>Focus on local traditions, jargon, history, etc. whenever possible<\/li>\n<li>Choose the country you want to focus on\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/182192?hl=en#2\">from within Search Console<\/a>\u00a0for all localized domains except .com.<\/li>\n<\/ul>\n<p>There&#8217;s\u00a0another old video from Matt Cutts\u00a0which explains this issue and the solution:<\/p>\n<p><iframe loading=\"lazy\" title=\"How can I tell Google that multiple domains are related?\" width=\"640\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/EcSsMbFSGyc?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p>Are there any other duplicate-content-related questions you&#8217;d like to be covered? Please comment below!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There are a few questions that have been confusing the SEO industry for many years. No matter how many times Google representatives try to clear the confusion, some myths persist. One such question is the widely discussed issue of duplicate content. What is it, are you being penalized for it, and how can you avoid it?<\/p>\n","protected":false},"author":1092,"featured_media":664,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8,5],"tags":[667,668,422],"content_type":[27095],"class_list":["post-663","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-content","category-seo","tag-duplicate-content","tag-google-fred","tag-google-panda","content_type-news"],"acf":{"tad_independentcommercial":false,"tad_content_format":false},"post_info":{"name":"idris.nagri@blenheimchalcot.com idris.nagri@blenheimchalcot.com","title":"","thumbnail_url":"https:\/\/searchenginewatch.com\/wp-content\/uploads\/2018\/10\/photocopier-120x90.jpg","category":"Content","timeago":"8y"},"_links":{"self":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts\/663","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/users\/1092"}],"replies":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/comments?post=663"}],"version-history":[{"count":0,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/posts\/663\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/media\/664"}],"wp:attachment":[{"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/media?parent=663"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/categories?post=663"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/tags?post=663"},{"taxonomy":"content_type","embeddable":true,"href":"https:\/\/searchenginewatch.com\/wp-json\/wp\/v2\/content_type?post=663"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}