Autospeak-Straight Talk contains articles covering digital and social media marketing social communities and events marketing

Autospeak-Straight Talk

View All Blog Posts

Widgets Home

Content Marketing

3 Myths About Duplicate Content
Tags:

duplicate content SEO content marketing non-original content

(Posted on Jul 12, 2014 at 11:50AM )

3 Myths About Duplicate Content By Andy Crestodina

The words â€œduplicate content penaltyâ€ strike fear in the hearts of marketers. People with no SEO experience use this phrase all the time. Most have never read Googleâ€™sÂ guidelines on duplicate content. They just somehow assume that if something appears twice online, asteroids and locusts must be close behind.

This article is long overdue. Letâ€™s bust some duplicate content myths.

Note: This article is about content and publishing, not technical SEO issues such as URL structure.

Myth #1: Non-Original Content on Your Site Will Hurt Your Rankings across Your Domain

I have never seen any evidence that non-original content hurts a siteâ€™s ranking, except for one truly extreme case. Hereâ€™s what happened:

The day a new website went live, a very lazy PR firm copied the home page text and pasted it into a press release. They put it out on the wire services, immediately creating hundreds of versions of the home page content all over the web. Alarms went off at Google and the domain was manually blacklisted by a cranky Googler.

It was ugly. Since we were the web development company, we got blamed. We filed a reconsideration request and eventually the domain was re-indexed.

So what was the problem?

Volume: There were hundreds of instances of the same text
Timing: All the content appeared at the same time
Context: It was the homepage copy on a brand new domain

Itâ€™s easy to imagine how this got flagged as spam.

But this isnâ€™t what people are talking about when they invoke the phrase â€œduplicate content.â€ Theyâ€™re usually talking about 1,000 words on one page of a well-established site. It takes more than this to make red lights blink at Google.

Many sites, including some of the most popular blogs on the internet, frequently repost articles that first appeared somewhere else. They donâ€™t expect this content to rank, but they also know it wonâ€™t hurt the credibility of their domain.

Myth #2: Scrapers Will Hurt Your Site

I know a blogger who carefully watches Google Webmaster Tools. When a scraper site copies one of his posts, he quickly disavows any links to his site. Clearly, he hasnâ€™t read Googleâ€™sÂ Duplicate Content GuidelinesÂ or theÂ Guidelines for Disavows.

Ever seen the analytics for a big blog?Â Some sites get scraped ten times before breakfast. Iâ€™ve seen it in their trackback reports. Do you think they have a full-time team watching GWT and disavowing links all day? No. They donâ€™t pay any attention to scrapers. They donâ€™t fear duplicate content.

Scrapers donâ€™t help or hurt you. Do you think that a little blog in Asia with no original writing and no visitors confuses Google? No. It just isnâ€™t relevant.

Personally, I donâ€™t mind scrapers one bit. They usually take the article verbatim, links and all. The fact that they take the links is a good reason to pay attention to internal linking. The links on the scraped version pass little or no authority, but you may get the occasional referral visit.

Tip: Report Scrapers that Outrank Your Site

On the (very) rare occasion that Google does get confused and the copied version of your content is outranking your original, Google wants to know about it. Hereâ€™s the fix. Tell them using theÂ Scraper Report Tool.

Tip: Digitally Sign Your Content with Google Authorship

Getting your picture to appear in search results isnâ€™t the only reason to use Google Authorship. Itâ€™s a way of signing your name to a piece of content, forever associating you as the author with the content.

With Authorship, each piece of content is connected to one and only one author and their corresponding â€œcontributor toâ€ blogs, no matter how many times it gets scraped.

Tip: Take Harsh Action against Actual Plagiarists

There is a big difference between scraped content and copyright infringement. Sometimes, a company will copy your content (or even your entire site) and claim the credit of creation.

Plagiarism is the practice of someone else taking your work and passing it off as their own. Scrapers arenâ€™t doing this. But others will, signing their name to your work. Itâ€™s illegal, and itâ€™s why you have a copyright symbol in your footer.

If it happens to you, youâ€™ll be thinking about lawyers, not search engines.

There are several levels of appropriate response. Hereâ€™sÂ a true story of a complete website ripoffÂ and step-by-step instructions on what actions to take.

Myth #3: Republishing Your Guest Posts on Your Own Site Will Hurt Your Site

I do a lot of guest blogging. Itâ€™s unlikely that my usual audience sees all these guest posts, so itâ€™s tempting to republish these guest posts on my own blog.

As a general rule, I prefer that the content on my own site be strictly original. But this comes from a desire to add value, not from the fear of a penalty.

Ever written for a big blog?Â Iâ€™ve guest posted on some big sites. Some actually encourage you to republish the post on your own site after a few weeks go by. They know that Google isnâ€™t confused. In some cases, they may ask you to add a little HTML tag to the postâ€¦

Tip: Use rel=â€œcanonicalâ€ Tag

Canonical is really just a fancy (almost biblical) word that means â€œofficial version.â€ If you ever republish an article that first appeared elsewhere, you can use the canonical tag to tell search engines where the original version appeared. It looks like this:

canonical anchor link reference example

Thatâ€™s it! Just add the tag and republish fearlessly.

Tip: Write the â€œEvil Twinâ€

If the original was a â€œhow toâ€ post, hold it up to a mirror and write the â€œhow not toâ€ post. Base it on the same concept and research, but use different examples and add more value. This â€œevil twinâ€ post will be similar, but still original.

Example Guest Post:Â Common Website Navigation Mistakes
Example Post on My Site:Â Website Navigation Best Practices

Not only will you avoid a penalty, but you may get an SEO benefit. Both of these posts rank on page one for â€œwebsite navigation.â€

Calm down, People.

In my view, weâ€™re living through a massive overreaction. For some, itâ€™s a near panic. So, letâ€™s take a deep breath and consider the followingâ€¦

Googlebot visits most sites every day. If it finds a copied version of something a week later on another site, it knows where the original appeared. Googlebot doesnâ€™t get angry and penalize. It moves on. Thatâ€™s pretty much all you need to know.

Remember, Google has 2,000 math PhDs on staff. They build self-driving cars and computerized glasses. They are really, really good. Do you think theyâ€™ll ding a domain because they found a page of unoriginal text?

A huge percentage of the internet is duplicate content. Google knows this. Theyâ€™ve been separating originals from copies since 1997, long before the phrase â€œduplicate contentâ€ became a buzzword in 2005.

Disagree? Got Any Conflicting Evidence?

When I talk to SEOs about duplicate content, I often ask if they have first-hand experience. Eventually, I met someone who did. As an experiment, he built a site and republished posts from everywhere, verbatim, and gradually some of them began to rank. Then along came Panda and his rank dropped.

Was this a penalty? Or did the site just drop into oblivion where it belongs? Thereâ€™s a difference between a penalty (like the blacklisting mentioned above) and a correction that restores the proper order of things.

If anyone out there has actual examples or real evidence of penalties related to duplicate content, Iâ€™d love to hear â€˜em.

About the Author:Â Andy Crestodina is the Strategic Director ofÂ Orbit Media, a web design company in Chicago. You can find Andy onÂ Google+Â andÂ Twitter.

Photo Courtesy of Yehyon Chung, Poptip