Duplicate Content Problem, Be Careful!
Duplicate content is not a new topic. We all know that it is not good for your site.
SEOmoz divides this problem into two categories: an “issue” and a “penalty”. An “issue” arises when Google or other search engines don’t know how to index and rank the same piece of content on different domains. Rand Fishkin summarized how to tell whether you are having an issue or getting penalized:
Penalties require a good bit of abuse to go into effect, but I’ve seen it happen, even on domains from respectable brands. The penalties really arise when you start copying hundreds or thousands of pages from other domains and don’t have a considerable amount of unique content of your own.
Recently, CN Reviews experienced one of these duplicate content problems, somewhere between an issue and a penalty. I want to share what happened in the hope that it helps you maintain a healthy blog.
Symptoms of sickness:
- CNReviews couldn’t rank first on Google for its own title tags. In fact, an RSS feed aggregator site called virtualreview.org ranked for all our title tags during that period.
- The ranking of some keywords dropped dramatically. See below. Our ranking for “airport”, for example, fell sharply between May 16 and Jun 8.
Troubleshooting process:
- I first “blamed” a plugin we had recently installed to create “sticky posts”, so I deactivated it.
- I signed up for Google Webmaster Tools to look at CNReviews through the eyes of Google (the bot). It is a two-step process: sign up here, then upload a verification file to your site’s root directory. I soon found a few hundred page URLs ending with “?wpcf7=json”. For example, we had a page called cnreviews.com?wpcf7=json which was exactly the same as the cnreviews homepage. According to WebTalk, this is a problem created by a WordPress plugin called “Contact Form 7”, which we have had installed since the blog launched.
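The kind of check described above can be sketched in a few lines of Python. This is only an illustration, not what Google Webmaster Tools does internally: the function names `canonical_url` and `group_duplicates` are made up for this example, and it assumes you already have a list of indexed URLs (for instance, copied out of a report) and that the `wpcf7` query parameter never changes the page content.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def canonical_url(url, ignored_params=("wpcf7",)):
    """Return the URL with the ignored query parameters stripped.

    Assumption: parameters listed in ignored_params do not change
    what the page shows, so URLs differing only in them are duplicates.
    """
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ignored_params
    ]
    path = parts.path or "/"  # treat "example.com" and "example.com/" alike
    return urlunsplit((parts.scheme, parts.netloc.lower(), path, urlencode(kept), ""))


def group_duplicates(urls):
    """Group URLs by canonical form; keep only groups with more than one URL."""
    groups = defaultdict(list)
    for url in urls:
        groups[canonical_url(url)].append(url)
    return {canon: members for canon, members in groups.items() if len(members) > 1}


if __name__ == "__main__":
    # Hypothetical sample list; only the ?wpcf7=json pattern comes from the post.
    indexed = [
        "http://cnreviews.com/",
        "http://cnreviews.com?wpcf7=json",
        "http://cnreviews.com/about/",
    ]
    for canon, members in group_duplicates(indexed).items():
        print(canon, "<-", members)
```

Any group that comes back holds several URLs that point at the same page, which is exactly the pattern that showed up in our report.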
Solutions
- I deactivated the Contact Form 7 plugin.
- I used the “disallow” directive in robots.txt to keep Googlebot away from the pages that have “?wpcf7=json” in their URLs. It is very easy to put this robots.txt file together once you get into Google Webmaster Tools and follow the instructions.
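To sanity-check a disallow rule before relying on it, a small script can mimic the wildcard matching that Googlebot documents for robots.txt paths (`*` matches any run of characters, a trailing `$` anchors the end). The rule `/*?wpcf7=json` below is an assumption made for illustration; the exact line we put in our robots.txt is not reproduced here. The helper names `rule_to_regex` and `is_blocked` are also hypothetical, and the sketch ignores `Allow` lines and user-agent groups.

```python
import re
from urllib.parse import urlsplit


def rule_to_regex(rule):
    """Translate a Google-style robots.txt path rule into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the
    end of the URL. Everything else is matched literally from the start.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(piece) for piece in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(url, disallow_rules):
    """Return True if the URL's path plus query matches any disallow rule."""
    parts = urlsplit(url)
    target = (parts.path or "/") + ("?" + parts.query if parts.query else "")
    return any(
        rule_to_regex(rule).match(target)
        for rule in disallow_rules
        if rule  # an empty Disallow value blocks nothing
    )


if __name__ == "__main__":
    # ASSUMED rule, equivalent to this robots.txt fragment:
    #   User-agent: Googlebot
    #   Disallow: /*?wpcf7=json
    rules = ["/*?wpcf7=json"]
    for url in ("http://cnreviews.com?wpcf7=json", "http://cnreviews.com/"):
        print(url, "blocked" if is_blocked(url, rules) else "allowed")
```

The point of the check is that the duplicate URLs are blocked while the real pages stay crawlable; a rule that is too broad would take the homepage out along with the duplicates.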
So far, I think we have solved the problem, as you can see search traffic for “airport” going up again. But why does Google, such an intelligent search engine, index pages like this, when the “?wpcf7=json” code is only used by the plugin during its AJAX submission (POST) process? And why didn’t this issue surface as a problem earlier? I don’t know the answers from a technical standpoint, but the problem became visible after we got the traffic spikes from the Sichuan Earthquake Donation Guide.
Lessons Learned:
- Do more research on plugins before installing them.
- Monitor your metrics, especially when you have a spike in traffic; a larger data set tells you more. If you find something unusual, run some sample queries to see whether your rankings for past top keywords have dropped.
- Sign up for Google Webmaster Tools and check whether Google has indexed any duplicate content from your site.









