Kissmetrics Blog

A blog about analytics, marketing and testing


Content Scrapers – How to Find Out Who is Stealing Your Content & What to Do About It

If you have been blogging for a while, chances are you are familiar with content scrapers. Content scrapers are websites that steal your content for their own blogs without your permission. Some content scrapers will just copy the content from your blog by hand, but most use automated software that takes the content from your RSS feed and posts it to their site as if it were a new post.

In this post, we are going to look at some potential link building benefits of content scrapers, how to find out which sites are scraping your content, and what you can do if you want to either benefit from the links or have the content taken down.

Linking Benefits of Content Scrapers

Last week, I was happy to see that I was listed in ProBlogger’s 20 Bloggers to Watch in 2012. Within 24 hours, I received a notification in my WordPress dashboard that a page on my blog had been linked to in the post on ProBlogger’s site.

After receiving the original notification from the ProBlogger post, I also received another 18 trackbacks from sites that had stolen the content in their post verbatim. Trackbacks are WordPress’ way of letting you know that another website has linked to a post on your blog. In this case, these 18 sites had posted the content exactly like the original post – with the links back to my blog still intact.

It was then that I started contemplating the potential link building benefits of content scrapers. These are not by any means quality links – the highest Google PageRank was a PR 2 domain, many were stealing content in a variety of languages, and one even had the nerve to use some kind of redirection script to take away the link juice of outgoing links! So while these links didn’t have the same authority that the original post had, they still count as links.

How to Catch Content Scrapers

Unfortunately, unless you want to continuously search for your post titles in Google, you’ll only be able to easily track down sites that keep your in-content links active. If you want to know what websites are scraping your content, here are a few tips to sniff them out.


Copyscape

Copyscape is a simple search engine that allows you to enter the URL of your content to find out if there are duplicates of it on the Internet. You can get a few results using their free search, or you can pay for a premium account to check up to 10,000 pages on your site and more.


Trackbacks

Another way is through your trackbacks in WordPress (as shown in the image above). Many of these will show up in the spam folder if you use Akismet. The key to getting trackbacks to appear from content scrapers is to always include links to other posts in your content. Be sure those links have great anchor text if you’re going for a little extra link juice. And even if you are not, internal linking with strong anchor text is good for your on-site optimization too!

Webmaster Tools

The next way to catch them is in Webmaster Tools. Simply go to your site in Webmaster Tools, and look under Your Site on the Web > Links to Your Site. Then sort by the Linked Pages column.

Anyone thinking about link building benefits at this point is probably noting the sheer volume of links from these sites, some of which are content scrapers. Essentially any site that is linking to a lot of your posts that isn’t a social network, social bookmarking site, or a die-hard fan who just loves linking to you is potentially a content scraper. You’ll have to go to their website to be sure. To find your links on their site, click on one of the domains to see the details of what pages on your site they are linking to specifically.

Then, click on one of your links to see which pages on their site are linking to yours.

You can see here that they are just blatantly copying my post titles. When I visited one of the links, sure enough, they were copying my entire posts in their full glory onto their site.
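If you export that link data, the same check can be scripted. The following is only a minimal sketch in Python, assuming you already have a list of (linking page URL, linked page URL) pairs. The function name `likely_scrapers`, the `min_pages` threshold, and the `ignore` list are hypothetical names chosen for illustration; they are not part of Webmaster Tools.

```python
from collections import defaultdict
from urllib.parse import urlparse


def likely_scrapers(links, min_pages=3, ignore=()):
    """Return (domain, page_count) pairs for domains linking to many of your pages.

    `links` is an iterable of (source_url, target_url) pairs.
    `ignore` lists domains you already trust (social networks, known fans).
    """
    pages_by_domain = defaultdict(set)
    for source_url, target_url in links:
        domain = urlparse(source_url).netloc.lower()
        if domain in ignore:
            continue
        # A set keeps each of your pages counted once per linking domain
        pages_by_domain[domain].add(target_url)

    flagged = {
        domain: len(pages)
        for domain, pages in pages_by_domain.items()
        if len(pages) >= min_pages
    }
    # Heaviest linkers first
    return sorted(flagged.items(), key=lambda item: item[1], reverse=True)
```

A domain on the resulting list is only a candidate. As noted above, you still have to visit the site to confirm that it is actually republishing your posts.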

Google Alerts

If you don’t post often or want to keep up with any mentions of your top blog posts on other websites, you can create a Google Alert using the exact match for your post’s title by putting the title in quotation marks.

I deliver all of my Google Alerts to an RSS feed so I can manage them in Google Reader, but you can also have them delivered regularly by email. You’ll even get an instant preview of the types of results you will get.
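If you have many posts to monitor, you can prepare the exact-match queries in bulk and paste them into Google Alerts one at a time. The snippet below is a small Python sketch of that idea; the helper name `exact_match_query` and the sample titles are made up for illustration, and it only builds the query text rather than talking to Google in any way.

```python
def exact_match_query(text):
    """Wrap text in double quotes so a search engine treats it as an exact phrase."""
    # Drop double quotes already in the text so they don't end the phrase early
    cleaned = text.replace('"', "")
    # Collapse runs of whitespace and trim the ends
    cleaned = " ".join(cleaned.split())
    return f'"{cleaned}"'


post_titles = [
    "How to Catch Content Scrapers",
    'My "Favorite" Blogging Tools',
]

for title in post_titles:
    print(exact_match_query(title))
```

The same helper works for any distinctive sentence from a post, which can be useful when a scraper changes the title but keeps the body text.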

How to Get Credit for Scraped Posts

If you use WordPress, then you definitely want to try out the RSS footer plugin. This plugin allows you to place a custom piece of text at the top or bottom of your RSS feed content.

The result is this simple line on my blog posts when viewed through an RSS feed.

As you can see, even if you aren’t using it for the purpose of getting credit back to your posts when content thieves steal it, you can still use it for a little extra bit of advertising with the possible benefit of people who subscribe to your RSS feed clicking through to your website or social profiles. And when someone does scrape your content from your RSS feed, it shows up there too.

So in the event that someone finds your scraped content, they will hopefully notice the credit before assuming it was created by the blog that stole it. If you don’t have WordPress, you can simply include a note at the top or bottom of your content that includes the same information.
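If your platform lets you process feed content yourself, the idea behind the footer is simple to sketch. The Python example below is an illustration only and not how the RSS footer plugin is implemented; the function name `add_feed_footer`, its parameters, and the "originally appeared on" wording are all assumptions. It uses full (absolute) URLs so the links still point back to you after the content is copied elsewhere.

```python
from html import escape


def add_feed_footer(content_html, post_title, post_url, site_name, site_url):
    """Append an attribution line with absolute links to a feed item's HTML."""
    footer = (
        '<p><a href="{post_url}">{post_title}</a> originally appeared on '
        '<a href="{site_url}">{site_name}</a>.</p>'
    ).format(
        # Escape values so titles containing & or < don't break the markup
        post_url=escape(post_url, quote=True),
        post_title=escape(post_title),
        site_url=escape(site_url, quote=True),
        site_name=escape(site_name),
    )
    return content_html + "\n" + footer


item = add_feed_footer(
    "<p>Post body goes here.</p>",
    "Tips & Tricks",
    "https://blog.example/tips",
    "Example Blog",
    "https://blog.example/",
)
print(item)
```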

How to Stop Content Scrapers

If you’re not interested in anyone copying your content, then you have a few options to choose from. You can start by contacting the site that is stealing your content and sending them a notice that you want all of your content removed immediately. You can do this through the site’s contact form, email address, or post it to any social accounts they list.

If there is no contact information on the website stealing your content, you can do a Whois Lookup to (hopefully) find out who owns the domain.

If it is not privately registered, you should find an administrative contact’s email address. If not, you should at least see the domain registrar which, in this case, is GoDaddy and/or the hosting company for the website which, in this case, is HostGator. You can try to contact both companies (HostGator has a DMCA form and GoDaddy has an email) and let them know that the domain in question is stealing copyrighted content in hopes that the website will be suspended or removed.

You can also visit the DMCA and use their takedown services to remove anyone who is copying your photos, video, audio, blog, or other content. They even offer a WordPress plugin to incorporate a DMCA protected badge on your site to warn potential thieves.

Have you ever dealt with content scrapers and thieves? Do you leave it alone for the link benefits, or do you fight back? What other tools, services, or other preventative tactics do you use to block content scrapers? Please share your thoughts and experiences in the comments!

About the Author: Kristi Hines is a freelance writer, professional blogger, and social media enthusiast. Her blog Kikolani focuses on blog marketing for personal, professional, and business bloggers. You can follow her on Twitter and Facebook.

  1. Great post, Kristi.

    I’m always on the look out for content thieves.

  2. In the past I’ve used Tynt. You add a script to your blog and it reports what has been copied.

  3. Tea Silvestre Jan 09, 2012 at 2:55 pm

    I’ve seen a lot of sites copy my content verbatim, leaving links intact, but they always show that the post was written by me and my name is linked back to my site. Technically, is that scraping?

    • I consider any site that uses your full content without your permission a form of content theft. Scraping is really the process for grabbing the content – specifically the software / plugins that will “scrape” content from RSS feeds. Some of those will do it with full attribution, others will just grab what is there, and some sites specifically strip any links or originating author information.

  4. Excellent post, especially the part about how to turn content theft to your advantage…

  5. Kristi, I love that you’ve found the “bright side” of the issue. We could complain all day about the scrapers and make ourselves crazy fuming, but they aren’t going away any time soon. I’d rather join you in making lemonade! Thanks for pointing us to the sugar.

  6. Great post Kristi!

    I installed the RSS-footer immediately and it works like a charm!
    Thanks for the tip!


    PS. I’m now following you on Twitter as well.

  7. Is it worth hunting them down? If you run a site with several new posts a day, it’ll be a lot of work keeping track of them and hunting them down.

    • It depends on how proprietary you want to keep your content. I know some people that do hunt them down and stand up to them. I really don’t have time to do it, but sometimes I will check out the sites to at least make sure they are not posting my content next to something offensive.

      • Kristi,
        There are a lot of sites on the Internet that will hurt your rankings (especially in Google) if you let them link to you, so you should find out what kind of site has used your content and what reputation it has with the big ‘G’.

        If they are questionable you should do something about that type of link.


  8. Really enjoy the rss footer options

  9. Hi! This is very useful information. Thanks for sharing. Now I can catch content thieves.

  10. Michelle Minch Jan 10, 2012 at 6:44 am

    Awesome post, Kristi! My content, but especially my photos, are hijacked fairly frequently. I watermark all my photos, and sometimes the thieves have the nerve to crop out the watermark. I have started watermarking them higher up in the photo and more prominently. My content is also frequently plagiarized. People say that copying is the sincerest form of flattery. I really don’t want to be flattered in that way!

    For me, the biggest problem is website designers who steal images off Google Images to make their clients’ sites pretty. Very often, the client is totally unaware where the photos came from or that they are infringing on someone’s copyright. I received a nasty email from a web designer after sending him a copy of the US Copyright law (he said the photos didn’t say “copyright” so he was free to use them – he was wrong). He did remove the photos, but not without a fight. I doubt he will ever use another of my photos, but he made it clear he was not going to change his ways.

    Thank you for this great information (I love all the details) and for getting the word out.

    • You might try contacting the client in whose material the image appears, and send them a bill for the licensing fee for the image, with an explanation that their web designer has illegally used your material.

      I’ve seen it done and I’ve seen it work. Even if the bill doesn’t get paid, you make the designer look horrible. Which he/she should.

    • Hi Michelle! I’d like to think that copying is flattery, but a lot of copying these days comes from automation based on a keyword. Flattery would be a bit better comparatively. :)

      When it comes to photos, that is definitely something you want to fight for. My husband sells his photos as prints, so it’s almost product theft when his are taken. He uses the metadata option in Lightroom to automatically add his name, website, and copyright information into each photo he imports. I’ve noticed that when you upload one of his photos, that info actually pops up in the details in WordPress. I’m not sure what program you use, but you could add your copyright info that way so that it is on each individual photo.

    • michael, don’t be such a fool. Professional web designers don’t steal photos from other websites or from Google Images. How are you so sure he/she was a web designer and not an amateur, aka “web designer”?

  11. Jonathan Jones Jan 10, 2012 at 7:13 am

    I used Copyscape and found that someone has copied the content on my social media blog. But what would be the next step?

    • Hi Jonathan. The next steps are listed in the last section of this post under “How to Stop Content Scrapers.” Basically you can report the site to their hosting company or work with the DMCA to do a takedown. Good luck!

  12. Kristi, thanks for pointing out what most people don’t seem to realize — if you include lots of internal links in your content (which is a good practice anyway), scrapers don’t do much harm. (I’m talking written content, photographers should IMO go after those who steal their work.)

    If we spent any time going after the hundreds of scrapers who lift Copyblogger posts, we’d have to choose what productive activity to stop doing instead. It’s annoying, but there are worse annoyances on the web.

    I really like that RSS footer plugin.

    • Hi Sonia! I know what you mean – tracking down people who lift content from my site wouldn’t be so bad, but if I started going on the lookout for the other sites I write for, then I’d be in for a time-consuming fight. I figure even if they are lower quality, having the backlinks is nice. In the event that someone did find a scraped post over the original, at least they’ll be pointed back in the right direction. Glad you like the plugin! :)

    • Sonia, I realize this is an old article, but I have to wonder if scrapers (or blatant lazy plagiarists) started targeting Copyblogger after you basically announced Copyblogger doesn’t pay too much attention to scrapers. Hopefully in the time since this post your company has started tracking it.

  13. Scrapers and curators are two entirely different creatures, imo. If you’re not publishing the entire article, you aren’t a scraper.

    I won’t speak for anyone else, but we love curators. :)

  14. I am so glad that I found this. I hate thieves. I work hard to create my own content. I pay for my images. Why should people get away with stealing my hard work. I have +1d this, liked it, shared it and now I’m going to bookmark it. After that I’m going on a crusade. If I can get a thief’s blog or site shut down, I will. If everybody took action, they would think twice about their parasitic behaviour.

    • Thanks for sharing this post Steve. As far as the images, if you pay for the image license, and the content is scraped, can you report the thief to the site/artist that licensed the image? Then maybe you can have them fight it too?

  15. Stephen Hamilton Jan 10, 2012 at 1:40 pm

    Your suggestion about adding in an RSS footer, including links and simple CTAs, is gold! So simple, but one I hadn’t thought of myself. :)

    • I used to have the plugin on my site, then forgot to reinstall it after I had to move the site to a new host. Then I saw it on another of my subscriptions and remembered just how genius it was!

  16. Agree with Simone to a great extent that if you are likely to get scraped make sure there are plenty of full url address links (not relative ones) back to internal content to minimise any possible detrimental impact. Most scrapers will maintain content text links.

    Do they not say ‘is plagiarism or imitation not the sincerest form of flattery’!

    Having said that if the only objective is to attract visitors to a low-quality site mainly for adsense or other advertising purposes it may be that the recent Panda/Farmer update and subsequent algorithm changes may have addressed some of the issues with low-quality scraper sites.

    If I remember rightly, Matt Cutts, Google’s Spam Master, stated that although an issue of duplicate content their algorithm can and does make every effort to identify the original content giving it attribution and higher ranking weight than obvious straight copies.

    • Google does a good job of ranking the scrapers beneath the original Rob, but I have still found some cases where I’m searching for an article and come across the duplicate instead of the original. And I have seen some scraper sites actually have comments on the stolen posts too! That’s why I think sticking an attribution in is important, just in case people find the duplicate first.

  17. Great tips on RSS footer options for WordPress. Thanks.

    Glad I discovered you on the new My SEO Community site.

  18. Hi Michael! I do believe there are good ways to go about content curation, and so long as you are getting permission from the author and crediting them in the repost, then there’s nothing wrong with it at all. I’ve given permission to a few curation sites – I basically look for a site where I can expand my own reach.

    As far as advice, I would say that you need to put up the reasons why someone should want content on your site. Traffic stats, future plans, how it will be displayed, customized byline, etc. Maybe even some control as to which posts will be curated and which won’t, like letting the author submit a category feed instead of their main one. :)

  19. Really thorough post. I definitely learned something today, although, thankfully, I haven’t had to deal with it firsthand (yet!). I am going to add the RSS footer that you describe today, because you are so right that we can’t prevent scrapers completely, so we might as well do what we can to benefit from their actions as much as possible.

    I did see you listed on the Copyblogger list of bloggers to watch in 2012. After reading this post, I’ll definitely be back.

  20. Heck yes everyone should fight back to help make this problem shrink! The question we have is what if it is a press release? We link back but do we have to since the release is for journalist?

  21. I only knew about Copyscape, I never realised there were so many ways to find out who is nicking your content. I was even more surprised to learn I could find out via Google Analytic. Thanks Kristi. :)

  22. Thank you for the tip on the plugin! I installed it immediately. I often find my RSS feeds on scraper sites. Thank you!

  23. Good job, Kristi… I never knew Webmaster Tools had this kind of information in it. I will explore it tonight, because our website content was on several blogs and I need to track them down. I was looking for an application to find them, and it turns out Webmaster Tools has what I really want.

  24. The only time I happen to notice my content or images that appear elsewhere is because I happen to be checking logs and notice a bunch of incoming traffic from the same site or page. It tends to be other sites using my screen shot images. It is a bit annoying. I don’t have a lot of time to contact the site owner, host, or dmca. It does make me wonder how much is actually copied, scraped, etc. and many people probably don’t even realize it.

  25. Really learnt a lot through this article. The images made it easy.

  26. Very interesting article, and there are a couple of links I should definitely check. As for me, I just publish my RSS feed as summaries and not full posts. It works for now, but one of the tools I have to check is the RSS footer plugin.

    Thanks for the interesting article.

  27. Well, lots of people who have their own blogs have to deal with this problem of content scraping. I have the same problem: I have a blog on umrah packages, and I suffer a lot from the theft of my content, but now I will check out the tools you suggest. Your suggestions are worth putting into practice.

  28. Karen Maskall Jan 19, 2012 at 11:17 am

    Great post, Kristi, and thanks for the link to the WordPress RSS footer plugin. As you say, it’s a brilliant way to get some credit back even if they are low quality sites. Better than nothing at all, isn’t it?

  29. Evelyn Vincent Jan 20, 2012 at 8:25 am

    Thanks so much Kristi!!! I’ll have to go check my other blogs but I know for certain that this is happening regularly with one of them… I had been wondering what all of those WordPress Trackbacks were and if it was a good thing or not. Now I know and I know what to do about it.

  30. Most bloggers are facing the same problem, including me. This post gives valuable information. Thanks for this post.

  31. Content thieves really annoy me, so thanks for your suggestions.

    We take the time to create good unique content and some **** steals it and half the time doesn’t even bother to spin it.

    Lazy scumbags.

  32. Thanks for the invaluable info!

    That makes two of your posts I now need to take action on after reading. You are definitely keeping me busy :)

  33. I am glad that you point out that there are some situations where content scrapers benefit the sites from which they retrieve information.

    A good example of “white hat” web scraping is my horses-for-sale platform, Horses Farm & Market. It indexes horse for sale ads from other sites and compiles them into a single, searchable list. It gives a very brief summary of each external ad with a link to the original ad, and it also has Twitter, Facebook, and Google+ share buttons for each external ad as a courtesy to those sites. The intent is to augment the external services by providing a layer of greater functionality and accessibility.

  34. Thanks so much for the WP “trackbacks” clarification. I usually just spam or trash them when the sources are less than reputable (most are). Ironically, earlier today I was curious and followed one trackback, only to find my content on another site with the backlinks still intact. I often use anchor text in my hrefs, so like you said, it’s odd that the “scrapers” leave it all intact. Anyway, thanks so much for the clarification.


  35. Great post and blog! I do not have time to read every post right now, but I have bookmarked it and added your feeds, so when I have time I’ll be back to read more. Please continue the great work.

  36. Finding out such content scrapers of your site is really a tough job. But thanks for your nice post and tips I will act upon these tips mentioned in your post to trace content scrapers of my site.

  37. Hi,
    It’s very difficult to stop content scrapers. They will get the contents checked with Copyscape, with the Google Analytics showing some results for Copyscape, and then they take the necessary steps.

  38. Auren Lengkong Feb 12, 2012 at 4:06 pm

    In my opinion you can fight and win, but you still lose in one respect: you will lose your time and health dealing with content scrapers. I have a question: will a content scraper’s post outrank our original post? Will our page be devalued by Google when there is duplicate content?

    I would rather take the benefit from content scrapers.

  39. While I certainly don’t track down every content thief, I do pay attention to scrapers and go after them — especially the ones stealing all of the blog content. I’m a writer. I make my living largely blogging for clients (and for myself). When my blog posts appear for free on sites without permission and those bloggers are monetizing them, it cheapens my work in the eyes of prospects and clients. That’s just as bad as someone stealing images that a photographer might want to sell in product form.

    I’ve had several sites shut down over the years and even more forced to remove specific posts. A DMCA takedown notice to the host is always a good idea. And if you automate the infringement searches and you use templates for the notices, it really doesn’t take much time or effort to go after them.

    I also tell other writers and bloggers to consider approaching two other groups which are better at hitting thieves where it hurts:

    1. Search engines — if the stolen content is appearing in results at all, especially anywhere near (or above) your own

    2. Advertisers — they can have their ad network account access suspended over copyright infringements which can not only knock out their income on that site, but on any site where they’re using the network’s ads (they often have scraper sites in more than one niche)

    Learn to craft a firm cease and desist letter, and you rarely even have to go that far. The vast majority yank the content down within 48 hours when I send them a demand letter to that effect.

  40. Great info in this post and in the comments! I’ve had several of my posts “scraped” and I always comment on the blog as soon as I find it to express my extreme ire and demand the poached post be immediately removed. If it is not, I proceed with further action but have so far gotten cooperation. Some of the scrapers apologize and pretend not to realize that they are stealing content because they included all my credits, but I am careful to educate them about posting content I wrote on their site without my knowledge or permission as being a hybrid of pirating and plagiarism, scraping. I always ask them if they would like it if I copied their posts and put them on my blog without them knowing about it. Another thing that really bugs me about scrapers – they are never good sites that steal my content, they are crappy sites and some are even questionable, that I don’t want to be associated with in any way.

  41. Thank you so much, this is very interesting! I’m a victim of content scrapers!

  42. Thanks for this info. It is still unclear to me; if google finds duplicated content taken from my site will google then consider that original content still on my site as being duplicate content and negatively impact my site or will google recognize that my site has the original content and not impact my site?

  43. This is what I wanted to find on the web. It will definitely help me a lot. We have seen many scrapers of our blog’s content.

    Best Regards.

  44. Thank you for the post!

    Someone (psdto-magento .info) copied our site and is advertising it via Google AdWords. They are even displaying my site name as well.

    Please suggest what to do to take down their website.

  45. Wonderful post Kristi Hines, intuitive and clear too.

    I have a big, important question.

    In the case of article directories or blog directories, one can post their article on the directory’s website exactly as it appears on their own blog. You can simply copy and paste content from your blog into their post area and publish it there for whatever benefit one is searching for, such as exposure.

    Is that stealing/scraping?

    The content will obviously appear somewhere in the internet exact; title and body including the links within.

    Being the one who did do the copy and posting, can I be labelled as a scraper of my own content?

    If that site, lets say the article directory, has advertisements which is their sole income earner, does that mean they are benefiting financially by having content such as the one I did post on them?

  46. OK, scrapers: Grrr!! Snippet scrapers: Hmm (so long as they don’t appear above you in Google…). But what about a scraper that reposts your content in their own language?

    By complete chance we discovered a Russian website had taken all of our post content but translated them completely into Russian! Hard to track down were it not for a freak coincidence and this leaves a language barrier between us when attempting to get them to remove content.

    Thank you so much for the RSS footer plugin suggestion.

  47. It’s a shame that this goes on as much as it does. On one of my other websites, I have a site that does pingbacks just about every day. At first, I thought it was a good thing, but I see now that I am supplying all their content for their blog!

  48. A great help for me. Thank you very much for this informative post. I really appreciate your effort. And now I am using the plugin by Yoast to get backlinks if someone scrapes my content via RSS feeds!!

  49. Pam Komarnicki Jun 13, 2012 at 8:11 pm

    Thanks for the information! In the spirit of stealing, I hope you don’t mind that I copied your format for the RSS Footer plugin. :) I’ve been using it for a while, but I liked the way you added the links to Facebook and the rest. And as far as making sure you use internal links on each post in order to get notified via trackbacks if the article shows up anywhere else, thanks for the reminder! I had forgotten that. You did note that they can end up in the spam folder if using Akismet. Do you have any ideas how to avoid that? Or do I just have to go in periodically and check?

  50. Great post Kristi,

    I use Copyscape and Google Alerts, but in a slightly different way.
    Many content scrapers change the post’s title, so I don’t think it would be effective to set the Google Alert for the post’s title.
    What I do is choose a couple of sentences: a sentence in the first paragraph and a sentence somewhere in the middle.
    Or two consecutive sentences in the first paragraph and two consecutive sentences in the middle.

    This has always given me better results.

    Thank you for the great post.



  52. Thanks for the info! Used it and found someone stealing our content before I even finished reading your article! Creating unique content is our goal, and I am shocked that someone took it upon themselves to use our articles almost in their entirety as their own.

  53. According to Google, content scraping is called Content Syndication. Here is a snippet from webmasters support:

    Syndicate carefully: If you syndicate your content on other sites, Google will always show the version we think is most appropriate for users in each given search, which may or may not be the version you’d prefer. However, it is helpful to ensure that each site on which your content is syndicated includes a link back to your original article.


    My question is: if the copied content has a link back to the original author’s blog, is it not considered spam? What I understood from the above text block is that Google will even index the syndicated post and display it in the search results (if relevant to the user).

  54. Thanks for the tips! I just had one of my posts “curated” for the first time (that I know of) and I feel weird about it… I put a lot of work into it, and some guy has a blog that’s just filled with content he takes from other people? It’s only a partial post and there is a link to my site, but I would have liked it if I was asked, or even just a comment to let me know about it… seems like that would be the expected etiquette? I haven’t seen any traffic to my site from it yet, so time will tell… maybe I’ll see the bright side of it. I’ll take the steps you recommend with the RSS feed for any future instances, thanks.

  55. Thanks Kristi. You just helped me to catch someone who had stolen my content. As advised I have contacted them and they pulled it out of their website.

  56. jeffry ng darwis Sep 07, 2012 at 8:17 am

    Great post ,
    My site is about tech gadgets and web 2.0, so I do many product reviews on it. Some I do hands on, and testing a single gadget like the Samsung Tab 2, for example, takes me 2 or 3 days.
    Some sites just blatantly copy the full post, and it really hurts. If it’s a smaller site then maybe I don’t care much, but the site that copied my content is big and has a huge follower base. Stealing my work 100 percent without even giving some credit back to me really hurts…
    I’m just 6 months into blogging but I intend to make it big… Meeting a thief like this really slows me down, but I’m going to keep it up anyway ^^

    Once again, thank you very, very much for your post ^

  57. Sarah Gooding Sep 07, 2012 at 3:29 pm

    I don’t know who’s worse. You or them.
    You scan the net for your keyword competitors’ content and copy their ideas. Many of the guest posts you write aren’t based on your own experience and the blog owners know this but use you for your content.

  58. But why on earth do these exist in the first place?! What possible benefits could they get out of scraping for content and who reads these phony websites?

  59. Great article, I am starting to blog now and I do worry about intellectual property theft, etc. So far I haven’t had anyone (to my knowledge anyway) stealing my content (early days I guess; or just poor content I produce!) however I had people stealing photographs of projects I have the copyright of and show my work (house extensions, home design, etc; I am an architect). Any idea how I can find sites that used my pictures?

    Thanks a lot for any feedback.

  60. I received a comment today telling me that I have no right to demand or ask for a link back for ‘fair use’ excerpts copy/pasted from my site and placed somewhere else, whether it be an online forum, blog or newspaper. The commentator said I was entitled to attribution, but nothing more.

    Doing a search to understand if this is true, I landed here. Do you have any expertise or knowledge on this?

  61. You know what, I found one of my best articles a few years ago on another site. I was so mad that I couldn’t think straight and obviously didn’t know what to do about it. Now that I’ve read your article, I will take precautions and hopefully this will never happen again. Thanks! Great article :)

  62. Hi Kristi,
    I liked your post. It was very informative. I have to tell you that I am a total novice in Internet language and hence I always am lost after a while. I need a little help and I hope you can help me solve my problem.
    I have my own blog
    It is a food blog.
    The problem I am encountering is that one particular post of mine (Quick Red Hot Salsa) seems to have suddenly become very popular and I have been receiving a lot of appreciative comments, but all these readers seem to be promoting “cigarettes” and hence these comments are automatically going into the spam box. I have requested often that they do not add their web address, but to no avail.
    I do not want to promote any cigarettes or anything detrimental to good health.
    I would like the comment to be posted but without their web address that promotes ill-health.
    What should I do? (In simple steps for dummies like me.)
    Tulika Chari

  63. Hi Kristi,
    I was just looking through the comments on your post to see if people had similar problems, and I found that one of the comments (the top 5 lines) is exactly the same as one I have received.
    Does that mean something? Or is it just that the person uses the same language everywhere, including the time they spent on the Internet?
    Tulika Chari

  64. A company called “turnitin” markets an “Originality Checker” to schools for grading papers, and brags about how it has been scraping content from millions of websites to accomplish this.

    The problem is, especially if your website is e-commerce, that they are scraping your content and making money from it. This is, of course, illegal, as they are violating copyright law.

  65. Yes, I have seen my original content on other blogs without my permission; they have stolen my content! Because of these spammers, my site was penalized by Google!
    I still don’t know how to deal with copied content. Can anyone tell me how to deal with those spammers and content thieves?

  66. There’s been some posting around about, which is doing a lot of frame scraping–ie, they are stealing your traffic by framing your content. Adding this javascript to your header will stop your page from being framed like that.

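    // Frame-buster: if this page is loaded inside a frame, reload it as the top-level page.
    if (window !== top) { top.location.href = window.location.href; }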

  67. Hmmmm.. my javascript got edited out! anyway, do a search for the above, and you should find a google post about it.

  68. Thanks for this post… I always wondered what Trackbacks were lol

  69. Thanks a lot for sharing such a wonderful information..

  70. very nice information

  71. Well, very nicely written, but unfortunately there isn’t any way to stop them.

    The best thing you can do is insert your site links in your posts, i.e. link your blog posts to one another. It’s simply the best way to handle this; then no one will copy from you :)

  72. Do you think Google Authorship is having any impact on the content that is getting scraped? I am curious whether it makes a difference if Google knows where the article came from.

  73. Thanks! I bookmarked this so I can come back and try a few of the suggestions at a later time.

  74. Nice and very important article!

    Here is an example of a site called: that has clearly and blatantly infringed on one of our sites called: They have copied almost everything to a tee and have moved (OUR) content from Germany to Russia and back again whenever we have approached their hosts threatening action. These guys are a notorious group of spammers/scammers as they have been doing this to several technology sites. It’s tricky to stop them as the laws in terms of DMCA take-down notices are not always relevant when the information/data is hosted outside of North America.

    Some of your suggestions are a good starting point: you can even try to work with the .htaccess file and use a CDN service or threaten DMCA take-downs. However, nothing is foolproof at this time. It’s still a work in progress, and we are working hard on future prevention of such shady business practices.

    To be continued….


  75. Douglas Laney Dec 27, 2012 at 9:18 pm

    Thanks. Interesting stuff, but not sure about Copyscape’s efficacy. Google found several blogscrapes of mine (I searched for a unique string of text), while Copyscape found none. –Doug Laney, VP Research, Gartner, @doug_laney

  76. Great article. Now I have to work on it too… I have seen lots of sites copying my articles.

  77. I just had a 2,000 word article on preppers that I found on someone else’s site when I was trying to see how it ranked. My site was pushed back behind the duplicate content comment on google so no one could find it. I sent him a bill via paypal and said he had 24 hours to pay it or remove the article or I would contact his advertisers and hosting company. He removed it.

  78. Thanks for sharing, good to know.

  79. Hello, would you mind stating which blog platform you’re using? I’m planning to start my own blog in the near future but I’m having a tough time making a decision between BlogEngine/WordPress/B2evolution and Drupal. The reason I ask is because your design seems different than most blogs and I’m looking for something unique. P.S. My apologies for getting off-topic but I had to ask!

  80. Wonderful blog you have here but I was curious if you knew of any message boards that cover the same topics discussed here?
    I’d really like to be a part of group where I can get suggestions from other experienced people that share the same interest. If you have any suggestions, please let me know. Thanks!

  81. Hola! I’ve been following your web site for some time now and finally got the courage to go ahead and give you a shout out from Kingwood Tx! Just wanted to mention keep up the great work!

  82. My brother suggested I might like this website. He was totally right. This post truly made my day.

    You can’t imagine just how much time I had spent looking for this information! Thank you!

  83. We cannot stop content scraping now… Some of the big websites scrape too, but the good thing they do is credit the original author for every site they scrape. Well, that’s the right thing to do. Remember, sharing is caring and Google loves that :)

  84. Thanks for sharing this wonderfully informative article, but I agree with Liza too: we cannot stop content scraping. Let Google punish those scrapers.

  85. This continually is amazing to me how bloggers such as yourself can find the time and also the dedication to keep on creating fantastic blog posts. This is wonderful and one of my have to read on the web. I simply want to say thanks.

  86. Fantastic post! I never knew about the linking part in Webmaster tool. I had no idea that one could do those stuff in Webmaster tool. Thanks for this informative article.

  87. Howdy, I read your blog occasionally and I own a similar one, and I was just wondering if you get a lot of spam remarks? If so, how do you reduce it? Any plugin or anything you can recommend? I get so much lately it’s driving me crazy, so any support is very much appreciated.

  88. You have given a very roundabout way of combating content scraping… I was using the atcontent plugin for some time and it was good… but I think too many plugins just slow you down… Webmaster Tools is a better idea and you can know what’s happening, but you have to know your way around…

    I get some spam comments. Do they steal your traffic?

    Visit my site if you can and point out any suggestions if you want.

  89. You stated: “Unfortunately, unless you want to continuously search for your post titles in Google, you’ll only be able to easily track down sites that keep your in-content links active.” Good news – there is another simple way to find poachers. Perform a Google search on a unique phrase. Google will lead you to the sites that are using your content. I have done this in the reverse direction and that’s how I found you! No, you are not the poacher. I was sent an email that pointed to a file that contained a modified form of your hard-earned content. I suspected that It was ripped-off. I found you with this technique and I informed you about it. I didn’t follow-up on your response – who knows what happened. Good luck on keeping-them-honest. Regards, Sid
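
A minimal sketch of the exact-phrase search Sid describes, in JavaScript. The helper names `longestSentence` and `exactPhraseSearchUrl` are hypothetical, and picking the longest sentence as the "unique phrase" is an assumption, not something the comment specifies; any distinctive sentence from the post would do.

```javascript
// Hypothetical helper: pick the longest sentence of a post as a likely-unique phrase.
function longestSentence(text) {
  const sentences = text
    .split(/(?<=[.!?])\s+/) // split after sentence-ending punctuation
    .map((s) => s.trim())
    .filter(Boolean);
  return sentences.reduce((best, s) => (s.length > best.length ? s : best), "");
}

// Hypothetical helper: build a Google search URL for an exact-phrase (quoted) query.
function exactPhraseSearchUrl(phrase) {
  // Drop any double quotes and collapse whitespace so the phrase stays one quoted unit.
  const cleaned = phrase.replace(/["“”]/g, "").replace(/\s+/g, " ").trim();
  return "https://www.google.com/search?q=" + encodeURIComponent(`"${cleaned}"`);
}

// Example usage with a made-up post excerpt.
const post = "Short intro. This is a fairly distinctive sentence from my post! Bye.";
console.log(exactPhraseSearchUrl(longestSentence(post)));
```

Opening the printed URL in a browser runs the quoted search; results other than your own site are candidates to check by hand.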

  90. I have the same problem, and also an indexing speed problem. So many sites are copying my articles via RSS. Is there any way to know which article Google indexed first, and who is the owner of the original content?

  91. Hello! I understand this is kind of off-topic but I needed to ask.
    Does running a well-established blog such as yours require a large amount of work?
    I’m completely new to writing a blog; however, I do write in my diary on a daily basis. I’d like to start a blog so I can share my experience and thoughts online. Please let me know if you have any kind of ideas or tips for new aspiring blog owners.

  92. I leave a response each time I especially enjoy an article on a website or if I have something to add to the discussion.
    It’s a result of the fire displayed in the post I looked at. And on this post, Content Scrapers – How to Find Out Who is Stealing Your Content & What to Do About It, I was excited enough to leave a comment ;-) I actually do have 2 questions for you if you don’t mind.

    Could it be just me or does it look like a few of the comments come across as if left by brain-dead folks? :-P And, if you are posting on other online sites, I’d like to follow you. Could you make a list of all your public pages like your Facebook page, Twitter feed, or LinkedIn profile?

  93. wow !.. within 10 mins of reading your article, I catch one scraper .. tx

  94. A very comprehensive article on the tips. I have been facing this issue for a month now. I guess it is the right time to file a DMCA notice, but do you think requests from new bloggers are heeded by the associations?

  95. Hey, are you using WordPress for your blog platform? I’m new to the blog world but I’m trying to get started and create my own.
    Do you need any coding expertise to make your own blog?

    Any help would be greatly appreciated!

  96. Whoa! Worth sharing. Filing a DMCA notice and sending an email to that person can be more effective at stopping them from copying your content. Otherwise, you can paste code to disable the copy option on your web page and images.

  97. Hi Kristi,
    Thanks for the great post! I’m new to blogging, and your post will help people like me and veterans alike.

    Peace, Jason

  98. Thanks for this post. I was going to install the RSS plugin you mentioned but from one of the comments we discovered Tynt which is perfect for people copying and pasting. I might see if we can run both so nobody can copy our blog material without us being aware!

  99. You might not come across my comment amid the 200+ comments on this page – but great job done on the research here. The Copyscape link is very useful – I just tried a few links and found some copied content! I arrived on your blog searching for how to report content theft, because I discovered a blog that has been copying my entire feed or something – exact text and pictures – on their blog!

    A great post, Kristi. Thanks!

  100. In my opinion it is best to leave content scrapers alone. Filing DMCA reports and other sorts of legal stuff is not a good approach because:

    1) You are wasting a hell of a lot of your time fighting rather than concentrating on your future posts.
    2) Even if you take them down, chances are that more content scrapers are going to come in the days ahead.
    3) Google is clever enough to spot content scrapers and devalue them. When Google does this job, why don’t you just leave it to them?

    It’s not about getting link benefit. As you said, they don’t have any value. But smile cheerfully when somebody is scraping your content because it means that you are getting popular. Search engines would definitely think that you have a great website that people are copying like crazy.

    But importantly, make sure your site gets indexed quicker than the scrapers. This can be done by setting up an XML sitemap and making the navigation clean and easy for users.
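
The XML sitemap mentioned in this comment could be generated along these lines. This is only a sketch, assuming the sitemaps.org 0.9 format; `escapeXml`, `buildSitemap`, and the example.com URLs are hypothetical names and placeholders, not part of any plugin discussed here.

```javascript
// Escape characters that are not allowed raw inside XML text.
function escapeXml(value) {
  return value
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Hypothetical helper: build a sitemap document from [{ loc, lastmod? }, ...].
function buildSitemap(pages) {
  const entries = pages.map(({ loc, lastmod }) => {
    // <lastmod> is optional; include it only when a date was supplied.
    const lastmodTag = lastmod ? `\n    <lastmod>${escapeXml(lastmod)}</lastmod>` : "";
    return `  <url>\n    <loc>${escapeXml(loc)}</loc>${lastmodTag}\n  </url>`;
  });
  return [
    '<?xml version="1.0" encoding="UTF-8"?>',
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ...entries,
    "</urlset>",
  ].join("\n");
}

// Example usage with placeholder URLs.
console.log(
  buildSitemap([
    { loc: "https://example.com/", lastmod: "2013-07-13" },
    { loc: "https://example.com/new-post?ref=feed&lang=en" },
  ])
);
```

The output would typically be saved as sitemap.xml at the site root and submitted through Webmaster Tools, so new posts are discovered quickly.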


  101. It’s awesome to visit this web page and read the views of all friends concerning this piece of writing, while I am also eager to gain familiarity.

  102. Hi there! This post couldn’t be written any better!
    Reading this post reminds me of my previous room mate!
    He always kept chatting about this. I will forward this article to him.
    Fairly certain he will have a good read. Many thanks
    for sharing!

  103. Amazing blog! Do you have any tips for aspiring writers? I’m planning to
    start my own blog soon but I’m a little lost on everything.
    Would you propose starting with a free platform like WordPress or go
    for a paid option? There are so many options out there that I’m totally confused ..

    Any tips? Bless you!

  104. Asking questions is really a pleasant thing if you do not understand something fully, but this article gives a nice understanding.

  105. My family members always say that I am killing my
    time here at web, but I know I am getting experience every day by reading such
    good articles.

  106. Very nice post. I just stumbled upon your blog and wished to say
    that I have truly enjoyed surfing around your blog posts.

    In any case I’ll be subscribing to your RSS feed and I hope you write again very soon!

  107. interesting write up…i will try them out because i want to protect my content

  108. I think the admin of this web page is actually working hard
    in favor of his site, because here every stuff is quality based stuff.

  109. Hi there! I just wanted to ask if you ever have any problems
    with hackers? My last blog (wordpress) was hacked and I ended up losing months of hard work due to
    no backup. Do you have any methods to protect against

  110. I’ve been browsing online for more than 3 hours today, yet I never found any article as interesting as yours.

    It’s well worth it for me. Personally, if all webmasters and bloggers made content as good as yours, the net would be much more useful than ever before.

  111. There is a possibility that the footer plugin would actually do damage: if too many websites are stealing your content for their blogs at the same time, you will end up with a spammy backlink profile, so instead of gaining free advertisement you would actually be harmed by it. Just a thought, no case studies.

  112. I think what you posted made a bunch of sense. But what about this? What if you added a little content? I’m not suggesting your information isn’t solid, but suppose you added a post title that makes people desire more? I mean, Content Scrapers – How to Find Out Who is Stealing Your Content & What to Do About It is a little plain. You could look at Yahoo’s front page and see how they create news headlines to grab people’s interest. You might add a related video or a related pic or two to get readers excited about what you’ve got to say.

    In my opinion, it would make your posts a little bit more interesting.

  113. Hi, I do think this is a great website. I stumbled upon it ;) I am going to revisit yet again since I have bookmarked it. Money and freedom is the greatest way to change; may you be rich and continue to guide other people.

  114. In actual fact, in LinkedIn’s annual year-end report of which skills got people hired the most, SEO was #5!

  115. I totally agree with Sam. Everyone wants their website to be popular and to find a way to bring visitors to their pages, and even pays for AdWords to drive traffic to their site. How about a lot of websites linking back to yours? Isn’t that good enough?

    “Sam Jul 13, 2013 at 4:38 am
    In my opinion it is best to leave content scrapers alone. Filing DMCA reports and other sorts of legal stuff is not a good approach because:

    1) You are wasting a hell of a lot of your time fighting rather than concentrating on your future posts.
    2) Even if you take them down, chances are that more content scrapers are going to come in the days ahead.
    3) Google is clever enough to spot content scrapers and devalue them. When Google does this job, why don’t you just leave it to them?

    It’s not about getting link benefit. As you said, they don’t have any value. But smile cheerfully when somebody is scraping your content because it means that you are getting popular. Search engines would definitely think that you have a great website that people are copying like crazy.

    But importantly, make sure your site gets indexed quicker than the scrapers. This can be done by setting up an XML sitemap and making the navigation clean and easy for users.”


  116. Thanks for making such a cool post which is really very well written.will be referring a lot of friends about this.Keep blogging.

  117. What’s up, I log on to your blogs regularly. Your story-telling style is witty,
    keep doing what you’re doing!

  118. Great post. I was checking this blog continuously and I am impressed! Very helpful information, particularly the last part :) I care for such information a lot. I was looking for this certain info for a long time. Thank you and best of luck.

  119. You have an extremely helpful website; I have been here reading for about an hour. I’m a beginner and your success is very much an inspiration for me.

  120. You actually make it seem really easy with your presentation, but I find this matter to be something that I feel I would never understand. It seems too complicated and extremely broad for me. I’m looking forward to your next post; I will try to get the hang of it!

  121. Do you have any video of that? I’d love to find out more details.

  122. Very nice and informative post, exactly what I was looking for!!
    Thanks :D :)

  123. The ROI on this is minimal. Who has the time for all of this stuff? This is exhaustive for what in the end? Google rankings don’t earn any money. I was listed many times as a top leadership and small business expert on Twitter. At the end of the day it provided no significant ROI.

    Everyday there is a new company improving upon Twitter. However most people focus so much on followers they have forgotten their job is to sell something. This is the same. With all due respect, even if one were to master your suggestions, so what? This is tantamount to middle school. Sure it is our intellectual property. But, really? As author Richard Eyre said, everybody is plagiarizing somebody because in reality there aren’t any new ideas.

    One can pay a person to pen a blog and then make one’s phone calls to make sales. I have four blogs. ROI-wise they produce views but little money. A blog with 100,000 views a month means nothing unless it is producing a significant income. Views, i.e. followers, do not equal cash flow. The cash cow for us is dialing.

  124. Remarkable! It’s a genuinely awesome piece of writing; I have got a much clearer idea about this from this paragraph.

  125. Your way of describing the whole thing in this article is in fact good; everyone can understand it without difficulty. Thanks a lot.

  126. I have found out that my entire ebook and PDF library has been posted to Amazon AWS by someone and is being downloaded illegally from there. I can tell, because I’m getting traffic from the stolen PDF documents as they click on links to my site. The stolen PDF is hosted on Amazon and shows up as a referrer in my stats.

    But I’m at a total loss as to how to find out who has done this, and where these links are online for me to track this down and put a stop to it. I can’t have all my sales leaking out this way!
    Anyone have any advice on how to find the site responsible?
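
One way to start narrowing this down is to tally the referrer hosts from your stats. A rough sketch, assuming you can export the referrer URLs as a list of strings; `countReferrerHosts` and the sample URLs are hypothetical. It only shows which hosts are serving the copies (useful when filing a report with the host), not who uploaded them.

```javascript
// Hypothetical helper: count visits per referrer host, most frequent first.
function countReferrerHosts(referrers) {
  const counts = new Map();
  for (const ref of referrers) {
    let host;
    try {
      host = new URL(ref).hostname.toLowerCase();
    } catch {
      continue; // skip blank or malformed referrer values
    }
    counts.set(host, (counts.get(host) || 0) + 1);
  }
  return [...counts.entries()].sort((a, b) => b[1] - a[1]);
}

// Example usage with made-up referrers exported from a stats tool.
const sample = [
  "https://example-bucket.s3.amazonaws.com/ebook.pdf",
  "https://blog.example.com/post",
  "https://example-bucket.s3.amazonaws.com/guide.pdf",
];
console.log(countReferrerHosts(sample));
```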

  127. I heard if someone else copies your content and his/her site is old and got good SEO then your post will be considered as a copied one… Is it true???

  128. James Mcguire Apr 21, 2015 at 10:16 pm

    I was doing a check to see if one of my writers had copied work but I saw once in the search results that there was a bunch of results with the exact work I had up on the site my content was published on.

    I’ve never used Copyscape but it looks like it’s worth trying out. I never gave it much thought only because the work that was being copied from me was back in the day when I posted on Ezinearticles a lot, so no big deal for me at least. And I’m sure/hope Google has something for this in place so the original owner doesn’t get penalized.

  129. Good article for professional bloggers.

  130. You won’t find content thieves in WT. Just copy and paste a whole sentence from an article into Google, in quotes, and you’ll see how many automated blogs steal your content.

  131. It’s useful information about content scrapers, but my question is: is there any good software available on the market to block content scrapers, web harvesting, or web data extraction?

  132. I think instead of filing a DMCA report we should first ask for credit via a link, as this will be a win-win situation. If someone is copying repeatedly without giving you credit, then we should go for DMCA as a last option.

  133. Hello Kristi,
    Before commenting on this post I would like to share a short story (my personal experience).
    A couple of years ago someone republished a post from one of my blogs. I tried to contact him, but there was no contact information on the website. I used the Whois Lookup option and luckily the domain was not privately registered. I sent him an email asking for a link back to the original article. I sent reminder 1, reminder 2 (3 emails in a week), but there was no reply…
    He was hosting his website on Hostgator. I simply contacted his host and told them the whole story. You won’t believe it: Hostgator removed his entire website within the next 24 hours. :)
    My Comment on This Post
    (Again, in the light of my personal experience).

    I’d prefer to go with the first part of this post. Instead of getting the content removed from their websites, I believe it is better to leave them as long as they are linking back to the original post.

  134. I work for a major e-commerce company in Illinois. We have had people scrape our content and it is a serious problem. When you have 150,000 backlinks suddenly to your site, it might seem like a fantastic link building strategy at first, but the ratio between domain names and inbound links is a quality metric.

    150,000 links from one site and 130,000 from another scraper is 2 domain names = 280,000 links. NOT Good! Especially when the difference between a major product page ranking 4th instead of 2nd costs $50,000-150,000/ year.

    Contact those scrapers and make sure they don’t do it again. If you don’t defend your intellectual property, you have no claim to it. That’s the law in the U.S.
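
The ratio this comment describes can be worked through with a few lines of JavaScript. This is an illustrative sketch only: `linkProfile` and the `.example` domain names are hypothetical, the two link counts are the ones quoted in the comment, and no particular threshold for a "bad" ratio is implied.

```javascript
// Hypothetical helper: summarize inbound links per referring domain.
// backlinks: array of { domain, links }
function linkProfile(backlinks) {
  const totalLinks = backlinks.reduce((sum, b) => sum + b.links, 0);
  const domains = new Set(backlinks.map((b) => b.domain.toLowerCase())).size;
  return {
    totalLinks,
    domains,
    // Guard against an empty list so we never divide by zero.
    linksPerDomain: domains === 0 ? 0 : totalLinks / domains,
  };
}

// The two scrapers from the comment: 150,000 and 130,000 links.
console.log(
  linkProfile([
    { domain: "scraper-one.example", links: 150000 },
    { domain: "scraper-two.example", links: 130000 },
  ])
);
// → { totalLinks: 280000, domains: 2, linksPerDomain: 140000 }
```

A profile with 140,000 links per referring domain is the lopsided pattern the commenter is warning about.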

  135. Maybe I’m not understanding something, but how does the RSS footer plugin help if thieves are scraping the PAGE itself, not the RSS feed?

  136. Doreen Pendgracs Mar 15, 2016 at 10:35 am

    The subject of content scraping became real in my world when I noticed on last month’s Google Analytics report for my site that a particular site had been to my site 103 times in the past 4 weeks. I checked their link and it appears it is a start-up scraper, so maybe they can be stopped before they even launch. You’ll find them at To my knowledge, they haven’t yet done anything with my content, but I could be wrong! Any specific suggestions with respect to “A1-Writer”?

  137. Hola! I’ve been reading your blog for a long time now and finally got the courage to go ahead and give you a shout out from Porter, Texas! Just wanted to tell you to keep up the great job!

