Diễn Đàn SEO Panda - SEO Panda Forum

Diễn Đàn SEO Panda Dành Cho Các SEOers Tự Do Thảo Luận SEO - SEO Panda Forum - Free SEO Forum to share your knowledge to the world
 
HomeCalendarFAQSearchMemberlistUsergroupsRegisterLog in
Search
 
 

Display results as :
 
Rechercher Advanced Search
Latest topics

Share | 
 

 Guest Blogging for Links? Choose a Heavily Scraped Site!

View previous topic View next topic Go down 
AuthorMessage
khiemsound



Posts : 1016
Points : 11625
Join date : 2012-03-27

PostSubject: Guest Blogging for Links? Choose a Heavily Scraped Site!   Sun Apr 22, 2012 9:29 am

Today, I’d like to share an observation I made after analysing new back links acquired from guest blogging on Search Engine Journal and getting promoted to the main blog at SEOmoz. It’s really interesting how the more popular, high authority domains get copied (scraped) so frequently by other sites that have pagerank or are sometimes even functioning companies in their own right.

Could these scraper sites pass any value through their outbound links and as a consequence, can the process of guest blogging on well scraped sites be levered to work positively for your SEO?
Blogs get scraped

Ever since the introduction of WordPress plugins such as Wp-O-Matic scrapers have become a fact of life. Blogs get scraped, particularly, the larger, more successful and regularly updated sites. Take this popular post on Search Engine Journal for example – there are 78 instances of the and first line of the opening paragraph according to Google’s index. If you take a look at any blog in the Adage Power 150 or Technorati’s Most Popular you’ll be sure to find their posts duplicated hundreds if not thousands of times elsewhere.<br /><br />Scraping is with us and it’s here to stay, but can that fact be used to add short to medium term value to our SEO campaign? Back in April 2008, I wrote a Youmoz post about my good friend Dan Faircloth. Dan’s an engineer at the Rutherford Appleton Laboratory and specialising in particle accelerators, not SEO, he hadn’t attracted many links to his (rather new) domain at the time. After the post got published, the links back to his site increased quite significantly. They were all links from sites scraping the original Youmoz article. The best part was all of the scraped links were using our targeted anchor text. Soon after, Dan was ranking in 1st place for his own name, which was exactly what we had intended.<br />Do links in duplicated content pages still pass value?<br /><br />In my opinion, yes they do. There doesn’t even seem to be a limit to the number of times you can duplicate a page across unique domains to pass link value. You’d expect (or hope) that pages triggering the duplicate content filter at Google would have the value of their outbound links nullified, but I don’t see this happening in many cases. It’s not up to me to out specific examples of this, we’ve all seen it happening. If you haven’t, I’d suggest finding a high competition market and analyse the backlinks to a few domains. If you start seeing links from sites like articleblast.com, goarticles.com and articlesbase.com just do an exact match query in Google for some of the text you find and you’ll find your duplicate articles and inbound links.<br />Case study: Scraped post at SEOmoz<br /><br />I decided to take a look at my post (titled “SEOmoz Tools – Top Pages on Domain Kick Ass”) published on SEOmoz a few weeks back. At the base of the article, there is a link back to my site using the anchor “SEO Consultant in London“. It’s not a particularly competitive phrase (nor is there much traffic) but, nonetheless, it’s a valid term and one for which SEOgadget ranked third for until a week or so ago. The article was scraped by at least 21 other domains, the data on which I gathered by using an “intitle” query on exact match for the post title and a randomly chosen sentence from the content, also on exact match.<br />How do you find scraped content?<br /><br />My favourite way is just to use a search engine. In this example, I have used an “intitle” operator and a section of text that could only have appeared in the article in question.<br /><br /><br /><br />You could use Copyscape to do the same thing, though I have found the results to be less useful and not as fresh as the main search engines. You’ll end up going to Google in the long run. Whether you’re familiar with anti plagiarism tools online or not, it’s worth checking your own site. You might be (unpleasantly) suprised.<br />Data captured<br /><br />To answer my question: “Could scraper sites pass any value?” I needed to collect some data. For each of the scraped articles, I collected the following information:<br /><br />- URL and Domain Pagerank<br /> - SEOmoz Domain MozRank and Domain MozTrust<br /> - Comments on the article (How the original has been scraped and played back to the user on the new page)<br /> - The search engine used to find the article (Yahoo or Google)<br /><br />You can download my raw data from this URL. (Office 2007 Excel).<br />Common forms of scraping<br /><br />The most typical form of scraping was to directly copy the original post HTML and present the content back to the audience of the scraper site. In many cases, the original links to SEOmoz.org had been removed and replaced with the host domain. One site had taken a copy of the page and nofollowed all of the external article links. Frequently, the scrapers were citing a Google feed proxy URL as the “original” source of the content. The remaining pages were displaying only the first paragraph of the page content and linking back to the original with either a do followed or no followed link.<br /><br />Though all forms of scraping are quite annoying if you’re a site owner, the worst instances (IMO) are when the original links in the article are replaced with internal links elsewhere on the scraped site. No value whatsoever is passed back to the original author, nor the sources the original author cited as valuable. I did find that specific domains were being removed rather than all external links – i.e “seomoz.org” was replaced where “seogadget.co.uk” was not.<br />Google Pagerank<br /><br />Though none of the urls had yet been awarded pagerank, out of the 21 scraping sites found, 17 of the domains had a Google pagerank between PR6 and PR1:<br /><br /><br /><br />SEOmoz Domain MozRank and Domain MozTrust<br /><br />16 of the 21 sites found had MozRank and MozTrust – the most trusted and ranked sites being quite high (6.03 DmR and 6.24 DmT). These values are higher than SEOgadget, which has a DmR of 4.39 and a DmT of 5.28. None of the scraped page URLs were in the Linkscape index and didn’t have their own metrics available.<br /><br /><br /><br />Conclusion<br /><br />Most of the site domains included in the sample data have Pagerank, MozRank and MozTrust. Some of them are in fact perfectly “authoritative” sites in the eyes of search engines such as Google and backlink value analysers such as Linkscape, which would imply they are capable of passing link value. I’m not saying scraping is good, but I am making a comment on their ability to pass value. There are a number of different methods of scraping and problems can be introduced during the scrape process such as bad HTML parsing, linking to RSS feeds and linking out to 404 error pages. That said, for the most part, links back to sources referenced in the posts tend to be left untouched, which (during this test) included the footer text left in the base of my articles. Authoritative domains pass value as search engines index new pages on those domains. Taking that fact into account, it is fair to assume that the scraped sites identified in this test will pass value via the outbound links in the scraped content. I’m still watching a few pages which have links from recently published, scraped posts to test this conclusion further.<br /><br />Recommendations<br /><br />My recommendation to anyone thinking of posting on a 3rd party blog is, given the likelihood of the target site being heavily scraped, think very carefully about your content’s outbound links, especially in the footer of the article. Use a sign off, referencing your site and the most important pages on your own blog. In my case, I use a footer link like this:<br /><br />Richard Baxter is an SEO Consultant in the UK and chief blogger at SEOgadget.co.uk. Come check out our latest SEO Jobs or, if you’re recruiting, post a job free.<br /><br />Finally, if you’re thinking of targeting a blog with an offer of a guest post, be sure to read Josh Klien’s “How to Guest Post to Promote Your Blog” and Darren Rowse’s advice on “How to be a Good Guest Blogger” to get yourself positioned in the right way when you’re authoring your content.</div><div class="clear"></div><div class="signature_div"><br />_________________<br /><a href="http://vuongquocgamemobile.com/ngu-de" title="ngu de">ngu de</a><br /><a href="http://taiiwins.info/ibet88" title="ibet">ibet88</a><br /><a href="http://taiiwins.info/phong-van-truyen-ky-mobile" title="phong van truyen ky">tai phong van truyen ky</a><br /><a href="http://vuongquocgamemobile.com/tai-iwin-online" title="iwin">iwin</a><br /><a href="http://vuongquocgamemobile.com/ionline" title="ionline">ionline</a><br /><a href="http://vuongquocgamemobile.com/gopet" title="gopet">gopet</a></div></div><span class="gensmall"></span></td></tr></table></td></tr><tr class="post--517" style=""><td class="row1 browse-arrows" align="center" valign="middle" width="150"><a href="#top"><img class="sprite-arrow_subsilver_up" src="https://illiweb.com/fa/empty.gif" alt="Back to top" /></a> <a href="#bottom"><img class="sprite-arrow_subsilver_down" src="https://illiweb.com/fa/empty.gif" alt="Go down" /></a></td><td class="row1 messaging gensmall" width="100%" height="28"><table border="0" cellspacing="0" cellpadding="0"><tr><td valign="middle"><a href="/u465" class="profile-icon" title="View user profile"><img src="https://hitsk.in/t/17/24/51/i_icon_profile.png" class="i_icon_profile " alt="View user profile" /></a> </td></tr></table></td></tr><tr align="right"><td class="catBottom" colspan="3" height="28"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td width="9%" class="noprint"> </td><td align="center" class="t-title"><a name="bottomtitle"></a><div class="cattitle">Guest Blogging for Links? Choose a Heavily Scraped Site!</div></td><td align="right" nowrap="nowrap" width="9%" class="browse-arrows"><a href="/t516p-guest-blogging-for-links-choose-a-heavily-scraped-site"><img class="sprite-arrow_subsilver_left" src="https://illiweb.com/fa/empty.gif" alt="View previous topic" /></a> <a href="/t516n-guest-blogging-for-links-choose-a-heavily-scraped-site"><img class="sprite-arrow_subsilver_right" src="https://illiweb.com/fa/empty.gif" alt="View next topic" /></a> <a href="#top"><img class="sprite-arrow_subsilver_up" src="https://illiweb.com/fa/empty.gif" alt="Back to top" /></a> </td></tr></table></td></tr></table><table class="forumline noprint" width="100%" border="0" cellspacing="0" cellpadding="0" style="margin: 0 0 1px 0; border-top: 0px;"><tr><td class="row2" valign="top" colspan="2" width="150"><span class="gensmall">Page <strong>1</strong> of <strong>1</strong></span></td></tr></table><table class="forumline" width="100%" border="0" cellpadding="1" cellspacing="0" id="ptrafic_close" style="display:none;margin: 1px 0px 1px 0px"><tr><td class="catBottom" height="28"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td valign="top"><div class="cattitle"> Similar topics</div></td><td align="right" valign="middle" width="10"><span class="gensmall"><a href="javascript:ShowHideLayer('ptrafic_open','ptrafic_close');"><img class="sprite-tabs_more" src="https://illiweb.com/fa/empty.gif" alt="+" align="middle" border="0" /></a></span></td></tr></table></td></tr></table><table class="forumline" width="100%" border="0" cellpadding="1" cellspacing="0" id="ptrafic_open" style="display:'';margin: 1px 0px 1px 0px"><tr><td class="catBottom" height="28"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td valign="top"><div class="cattitle"> Similar topics</div></td><td align="right" valign="middle" width="10"><span class="gensmall"><a href="javascript:ShowHideLayer('ptrafic_open','ptrafic_close');"><img class="sprite-tabs_less" src="https://illiweb.com/fa/empty.gif" alt="-" align="middle" border="0" /></a></span></td></tr></table></td></tr><tr><td class="row2 postbody" valign="top">» <a style="text-decoration:none" href="http://webartz.forumotion.com/t1004-open-every-link-in-a-new-tab" target="_blank" title="open every link in a new tab?" rel="nofollow">open every link in a new tab?</a><br />» <a style="text-decoration:none" href="http://seleniumforum.forumotion.net/t390-sample-ui-element-file-for-google-site" target="_blank" title="Sample UI Element file for google site." rel="nofollow">Sample UI Element file for google site.</a><br />» <a style="text-decoration:none" href="http://seleniumforum.forumotion.net/t546-did-you-commit-to-selenium-stack-exchange-site" target="_blank" title="Did you commit to Selenium-Stack Exchange site?" rel="nofollow">Did you commit to Selenium-Stack Exchange site?</a><br />» <a style="text-decoration:none" href="http://seleniumforum.forumotion.net/t805-urgent-how-to-get-links-under-specific-panel-div-table" target="_blank" title="Urgent --- how to get links under specific panel/ div/ table" rel="nofollow">Urgent --- how to get links under specific panel/ div/ table</a><br />» <a style="text-decoration:none" href="http://seleniumforum.forumotion.net/t1089-how-to-verify-the-url-displayed-after-click-the-links-is-correct-selenium-rc-and-webdriver" target="_blank" title="How to verify the url displayed after click the links is correct (Selenium RC and Webdriver)?" rel="nofollow">How to verify the url displayed after click the links is correct (Selenium RC and Webdriver)?</a><br /></td></tr></table><table class="forumline noprint" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="row2" colspan="2" align="center" style="padding:0px"><a name="quickreply"></a><br /></td></tr><tr><td style="margin:0; padding: 0;" colspan="2"><table border="0" cellpadding="0" width="100%" cellspacing="0" id="info_open" style="display:''"><tbody><tr><td class="row2" valign="top" width="25%"><span class="gensmall"><strong>Permissions in this forum:</strong></span></td><td class="row1" valign="top" width="75%"><span class="gensmall">You <strong>cannot</strong> reply to topics in this forum<br /></span></td></tr><tr><td class="catBottom" colspan="2" height="28"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td valign="middle" width="100%"><span class="nav"><a class="nav" href="/">Diễn Đàn SEO Panda - SEO Panda Forum</a><a class="nav" href=""></a> :: <a href="/c3-search-engine-optimization" class="nav"><span>Search Engine Optimization</span></a> :: <a href="/f10-link-building" class="nav"><span>Link Building</span></a></span></td><td align="right" valign="middle"><span class="gensmall"><a href="javascript:ShowHideLayer('info_open','info_close');"><img class="sprite-tabs_less" src="https://illiweb.com/fa/empty.gif" alt="-" align="middle" border="0" /></a></span></td></tr></table></td></tr></tbody></table></td></tr><tr><td style="margin:0; padding: 0;" colspan="2"><table border="0" cellpadding="0" cellspacing="0" width="100%" id="info_close" style="display:none;"><tbody><tr><td class="catBottom" colspan="2" height="28"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td valign="middle" width="100%"><span class="nav"><a class="nav" href="/">Diễn Đàn SEO Panda - SEO Panda Forum</a><a class="nav" href=""></a> :: <a href="/c3-search-engine-optimization" class="nav"><span>Search Engine Optimization</span></a> :: <a href="/f10-link-building" class="nav"><span>Link Building</span></a></span></td><td align="right" valign="middle"><span class="gensmall"><a href="javascript:ShowHideLayer('info_open','info_close');"><img class="sprite-tabs_more" src="https://illiweb.com/fa/empty.gif" alt="+" align="middle" border="0" /></a></span></td></tr></table></td></tr></tbody></table></td></tr></table><form action="/viewforum" method="get" name="jumpbox" onsubmit="if(document.jumpbox.f.value == -1){return false;}"><table class="noprint" width="100%" border="0" cellspacing="2" cellpadding="0" align="center"><tr><td align="left" valign="middle" nowrap="nowrap" ><span class="nav"></span></td><td align="right" nowrap="nowrap"><span class="gensmall">Jump to: <select name="selected_id" onchange="if(this.options[this.selectedIndex].value != -1){ forms['jumpbox'].submit() }"><option value="-1">Select a forum</option><option value="-1"></option><option value="-1">|</option><option tag="01" value="c1">|--Admin</option><option tag="01" value="f4">|   |--Paid Ads</option><option tag="01" value="f5">|   |--General Discussions</option><option tag="01" value="f3">|   |--Forum News - Review Sites</option><option tag="01" value="f1">|   |--Questions&Answers&Feedbacks</option><option value="-1">|   </option><option tag="01" value="c5">|--Free Market</option><option tag="01" value="f27">|   |--PriceFive</option><option tag="01" value="f16">|   |--Link Trade & Link Exchange</option><option tag="01" value="f17">|   |--Domain Name & Website</option><option tag="01" value="f18">|   |--Affiliate Programs & Softwares</option><option tag="01" value="f19">|   |--Anything</option><option value="-1">|   </option><option tag="01" value="c2">|--Search Engines</option><option tag="01" value="f6">|   |--Introductions</option><option tag="01" value="f7">|   |--Google</option><option tag="01" value="f8">|   |--Yahoo & Bing</option><option value="-1">|   </option><option tag="01" value="c6">|--Search Engine Marketing</option><option tag="01" value="f25">|   |--Adwords - Adsense</option><option tag="01" value="f26">|   |--Social Sites</option><option value="-1">|   </option><option tag="01" value="c3">|--Search Engine Optimization</option><option tag="01" value="f9">|   |--General SEO</option><option tag="01" value="f10">|   |--Link Building</option><option tag="01" value="f11">|   |--On-Page SEO</option><option tag="01" value="f12">|   |--SEO Services Wanted</option><option tag="01" value="f13">|   |--SEO Services Offered</option><option tag="01" value="f14">|   |--SEO Tools & SEO Tips</option><option tag="01" value="f15">|   |--Black Hat SEO</option><option value="-1">|   </option><option tag="01" value="c4">|--Jobs</option><option tag="01" value="f20">|   |--Jobs wanted</option><option tag="01" value="f21">|   |--Jobs Offered</option><option tag="01" value="f22">|   |--Free Jobs</option><option value="-1">|   </option><option tag="01" value="c7">|--Money Online</option><option tag="01" value="f23">|   |--Tips</option><option tag="01" value="f24">|   |--Freelancers</option><option value="-1">|   </option><option tag="01" value="f2">|--Basket & Spam</option></select><input type="hidden" name="tid" value="77d0947fd5798271e9800998ecf8427e" /> <input class="liteoption" type="submit" value="Go" /></span></td></tr></table></form><script type="text/javascript">//<![CDATA[ $(resize_images({ 'selector' : '.postbody', 'max_width' : 300, 'max_height' : 100 }));//]]></script><script src="https://illiweb.com/rs3/47/frm/addthis/addthis_widget.js" type="text/javascript"></script></td><td valign="top" width="183"><div id="right"><table width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td align="left"><div align="center" id="FM_widget_share"></div></td></tr></table><div style="height: 2px"></div><table class="forumline" width="100%" border="0" cellspacing="1" cellpadding="0"><tr><td class="catLeft" height="25"><span class="genmed module-title">Watch 3D Products</span></td></tr><tr><td class='row1' align="left"> <a href="http://gamesonlinevui.com" target="_blank"><img src="https://i49.servimg.com/u/f49/17/09/30/39/dichvu10.jpg" alt="banner" border="0" /></a> </td></tr></table><div style="height: 2px"></div><table class="forumline" width="100%" border="0" cellspacing="1" cellpadding="0"><tr><td class="catLeft" height="25"><span class="genmed module-title">Liên Kết Web</span></td></tr><tr><td class='row1' align="left"><a href="http://gameauditionmobile.net" title="game audition mobile">Audition Mobile</a> - <a href="http://dolottriumph.net" title="quan ao lot triumph">Do Lot Triumph</a>-<a href="http://gameditinhmobile.com" title="di tinh online">Di Tinh Online</a>-<a href="http://gamedaituong.com" title="game dai tuong mobile">Dai Tuong Online</a>-<a href="http://gunboundmobile.com" title="gunbound mobile">Gunbound mobile</a><a href="http://gopet122.com" title="gopet 122" rel="nofollow">tai gopet 122</a>-<a href="http://choica.net" title="choi ca" rel="nofollow">Game Choi Ca 3D</a>-<a href="http://taiavatarx.com/avatar-230" title="tai avatar 230">tai avatar 230</a><a href="http://ngudex.blogspot.com" title="tai ngu de">tai ngu de mien phi</a>-<a href="http://gamephongvantruyenky.blogspot.com" title="phong van truyen ky">phong van truyen ky mien phi</a>-<a href="http://diendangamemobile.com" title="dien dan game mobile">dien dan game mobile</a><a href="http://www.facebook.com/TayDuKyMobile" title="tay du ky mobile">Tay Du Ky Mobile</a>-<a href="http://tinhbinh.net" title="tinh binh">tinh binh</a>-<a href="http://iwiniphone.blogspot.com" title="iwin" rel="nofollow">iwin</a>-<a href="http://taiiwins.info/tai-iwin-260">iwin 260</a>-<a href="http://vuongquocgamemobile.com/tra-chanh-quan" title="tra chanh quan" rel="nofollow">tra chanh quan</a>-<a href="http://apps.vuongquocgamemobile.com/" title="game android mien phi">tai game android</a>-<a href="http://gameworldfun.gamesonlinevui.com/tai-iwin-257/" title="iwin">iwin</a><a href="http://taiiwins.info/iwin-may-tinh-pc/" rel="nofollow">Choi Iwin Tren May Tinh</a>-<a href="http://www.arcadetrick.com/" rel="nofollow">choi game online</a>-<a href="http://vietnambuyo.com" rel="nofollow">game mobile mien phi</a>-<a href="http://hotvideosgames.com" title="video game">home video game</a> - <a href="http://gamesonlinevui.com">game online vui</a> - <a href="http://ngocrongonline.biz" title="ngoc rong online">Game Ngoc Rong Online Mien Phi</a> - <a href="http://taiiwin290.com" title="tai iwin 290">iwin 290 mien phi</a> - <a href="http://devuongmobile.net" title="de vuong online">De Vuong Online</a></td></tr></table><div style="height: 2px"></div></div></td></tr></tbody></table></div></div><!-- close div id="page-body" --><div id="page-footer"><div align="center"><div class="gen"><strong><a href="http://www.forumotion.com/create-private-forum" target="_blank">Private forum on Forumotion</a></strong> | <span class="gensmall">©</span> <a href="http://www.forumotion.com/phpbb" target="_blank">phpBB</a> | <a name="bottom" href="http://help.forumotion.com/" target="_blank">Free forum support</a> | <a name="bottom" href="/contact" rel="nofollow">Contact</a> | <a href="/abuse?page=%2Ft516-guest-blogging-for-links-choose-a-heavily-scraped-site&report=1" rel="nofollow">Report an abuse</a> | <strong><a href="http://www.forumotion.com" target="_blank">Free forum</a></strong></div></div><div align="center"><div class="gen"><a name="bottom" class="copyright" href="http://raovat3d.org" rel="follow" target="_blank" title="rao vat">rao vat</a> | <a name="bottom" class="copyright" href="http://www.vieclamthuctap.com" rel="follow" target="_top" title="thuc tap sinh vien">thuc tap</a></div></div></div></td></tr></table><script type="text/javascript">//<![CDATA[ fa_endpage();//]]></script><script type="text/javascript"> var vglnk = { api_url: '//api.viglink.com/api', key: '0d80ae9fe71cec9484f682bd59232f9e' }; (function(d, t) { var s = d.createElement(t); s.type = 'text/javascript'; s.async = true; s.src = ('https:' == document.location.protocol ? vglnk.api_url : '//cdn.viglink.com/api') + '/vglnk.js'; var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r); }(document, 'script')); </script><script type="text/javascript"> document.write('<scr' + 'ipt data-cfasync="false" type="text/javascript" src="https://www.geniusdisplay.com/a/display.php?r=1242764"></scr' + 'ipt>'); </script></body></html>