<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The DaveN Blog Hack Raises Questions On Google Blacklist API</title>
	<atom:link href="http://nickwilsdon.com/daven-blog-hack-questions-google/feed/" rel="self" type="application/rss+xml" />
	<link>http://nickwilsdon.com/daven-blog-hack-questions-google/</link>
	<description></description>
	<lastBuildDate>Sat, 30 Jan 2010 08:05:26 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Gab Goldenberg</title>
		<link>http://nickwilsdon.com/daven-blog-hack-questions-google/comment-page-1/#comment-11806</link>
		<dc:creator>Gab Goldenberg</dc:creator>
		<pubDate>Sat, 01 Nov 2008 23:30:32 +0000</pubDate>
		<guid isPermaLink="false">http://nickwilsdon.com/?p=464#comment-11806</guid>
		<description>Nick, could I ask you for a favor - a friend runs a noncommercial site, and he suddenly got a massive penalty in G. My hunch is that he got cracked n there&#039;s hidden (probably cloaked) stuff on the domain, but GWT only lists crawl errors (the static, WP generated URLs haven&#039;t changed...) . Think he could get his domain analyzed free by SERPGuard?</description>
		<content:encoded><![CDATA[<p>Nick, could I ask you for a favor &#8211; a friend runs a noncommercial site, and he suddenly got a massive penalty in G. My hunch is that he got cracked n there&#8217;s hidden (probably cloaked) stuff on the domain, but GWT only lists crawl errors (the static, WP generated URLs haven&#8217;t changed&#8230;) . Think he could get his domain analyzed free by SERPGuard?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JohnMu</title>
		<link>http://nickwilsdon.com/daven-blog-hack-questions-google/comment-page-1/#comment-11758</link>
		<dc:creator>JohnMu</dc:creator>
		<pubDate>Mon, 22 Sep 2008 06:18:14 +0000</pubDate>
		<guid isPermaLink="false">http://nickwilsdon.com/?p=464#comment-11758</guid>
		<description>Yep,  it&#039;s certainly complicated :-)

I like the idea of an automatic alert spammy-keyword-list, but I&#039;m somewhat worried how it would look if Google were to provide something like that.  I have a feeling this would make a great service by someone outside of Google, it could be a kind of wiki that collects this information and helps users to set up appropriate Google Alerts. Since these alerts just send you an email, the occasional false positive wouldn&#039;t be much of a problem. 

John</description>
		<content:encoded><![CDATA[<p>Yep,  it&#8217;s certainly complicated :-)</p>
<p>I like the idea of an automatic alert spammy-keyword-list, but I&#8217;m somewhat worried how it would look if Google were to provide something like that.  I have a feeling this would make a great service by someone outside of Google, it could be a kind of wiki that collects this information and helps users to set up appropriate Google Alerts. Since these alerts just send you an email, the occasional false positive wouldn&#8217;t be much of a problem. </p>
<p>John</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Wilsdon</title>
		<link>http://nickwilsdon.com/daven-blog-hack-questions-google/comment-page-1/#comment-11757</link>
		<dc:creator>Nick Wilsdon</dc:creator>
		<pubDate>Sat, 20 Sep 2008 20:09:28 +0000</pubDate>
		<guid isPermaLink="false">http://nickwilsdon.com/?p=464#comment-11757</guid>
		<description>@John

Thanks, you helped clarify the difference here between malware and hidden links. I see now why the Google Alert strategy is not just complementary but an essential part of your site defence. 

It&#039;s a tricky one this. Most normal users are going to be assuming the Google Safe Browsing API picks up these kind of text-book hacks. I&#039;ve found the easiest way to explain these new tools are &quot;anti-virus protection&quot; for your websites. 

On the one hand, yes I think people would like to know if you pick up that kind of material on their sites. In Dave&#039;s case here, a warning email or alert on the API would have really helped. 

On the other hand, like you say, it&#039;s going to be hard to work out intent here. If someone has chosen to link out to these places, it would be harsh to stop their traffic with the interstitial page. Especially as the most recent copy doesn&#039;t seem to offer any way to continue to the site.   

So I&#039;d be in favour of warnings but not the  interstitial page for these kind of situations. 

The Google Alerts service is great but it&#039;s still left to the owner to set this up. I know a lot of people who have no idea about GWC - let alone how to set up alerts or check for keywords. It would be great to make this easier for people somehow. Maybe it could be built into the API with a public list of &quot;suspicious&quot; words (Cialis, Prozac etc.)? People are lazy and I think most just want this kind of &quot;anti-virus&quot; protection, where they just enter their URL and applications flag up issues. 

The API seems a great step in this direction, as it allows so many people and applications to build in these kind of reports. Getting them to set up Google Alerts for their users is going to be too much work. Also a centralized &quot;suspicious&quot; list would be updated more regularly as new keywords appear (drug brand names etc.). It would be interesting to see how we all agree on the list though! Maybe we need to do it like the RBLs, certain people create their own filter list which can be easy or hard and that interacts with the API in some way. 
</description>
		<content:encoded><![CDATA[<p>@John</p>
<p>Thanks, you helped clarify the difference here between malware and hidden links. I see now why the Google Alert strategy is not just complementary but an essential part of your site defence. </p>
<p>It&#8217;s a tricky one this. Most normal users are going to be assuming the Google Safe Browsing API picks up these kind of text-book hacks. I&#8217;ve found the easiest way to explain these new tools are &#8220;anti-virus protection&#8221; for your websites. </p>
<p>On the one hand, yes I think people would like to know if you pick up that kind of material on their sites. In Dave&#8217;s case here, a warning email or alert on the API would have really helped. </p>
<p>On the other hand, like you say, it&#8217;s going to be hard to work out intent here. If someone has chosen to link out to these places, it would be harsh to stop their traffic with the interstitial page. Especially as the most recent copy doesn&#8217;t seem to offer any way to continue to the site.   </p>
<p>So I&#8217;d be in favour of warnings but not the  interstitial page for these kind of situations. </p>
<p>The Google Alerts service is great but it&#8217;s still left to the owner to set this up. I know a lot of people who have no idea about GWC &#8211; let alone how to set up alerts or check for keywords. It would be great to make this easier for people somehow. Maybe it could be built into the API with a public list of &#8220;suspicious&#8221; words (Cialis, Prozac etc.)? People are lazy and I think most just want this kind of &#8220;anti-virus&#8221; protection, where they just enter their URL and applications flag up issues. </p>
<p>The API seems a great step in this direction, as it allows so many people and applications to build in these kind of reports. Getting them to set up Google Alerts for their users is going to be too much work. Also a centralized &#8220;suspicious&#8221; list would be updated more regularly as new keywords appear (drug brand names etc.). It would be interesting to see how we all agree on the list though! Maybe we need to do it like the RBLs, certain people create their own filter list which can be easy or hard and that interacts with the API in some way.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JohnMu</title>
		<link>http://nickwilsdon.com/daven-blog-hack-questions-google/comment-page-1/#comment-11753</link>
		<dc:creator>JohnMu</dc:creator>
		<pubDate>Fri, 19 Sep 2008 13:50:39 +0000</pubDate>
		<guid isPermaLink="false">http://nickwilsdon.com/?p=464#comment-11753</guid>
		<description>There are a lot of different kinds of hacks. In this case, it&#039;s a textbook version of a SEO hack: someone placed hidden links with keyword-heavy anchor texts on the pages in the site. You can still see that in the cached versions of some of these pages. Since this kind of hack does not contain malware, it&#039;s not something that the malware diagnostics page will warn you about. It doesn&#039;t harm a user to open a page with hidden content on it :). 

We notify webmasters through Webmaster Tools when we spot malware, but general hidden/hacked content like this is hard to spot algorithmically with certainty, so we can&#039;t always notify the webmaster about it.

One way of recognizing this would be to regularly check Webmaster Tools, Statistics / &quot;What Googlebot Sees.&quot; On that page is generally a section for &quot;Keywords in your site&#039;s content.&quot; If everything is ok, you should see keywords there that match what you write about. If it&#039;s hacked with hidden text/links, you&#039;ll likely see things that match the hidden content, like various kinds of pills, porn phrases, etc. Maybe this should be something that webmasters should be doing on a regular basis? :-)

You can achieve the same by setting up Google alerts for those kinds keywords on your site. I like to do that anyway, just to be sure that I don&#039;t accidentally miss anything (it could also bring up spammy comments that you can remove, etc). 

I hate it when sites get hacked like this (and there are a LOT of sites hacked like that) and honestly, I don&#039;t know how the search engines should react when we spot hidden content like that. On the one hand, the site was compromised and ANYTHING could be placed on it by the hackers, including cloaked redirects to pornographic sites and malware; on the other hand, the webmaster is often not aware of it and we don&#039;t want to be too harsh. 

What do you think?</description>
		<content:encoded><![CDATA[<p>There are a lot of different kinds of hacks. In this case, it&#8217;s a textbook version of a SEO hack: someone placed hidden links with keyword-heavy anchor texts on the pages in the site. You can still see that in the cached versions of some of these pages. Since this kind of hack does not contain malware, it&#8217;s not something that the malware diagnostics page will warn you about. It doesn&#8217;t harm a user to open a page with hidden content on it :). </p>
<p>We notify webmasters through Webmaster Tools when we spot malware, but general hidden/hacked content like this is hard to spot algorithmically with certainty, so we can&#8217;t always notify the webmaster about it.</p>
<p>One way of recognizing this would be to regularly check Webmaster Tools, Statistics / &#8220;What Googlebot Sees.&#8221; On that page is generally a section for &#8220;Keywords in your site&#8217;s content.&#8221; If everything is ok, you should see keywords there that match what you write about. If it&#8217;s hacked with hidden text/links, you&#8217;ll likely see things that match the hidden content, like various kinds of pills, porn phrases, etc. Maybe this should be something that webmasters should be doing on a regular basis? :-)</p>
<p>You can achieve the same by setting up Google alerts for those kinds keywords on your site. I like to do that anyway, just to be sure that I don&#8217;t accidentally miss anything (it could also bring up spammy comments that you can remove, etc). </p>
<p>I hate it when sites get hacked like this (and there are a LOT of sites hacked like that) and honestly, I don&#8217;t know how the search engines should react when we spot hidden content like that. On the one hand, the site was compromised and ANYTHING could be placed on it by the hackers, including cloaked redirects to pornographic sites and malware; on the other hand, the webmaster is often not aware of it and we don&#8217;t want to be too harsh. </p>
<p>What do you think?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
