<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.7" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Complete Guide To Scraping Pt. 2 - Crawling</title>
	<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/</link>
	<description>Advanced SEO Tactics and Techniques</description>
	<pubDate>Thu, 24 Jul 2008 16:13:36 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.7</generator>

	<item>
		<title>by: download wii games</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-232108</link>
		<pubDate>Fri, 30 May 2008 15:15:53 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-232108</guid>
					<description>so joe cracker looks to have become the topic of the thread....was there even any answer to the use of content re-writers however??? something i would also like to use to 'flip' content pieces.</description>
		<content:encoded><![CDATA[<p>so joe cracker looks to have become the topic of the thread&#8230;.was there even any answer to the use of content re-writers however??? something i would also like to use to &#8216;flip&#8217; content pieces.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: BORAT</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-209467</link>
		<pubDate>Mon, 18 Feb 2008 21:05:35 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-209467</guid>
					<description>OOOOOOOOOOOOOOOOJ Joe Cracker
Do you remember me?
You were my driving instructor.
You said that woman must give me permision to have sexy time with me.
hahahahahaha what a nonsense:)</description>
		<content:encoded><![CDATA[<p>OOOOOOOOOOOOOOOOJ Joe Cracker<br />
Do you remember me?<br />
You were my driving instructor.<br />
You said that woman must give me permision to have sexy time with me.<br />
hahahahahaha what a nonsense:)
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: neil strauss</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-200950</link>
		<pubDate>Mon, 28 Jan 2008 14:18:37 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-200950</guid>
					<description>So Joe Cracker, would you please let us all know the technique and system you use in myspace to make so much money?  Please do so as we would all appreciate that.</description>
		<content:encoded><![CDATA[<p>So Joe Cracker, would you please let us all know the technique and system you use in myspace to make so much money?  Please do so as we would all appreciate that.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: I would</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-9542</link>
		<pubDate>Thu, 04 Jan 2007 20:01:15 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-9542</guid>
					<description>Joe Cracker is a noob and a half.</description>
		<content:encoded><![CDATA[<p>Joe Cracker is a noob and a half.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Eli</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-8753</link>
		<pubDate>Sun, 24 Dec 2006 04:11:39 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-8753</guid>
					<description>Welcome to &lt;b&gt;advanced SEO&lt;/b&gt; we take no prisoners :)</description>
		<content:encoded><![CDATA[<p>Welcome to <b>advanced SEO</b> we take no prisoners <img src='http://www.BlueHatSEO.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Matt Larson</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7376</link>
		<pubDate>Mon, 04 Dec 2006 08:37:07 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7376</guid>
					<description>Can't believe I'm backing up Joe Cracker, but black-hat doesn't mean illegal... it means "unethical". Now what is "unethical?" I suppose that depends on your own values and that of the industry to which you belong.

Conversely, laws are not issues of ethics. You break them and you pay one way or the other.

The government can make anyone's life miserable, so follow copyrights &#38; attribute sources. If it's wikipedia you're scraping or some other GNU or CC work, it's easy. For article sites, put a little more work into it and reference the source (author). Then scrape away!</description>
		<content:encoded><![CDATA[<p>Can&#8217;t believe I&#8217;m backing up Joe Cracker, but black-hat doesn&#8217;t mean illegal&#8230; it means &#8220;unethical&#8221;. Now what is &#8220;unethical?&#8221; I suppose that depends on your own values and that of the industry to which you belong.</p>
<p>Conversely, laws are not issues of ethics. You break them and you pay one way or the other.</p>
<p>The government can make anyone&#8217;s life miserable, so follow copyrights &amp; attribute sources. If it&#8217;s wikipedia you&#8217;re scraping or some other GNU or CC work, it&#8217;s easy. For article sites, put a little more work into it and reference the source (author). Then scrape away!
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: bad-ass-bob</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7180</link>
		<pubDate>Tue, 28 Nov 2006 00:10:37 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7180</guid>
					<description>Hay Joe Cracker-
Welcome to the dark side of the web. Black hat stuff it is. Does it involve unethical tactics? you bet your ass it does. If this bothers you then go play somewhere else cause we know what we are doing and we don't need you to stumble in and start blabbing to us about -hay do you realize this is copyright infringement? fuck yes we realize it but the money is too good to pass up. So as I said, find someone else to bother and get the fuck outa here. If you stick around we will turn you to the dark side. You been warned!!</description>
		<content:encoded><![CDATA[<p>Hay Joe Cracker-<br />
Welcome to the dark side of the web. Black hat stuff it is. Does it involve unethical tactics? you bet your ass it does. If this bothers you then go play somewhere else cause we know what we are doing and we don&#8217;t need you to stumble in and start blabbing to us about -hay do you realize this is copyright infringement? fuck yes we realize it but the money is too good to pass up. So as I said, find someone else to bother and get the fuck outa here. If you stick around we will turn you to the dark side. You been warned!!
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Joe Cracker</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7177</link>
		<pubDate>Mon, 27 Nov 2006 23:35:43 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7177</guid>
					<description>You ignored the issue regarding copyrights, Eli. I don't think publishers want you profiteering off their original work without permision. Scraping for content is stealing.

Also, google has duplicant content filters so your scrape content probably won't rank too well.</description>
		<content:encoded><![CDATA[<p>You ignored the issue regarding copyrights, Eli. I don&#8217;t think publishers want you profiteering off their original work without permision. Scraping for content is stealing.</p>
<p>Also, google has duplicant content filters so your scrape content probably won&#8217;t rank too well.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Seostomp</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7163</link>
		<pubDate>Mon, 27 Nov 2006 13:54:57 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7163</guid>
					<description>I use myspace for indexing purposes... used to monitize it but was asked by my affiliate managers to stop...  it works well for indexing though.</description>
		<content:encoded><![CDATA[<p>I use myspace for indexing purposes&#8230; used to monitize it but was asked by my affiliate managers to stop&#8230;  it works well for indexing though.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Aur</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7151</link>
		<pubDate>Mon, 27 Nov 2006 10:48:24 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7151</guid>
					<description>Hey today I discovered that in my company there is a big printer that also does scanner-to-email... I can put a big pile of papers in the machine and it will scan everything in less than one minute. With some nice OCR, it would mean a lot of fresh content !

About scraping, indeed the idea is to generate thousands of pages that you can re-use into a website. Every page should have some advertisers in it (affiliates and/or adsense). The idea is to get A LOT of content (like 10000 pages), build a website, get it known by the Search engines (you can use Eli's QUIT tool :)), and wait until search engines discover that you just stole the content, and then they'll ban your site. It usually takes something between 1-3 months, and meanwhile you'll have earned money from your advertisers.
Then you repeat the whole procedure.

I'm quite new to the game of scraping but I did one site that make me earn something between $5-$10 a day with 10000 pages, so if you manage to automate things well enough, you may be able to generate enough sites to multiply your income!

And by the way, I'm working on a tool that may let you have a scraped site not being banned so quickly (maybe not at all!) I'm currently testing and refining it, more news about it later ;)!

About Joe cracker, he's the kind of people who would like to be admired for what he does, so he will just boast and will not understand why people don't get impressed. On the other hand, someone like Eli just give you real keys to progress, and he deserves to get some admiration ! ;)
The myspace technique Joe's talking about is just the following: Use a myspace bot to add hundreds of friends. Then when you have friends, you can post a "bulletin" which is an announce that will be seen by all your friends. This bulletin will make them go to some affiliate (like a dating service). Again the game numbers is what is important, over thousands of friends, most will ignore your bulletin, but some will not, and you will get money from affiliate commissions.
"clicking on one button" is what Joe meant : let your bot add friends, and then when you have enough friends, post a bulletin going to somewhere that will make you some money.
Then repeat as much as you can.</description>
		<content:encoded><![CDATA[<p>Hey today I discovered that in my company there is a big printer that also does scanner-to-email&#8230; I can put a big pile of papers in the machine and it will scan everything in less than one minute. With some nice OCR, it would mean a lot of fresh content !</p>
<p>About scraping, indeed the idea is to generate thousands of pages that you can re-use into a website. Every page should have some advertisers in it (affiliates and/or adsense). The idea is to get A LOT of content (like 10000 pages), build a website, get it known by the Search engines (you can use Eli&#8217;s QUIT tool <img src='http://www.BlueHatSEO.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> ), and wait until search engines discover that you just stole the content, and then they&#8217;ll ban your site. It usually takes something between 1-3 months, and meanwhile you&#8217;ll have earned money from your advertisers.<br />
Then you repeat the whole procedure.</p>
<p>I&#8217;m quite new to the game of scraping but I did one site that make me earn something between $5-$10 a day with 10000 pages, so if you manage to automate things well enough, you may be able to generate enough sites to multiply your income!</p>
<p>And by the way, I&#8217;m working on a tool that may let you have a scraped site not being banned so quickly (maybe not at all!) I&#8217;m currently testing and refining it, more news about it later <img src='http://www.BlueHatSEO.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> !</p>
<p>About Joe cracker, he&#8217;s the kind of people who would like to be admired for what he does, so he will just boast and will not understand why people don&#8217;t get impressed. On the other hand, someone like Eli just give you real keys to progress, and he deserves to get some admiration ! <img src='http://www.BlueHatSEO.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /><br />
The myspace technique Joe&#8217;s talking about is just the following: Use a myspace bot to add hundreds of friends. Then when you have friends, you can post a &#8220;bulletin&#8221; which is an announce that will be seen by all your friends. This bulletin will make them go to some affiliate (like a dating service). Again the game numbers is what is important, over thousands of friends, most will ignore your bulletin, but some will not, and you will get money from affiliate commissions.<br />
&#8220;clicking on one button&#8221; is what Joe meant : let your bot add friends, and then when you have enough friends, post a bulletin going to somewhere that will make you some money.<br />
Then repeat as much as you can.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: ralex</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7143</link>
		<pubDate>Mon, 27 Nov 2006 07:25:44 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7143</guid>
					<description>haha.. Joe Cracker is such a noob. I'm pretty sure he was being sarcastic about the Myspace thing too, but wow... would it hurt to be a bit funny sarcastic and not so much negative-i-have-nothing-to-contribute sarcastic?</description>
		<content:encoded><![CDATA[<p>haha.. Joe Cracker is such a noob. I&#8217;m pretty sure he was being sarcastic about the Myspace thing too, but wow&#8230; would it hurt to be a bit funny sarcastic and not so much negative-i-have-nothing-to-contribute sarcastic?
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: roy</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7122</link>
		<pubDate>Sun, 26 Nov 2006 18:49:59 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7122</guid>
					<description>Joe Cracker, We do't what to see how easy you made the money. If you could give hints on what you are promoting or how do you use Myspace,we would appreciate:)</description>
		<content:encoded><![CDATA[<p>Joe Cracker, We do&#8217;t what to see how easy you made the money. If you could give hints on what you are promoting or how do you use Myspace,we would appreciate:)
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Eli</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7119</link>
		<pubDate>Sun, 26 Nov 2006 18:04:27 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7119</guid>
					<description>lol
Joe Cracker you are my new favorite reader!

Well buddy, I guess the best advice I can give you is to stick with what you know. :)

To answer your rivitingly brilliant economic question I'll attempt to explain it visually:
content-&gt; traffic-&gt; advertisers-&gt; money

BTW I don't censor the comments. So if your comment doesn't show up immediately, it usually means it got caught in the spam filter. I will eventually retrieve it. There is no need to panic or continue attempting to post it.</description>
		<content:encoded><![CDATA[<p>lol<br />
Joe Cracker you are my new favorite reader!</p>
<p>Well buddy, I guess the best advice I can give you is to stick with what you know. <img src='http://www.BlueHatSEO.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>To answer your rivitingly brilliant economic question I&#8217;ll attempt to explain it visually:<br />
content-> traffic-> advertisers-> money</p>
<p>BTW I don&#8217;t censor the comments. So if your comment doesn&#8217;t show up immediately, it usually means it got caught in the spam filter. I will eventually retrieve it. There is no need to panic or continue attempting to post it.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Joe Cracker</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7117</link>
		<pubDate>Sun, 26 Nov 2006 17:16:20 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7117</guid>
					<description>Hi Its Joe Cracker again 

Yea this is too much work. And you kinda brushed through the steps. If I were a newbie I wouldn't have a clue what to do and nor would I make much money. Like your last article regarding the screensavers and the spyware installation your idea kinda sucks. 

So after you scrape and steal other people's content where does the money come in? Huh? Does it magically appear in your bank? Or at your door? Or in your computer? If I'm gonna be a content thief I hope to get paid so maybe when my ass gets sued I'll have enough to settle..lol

Why not use myspace? I make so much money with that. I run my program a few times and p00f 100's of dollars in my account. I be raking in dough. 

it is so easy, too. I wish everythign in life were that easy. Just last month, I made a ton of cash. 

-Cracker</description>
		<content:encoded><![CDATA[<p>Hi Its Joe Cracker again </p>
<p>Yea this is too much work. And you kinda brushed through the steps. If I were a newbie I wouldn&#8217;t have a clue what to do and nor would I make much money. Like your last article regarding the screensavers and the spyware installation your idea kinda sucks. </p>
<p>So after you scrape and steal other people&#8217;s content where does the money come in? Huh? Does it magically appear in your bank? Or at your door? Or in your computer? If I&#8217;m gonna be a content thief I hope to get paid so maybe when my ass gets sued I&#8217;ll have enough to settle..lol</p>
<p>Why not use myspace? I make so much money with that. I run my program a few times and p00f 100&#8217;s of dollars in my account. I be raking in dough. </p>
<p>it is so easy, too. I wish everythign in life were that easy. Just last month, I made a ton of cash. </p>
<p>-Cracker
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Rose Water</title>
		<link>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7087</link>
		<pubDate>Sat, 25 Nov 2006 21:50:49 +0000</pubDate>
		<guid>http://www.BlueHatSEO.com/complete-guide-to-scraping-pt-2-crawling/#comment-7087</guid>
					<description>This is offtopic, but I was researching domain names and found a site that shows in the the defunct .gb domain search by adding an ampersand:

http://www.google.com/search?q=site%3Agb&#38;ie=utf-8&#38;oe=utf-8&#38;rls=org.mozilla:en-US:official&#38;client=firefox-a

Do you realize the power of this?
You could have your site end up in a site:edu or site:gov search, although this could just be a frontend glitch.
I'm not sure if it would rank better either, since most edu and gov sites have been well established.</description>
		<content:encoded><![CDATA[<p>This is offtopic, but I was researching domain names and found a site that shows in the the defunct .gb domain search by adding an ampersand:</p>
<p><a href="http://www.google.com/search?q=site%3Agb&amp;ie=utf-8&amp;oe=utf-8&amp;rls=org.mozilla:en-US:official&amp;client=firefox-a" rel="nofollow">http://www.google.com/search?q=site%3Agb&amp;ie=utf-8&amp;oe=utf-8&amp;rls=org.mozilla:en-US:official&amp;client=firefox-a</a></p>
<p>Do you realize the power of this?<br />
You could have your site end up in a site:edu or site:gov search, although this could just be a frontend glitch.<br />
I&#8217;m not sure if it would rank better either, since most edu and gov sites have been well established.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
