<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Schadenfreude &#187; web</title>
	<atom:link href="http://www.ralree.com/tag/web/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ralree.com</link>
	<description>Malicious enjoyment derived from observing someone else's misfortune</description>
	<lastBuildDate>Thu, 09 Feb 2012 01:49:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>wget mirroring with external references</title>
		<link>http://www.ralree.com/2011/12/03/wget-mirroring-with-external-references/</link>
		<comments>http://www.ralree.com/2011/12/03/wget-mirroring-with-external-references/#comments</comments>
		<pubDate>Sun, 04 Dec 2011 05:49:32 +0000</pubDate>
		<dc:creator>Erik</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[scripting]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[wget]]></category>

		<guid isPermaLink="false">http://www.ralree.com/?p=23252</guid>
		<description><![CDATA[I was having trouble mirroring a website that had all its images hosted on a different domain, which happened to be random subdomains of cloudfront.net.  I tried adding *.cloudfront.net to the -D parameter, but that didn&#8217;t work.  It turns out it&#8217;s smart enough to figure out that all subdomains in the domain list should be included as well: wget -mkpEK -D www.allshepherdrescue.org,cloudfront.net -H -t 3 \ --restrict-file-names=windows http://www.allshepherdrescue.org/ This goes into mirror mode, changes relative links to the proper form, [...]]]></description>
			<content:encoded><![CDATA[<p>I was having trouble mirroring a website that had all its images hosted on a different domain, which happened to be random subdomains of cloudfront.net.  I tried adding *.cloudfront.net to the -D parameter, but that didn&#8217;t work.  It turns out it&#8217;s smart enough to figure out that all subdomains in the domain list should be included as well:</p>
<pre>wget -mkpEK -D www.allshepherdrescue.org,cloudfront.net -H -t 3 \
     --restrict-file-names=windows http://www.allshepherdrescue.org/</pre>
<p>This goes into mirror mode, changes relative links to the proper form, fixes the query string urls to static ones, and downloads all files from the domains in the -D parameter.  The manpage details all of this.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ralree.com/2011/12/03/wget-mirroring-with-external-references/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ruby to generate RSS feeds for sites that don&#8217;t offer them</title>
		<link>http://www.ralree.com/2009/08/23/ruby-to-generate-rss-feeds-for-sites-that-dont-offer-them/</link>
		<comments>http://www.ralree.com/2009/08/23/ruby-to-generate-rss-feeds-for-sites-that-dont-offer-them/#comments</comments>
		<pubDate>Sun, 23 Aug 2009 14:53:41 +0000</pubDate>
		<dc:creator>Erik</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[generation]]></category>
		<category><![CDATA[gist]]></category>
		<category><![CDATA[github]]></category>
		<category><![CDATA[rss]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[rubyrss]]></category>
		<category><![CDATA[scraping]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[xpath]]></category>

		<guid isPermaLink="false">http://www.ralree.com/?p=22648</guid>
		<description><![CDATA[There&#8217;s this site that has an equipment exchange I wanted to keep track of. Yet, it&#8217;s done with what seems to be a custom php file rather than vbulletin, so none of the usual RSS feeds from the site apply to it. So, I decided to make a scraper/feed-generator to get me the latest version every 5 minutes and generate a nice RSS feed, so I can view it in Google Reader. The volume of posting is low enough that [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s this site that has an equipment exchange I wanted to keep track of.  Yet, it&#8217;s done with what seems to be a custom php file rather than vbulletin, so none of the usual RSS feeds from the site apply to it.  So, I decided to make a scraper/feed-generator to get me the latest version every 5 minutes and generate a nice RSS feed, so I can view it in Google Reader.  The volume of posting is low enough that this won&#8217;t be annoying to see in my daily feeds.</p>
<p>I usually use Ruby for this because it offers Hpricot, a very nice and fast scraper and XPath interface. This time, I resolved to find something that does RSS generation better, and I stumbled upon <a href="http://rubyrss.com/">RubyRSS</a>, which <strong>happens to be in the core ruby distribution</strong>!<br />
<span id="more-22648"></span><br />
Here&#8217;s what I ended up with after about an hour:</p>
<p><script src="http://gist.github.com/173318.js"></script></p>
<p>Now this is impressive if you look at the fail of html <code>id</code> and <code>class</code> attributes coming out of the original page.  I had to base everything off of the links to the items that were not images, and then the structure <em>up the tree</em> from there (see the liberal use of <code>.parent</code>).  I&#8217;ve rediscovered that Hpricot is awesome (_why, come back to us!), and that it truly only takes 30 lines of code to generate a nice RSS feed in ruby.  The resultant RSS feed for MDShooters Classifieds site is <a href="http://www.ralree.com/mdshooters_classifieds.xml">here</a>.</p>
<p>And now, yet another RSS feed generator: <a href="http://ralree.com/md_super_ads.xml">MD Super Ads</a></p>
<p>Here&#8217;s the code:</p>
<p><script src="http://gist.github.com/173623.js"></script></p>
]]></content:encoded>
			<wfw:commentRss>http://www.ralree.com/2009/08/23/ruby-to-generate-rss-feeds-for-sites-that-dont-offer-them/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Annoying Web 2.0 Stuff</title>
		<link>http://www.ralree.com/2008/07/01/annoying-web-2-0-stuff/</link>
		<comments>http://www.ralree.com/2008/07/01/annoying-web-2-0-stuff/#comments</comments>
		<pubDate>Tue, 01 Jul 2008 03:31:00 +0000</pubDate>
		<dc:creator>Erik</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[computers]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[lame]]></category>
		<category><![CDATA[rss]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://www.ralree.info/2008/07/01/annoying-web-2-0-stuff</guid>
		<description><![CDATA[Have you ever wondered what it would be like if no one ever upgraded to Web 2.0? I mean, really, what have we gained. Here&#8217;s a list of things that I personally find really stupid about the phenomenon&#8230; Have you ever wondered what it would be like if no one ever upgraded to Web 2.0? I mean, really, what have we gained. Here&#8217;s a list of things that I personally find really stupid about the phenomenon: Lists I think there&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>Have you ever wondered what it would be like if no one ever upgraded to Web 2.0?  I mean, really, what have we gained.  Here&#8217;s a list of things that I personally find really stupid about the phenomenon&#8230;</p>
<p>            <span id="more-13038"></span></p>
<p>Have you ever wondered what it would be like if no one ever upgraded to Web 2.0?  I mean, really, what have we gained.  Here&#8217;s a list of things that I personally find really stupid about the phenomenon:</p>
<h1>Lists</h1>
<p>I think there&#8217;s a list for everything now.  Everyone wants to be David Letterman.  Here&#8217;s a lists of lists of things, found on Digg by searching for &#8220;10 things&#8221;:</p>
<ul>
<li><a href="http://www.geekarmy.com/geekblog/politics/top-10-things-americans-want-but-cant-have/">Top 10 Things Americans Want but Can’t Have</a></li>
<li><a href="http://www.psychologytoday.com/rss/pto-20030823-000001.html">Procrastination: Ten Things To Know</a></li>
<li><a href="http://blogs.ittoolbox.com/bi/confessions/archives/technologist-manifesto-or-things-everyone-in-it-should-know-7715">Things Everyone in IT Should Know</a></li>
<li><a href="http://gizmodo.com/5020306/things-no-one-gives-microsoft-credit-for-but-should">Things No One Gives Microsoft Credit For (But Should)</a></li>
<li><a href="http://www.thelandsalmon.com/lifestyle/16-names-of-things-you-never-knew-had-names.html">16 Names Of Things You Never Knew Had Names</a></li>
<li><a href="http://eightandfive.com/2006/11/02/stay-on-target/">Things Republicans want you to forget</a></li>
</ul>
<p>Now, lists are all fine and dandy, but nowadays, most lists people put up are simply a ploy to get traffic, show more ads, and make money.  Now, I have no problem with this at all &#8211; profits are the best incentive.  But, really, it&#8217;s getting a little insane.  Instead of lists, which are easy to skim, why not hide all your points in some real writing?  Many of my friends and I, who are internet addicts, have realized that our reading speed and comprehension have suffered over the years due to the way we surf, which is skimming everything for that small chunk of information you want.</p>
<h1>Social Bookmarking Icon Diarrhea</h1>
<p>This picture says it all:</p>
<p><img src="http://ralree.info/assets/2008/7/1/annoying_icons.png" alt="Stupid Icons"/></p>
<h1>Really Big RSS Feed Images</h1>
<p>
<img src="http://ralree.info/assets/2008/7/1/rsspo1.png" alt=""/></p>
<h1>Giant Footers</h1>
<p>Here&#8217;s one:</p>
<p><a href="http://louisville.edu/"><img src="http://ralree.info/assets/2008/7/1/uglyfooter_medium.jpg" alt=""/></a></p>
<p>Nobody looks at the bottom of your site.  Sorry.</p>
<h1>Lame Layouts</h1>
<h2>How to waste 2/3 of the screen</h2>
<p><img src="http://ralree.info/assets/2008/7/1/4124-medium.jpg" alt=""/></p>
<h2>How to waste 3/4 of the screen</h2>
<p><img src="http://ralree.info/assets/2008/7/1/3016-medium.jpg" alt=""/><br />
<img src="http://ralree.info/assets/2008/7/1/2822-medium.jpg" alt=""/></p>
]]></content:encoded>
			<wfw:commentRss>http://www.ralree.com/2008/07/01/annoying-web-2-0-stuff/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

