<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Wysmedia.com &#187; download</title>
	<atom:link href="http://www.wysmedia.com/tag/download/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.wysmedia.com</link>
	<description>~ We make IT easy for you ~</description>
	<lastBuildDate>Fri, 08 May 2009 07:02:02 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Mirroring a Website with Httrack</title>
		<link>http://www.wysmedia.com/2009/05/mirroring-a-website-with-httrack/</link>
		<comments>http://www.wysmedia.com/2009/05/mirroring-a-website-with-httrack/#comments</comments>
		<pubDate>Fri, 08 May 2009 06:28:59 +0000</pubDate>
		<dc:creator>adwin</dc:creator>
				<category><![CDATA[Fun]]></category>
		<category><![CDATA[Others]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[download]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://www.wysmedia.com/?p=84</guid>
		<description><![CDATA[How to download / mirror website using Httrack. in this page, I explain how to download Wordpress codex. I love to have a offline version of Wordpress Codex so I can read when I am offline or away and there are no internet connection.]]></description>
			<content:encoded><![CDATA[<p>I have been use this for quite long long times. Before I was using <a href="http://www.gnu.org/software/wget/">GNU Wget (let&#8217;s call it wget)</a> to download a website (mirroring) because in the past I have limited internet connection (very expensive, because it use time based and slow). Wget is good for those who loves using linux and console (it available in windows as well). <span id="more-84"></span></p>
<p><img src="http://farm4.static.flickr.com/3397/3511642805_225e1a463f.jpg" alt="image of httrack window" /></p>
<p><a href="http://www.httrack.com/">Httrack</a> is windows based free software that using capabilities like those in wget. It can mirror a website, download all related pages. I used this software especially for downloading documentation sites. </p>
<p>Here is my Rules to download a <a href="http://codex.wordpress.org/Developer_Documentation">codex.wordpress.com</a> (just for Developer Documentation and its related pages) : </p>
<blockquote><p>+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar<br />
-http://codex.wordpress.org/Mailing_Lists<br />
-http://codex.wordpress.org/IRC<br />
-http://wordpress.org/support<br />
-http://codex.wordpress.org/Contributing_to_WordPress<br />
-http://codex.wordpress.org/Automated_Testing<br />
-*action=*<br />
-http://codex.wordpress.org/Codex:Community_Portal<br />
-*Help:*<br />
-http://codex.wordpress.org/Current_events<br />
-http://codex.wordpress.org/Special:Recentchanges<br />
-http://codex.wordpress.org/Special:Randompage<br />
-http://codex.wordpress.org/Development_Team</p></blockquote>
<p>In there I just need the Function References but for some reason I don&#8217;t want to download &#8220;unused&#8221; pages ..which I won&#8217;t read like (http://codex.wordpress.org/Current_events,http://codex.wordpress.org/Special:Recentchanges, etc). You can see there that I can put regular expression wildcard into the rules. I also put the limit to 3 depths parsing only so that I dont need to download large amount of pages.</p>
<p><img src="http://farm4.static.flickr.com/3569/3511649179_b883bf2bbb.jpg" alt="Rules for Downloading codex.wordpress.com" /></p>
<p>The nice part, is you can update the download without downloading all one by one in the future. Just don&#8217;t delete the files generated by Httrack. </p>
<p>Here is my complete commands that generate by Httrack.</p>
<blockquote><p>winhttrack -qwr3%e0C2%Ps0u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F &#8220;Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)&#8221; -%F &#8220;<!-- Mirrored from %s%s by HTTrack Website Copier/3.x [XR&#038;CO'2008], %s -->&#8221; -%l &#8220;en, en, *&#8221; http://codex.wordpress.org/Developer_Documentation -O1 &#8220;C:\Downloads\codex-wordpress\codex wordpress&#8221; +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar -http://codex.wordpress.org/Mailing_Lists -http://codex.wordpress.org/IRC -http://wordpress.org/support -http://codex.wordpress.org/Contributing_to_WordPress -http://codex.wordpress.org/Automated_Testing -*action=* -http://codex.wordpress.org/Codex:Community_Portal -*Help:* -http://codex.wordpress.org/Current_events -http://codex.wordpress.org/Special:Recentchanges -http://codex.wordpress.org/Special:Randompage -http://codex.wordpress.org/Development_Team </p></blockquote>
<p>ps: you can use httrack in console so you can create an scheduler to download / update / mirroring websites frequently.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.wysmedia.com/2009/05/mirroring-a-website-with-httrack/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

<!-- www.000webhost.com Analytics Code -->
<script type="text/javascript" src="http://analytics.hosting24.com/count.php"></script>
<noscript><a href="http://www.hosting24.com/"><img src="http://analytics.hosting24.com/count.php" alt="web hosting" /></a></noscript>
<!-- End Of Analytics Code -->
