Hacking with wget

I frequently find myself wanting to look at a few web pages while I’m w/o Net access, for example, when I’m on a plane. I’m heading out on a three-day trip tomorrow and so I quickly hacked together a tool I can use in situations like this using wget and ActiveWords.

I designated a folder where web site downloads will go. I wrote a simple batch command to invoke wget with the right parameter to download a URL and the pages immediately associated with it into that folder. Then I created a simple ActiveWord command to launch the batch file and pass it a URL parameter.

The batch file is called wget_page.bat.

cd "C:\Documents and Settings\ssimeonov\My Documents\spider_downloads"
c:\dev\bin\wget\wget --recursive --level=1 --page-requisites --convert-links --html-extension %1

I called my ActiveWords command wget. Create a new script command and add the code below to it.

<WORKPAD><LT>"c:\dev\bin\wget_page.bat" <INPUTBOX><GT><CTRL>s</CTRL>

Instead of ActiveWords, you can use Launchy to kick off the batch file. You will also have to tweak the code to add your own paths, etc.

About Simeon Simeonov

I'm an entrepreneur, hacker, angel investor and reformed VC. I am currently Founder & CTO of Swoop, a search advertising platform. Through FastIgnite I invest in and work with a few great startups to get more done with less. Learn more, follow @simeons on Twitter and connect with me on LinkedIn.
This entry was posted in Uncategorized and tagged , , , , , . Bookmark the permalink.

5 Responses to Hacking with wget

  1. misha says:

    Hi, looks useful, however why not just wget the whole site, then run it locally: not sure I understand the usefulness of ActiveWords.

  2. Misha, ActiveWords make this simple. I don’t have to open a shell, update paths, type the name of a batch file, etc.

  3. Updated 24/7 live scores, odds, and results for professional, collegiate, fantasy, NFL, NBA, College Basketball, Horse Racing, Fantasy Sports Teams Plus Real-Time Sports Scores and Stats

  4. Vivek Puri says:

    http://www.webaroo.com/ seems to do that out of the box. Although you would miss out on the hacking part.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s