Wednesday, August 29, 2012

Website Copier

HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility. It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the "mirrored" website in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads. HTTrack is fully configurable, and has an integrated help system.

WinHTTrack is the Windows 2000/XP/Vista/Seven release of HTTrack, and WebHTTrack the Linux/Unix/BSD release. See the download page.

Installing

[root@(none) ~]# yum install httrack*

Running

[root@(none) ~]# httrack

Welcome to HTTrack Website Copier (Offline Browser) 3.43-9+libhtsjava.so.2
Copyright (C) Xavier Roche and other contributors
To see the option list, enter a blank line or try httrack --help

Enter project name :test

Base path (return=/root/websites/) :

Enter URLs (separated by commas or blank spaces) :www.batan.go.id/ptlr/layanan/

Action:
(enter)    1    Mirror Web Site(s)
    2    Mirror Web Site(s) with Wizard
    3    Just Get Files Indicated
    4    Mirror ALL links in URLs (Multiple Mirror)
    5    Test Links In URLs (Bookmark Test)
    0    Quit
: 1

Proxy (return=none) :

You can define wildcards, like: -*.gif +www.*.com/*.zip -*img_*.zip
Wildcards (return=none) :

You can define additional options, such as recurse level (-r), separed by blank spaces
To see the option list, type help
Additional options (return=none) :

---> Wizard command line: httrack www.batan.go.id/ptlr/layanan/  -O "/root/websites/test"  -%v 

Ready to launch the mirror? (Y/n) :Y

WARNING! You are running this program as root!
It might be a good idea to use the -%U option to change the userid:
Example: -%U smith

Mirror launched on Wed, 29 Aug 2012 07:06:48 by HTTrack Website Copier/3.43-9+libhtsjava.so.2 [XR&CO'2010]
mirroring www.batan.go.id/ptlr/layanan/ with the wizard help..
Done.
Thanks for using HTTrack!
*
[root@(none) ~]#


Testing
Check the contents of the folder /root /websites /.
Change the owner from root to igos or your name:
[root@(none) ~]# chown -R igos:igos /root/websites/

No comments:

Post a Comment