Difference between revisions of "Offline mirror with wget"

From Noah.org
Jump to navigationJump to search
Line 4: Line 4:
  
 
<pre>
 
<pre>
wget -m -np -p -P OUTPUT_DIR -k -nH --cut-dirs=2 http://www.example.org/dir1/dir2/index.html
+
wget --mirror --no-parent --page-requisites --convert-links --no-host-directories --cut-dirs=2 --directory-prefix=OUTPUT_DIR http://www.example.org/dir1/dir2/index.html
 
</pre>
 
</pre>
  
 
<pre>
 
<pre>
-m : Mirror. It is currently equivalent to -r -N -l inf --no-remove-listing.
+
--mirror : Mirror is currently equivalent to -r -N -l inf --no-remove-listing.
-np : no parent files - only download files that are under the given URL.
+
--no-parent : Do not ascend to the parent directory. Only download files that are under the given URL.
-p : Download all page requisites to display the page.
+
--page-requisites : Download all page requisites necessary to display the page (images, CSS, javascript, etc.).
-P <OUTPUT_DIR> : path to the mirror site destination
+
--convert-links : Convert links in the pages so that they work locally relative to the OUTPUT_DIR.
-k : convert links in the pages so that they work under the OUTPUT_DIR.
+
--no-host-directories : Don't create host name directories.
-nH : don't create host name directory.
+
--cut-dirs=n : Remove n directories from the path of the URL.
--cut-dirs=n : remove n directories from the path of the URL.
+
--directory-prefix=<OUTPUT_DIR> : Set path to the destination directory where files will be saved.
 
</pre>
 
</pre>

Revision as of 04:10, 20 August 2008


This downloads the given document and all parts it needs to be viewed offline. The number set by --cut-dirs must match the number of parent directories in the URL (dir1 and dir2).

wget --mirror --no-parent --page-requisites --convert-links --no-host-directories --cut-dirs=2 --directory-prefix=OUTPUT_DIR http://www.example.org/dir1/dir2/index.html
--mirror : Mirror is currently equivalent to -r -N -l inf --no-remove-listing.
--no-parent : Do not ascend to the parent directory. Only download files that are under the given URL.
--page-requisites : Download all page requisites necessary to display the page (images, CSS, javascript, etc.).
--convert-links : Convert links in the pages so that they work locally relative to the OUTPUT_DIR.
--no-host-directories : Don't create host name directories.
--cut-dirs=n : Remove n directories from the path of the URL.
--directory-prefix=<OUTPUT_DIR> : Set path to the destination directory where files will be saved.