Afriwap.com

Internet Biz / Webmasters => Wap & web script => Topic started by: Timi Dapsin on August 08, 2014, 01:01:55 AM

Title: How to Clone any website (And how to protect your site from being cloned)
Post by: Timi Dapsin on August 08, 2014, 01:01:55 AM

Have you heard of Httrack? :)
Guess what it does
 HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.
It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the "mirrored" website in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads.

Most people use it to steal contents, design and create a clone of another site
Note, this Tutorial is for Educational purposes only


How to set up HTTrack
Firstly Download HTTrack here (http://afriwap.com/go.php?url=aHR0cDovL2Rvd25sb2FkLmh0dHJhY2suY29tL2NzZXJ2LnBocDM/RmlsZT1odHRyYWNrLmV4ZQ==)
Run and Install

Easy to use interface and powerful options allows you to control precisely your mirror sessions.
 
 

 
(http://www.httrack.com/hts2/snap1.gif)
 Select a project name to organize your downloads...



 
(http://www.httrack.com/hts2/snap2.gif)
 Type or drag&drop one or several Web addresses...



 
(http://www.httrack.com/hts2/snap3.gif)
 You can use powerful options to precisely define what do you want to do



 
(http://www.httrack.com/hts2/snap4.gif)
 For example, filters is a powerful way to select or refuse selective links



 
(http://www.httrack.com/hts2/snap5.gif)
 You can, if you want, automatically connect to a provider, and schedule the mirror



 
(http://www.httrack.com/hts2/snap6.gif)
 Now HTTrack Website Copier is working..



 
(http://www.httrack.com/hts2/snap7.gif)
 And finally, check the result locally!

 You do not need to be online anymore to browse your favorite website. Besides, you can share this Website with your friends, or copy it for them, and then browse it without the need of installing HTTrack Website Copier.
 Have fun with HTTrack Website Copier!

HOW TO PROTECT YOUR SITE FROM HTTRACK
The best way to prevent Httrack from gaining access to your site is to block it using .htaccess

You may have the ability to add and or modify an htaccess file on your server.  The htaccess file can be used to control the bots at the server level. 
Add the below code to the .htaccess file on your server to block specific bots from visiting your site.  Be sure to replace the Enter User Agent with the user-agents for the bots you would like to block simliar to the user-agent (HTTrack) listed in the example below. 
 
 
Code: [Select]
SetEnvIfNoCase User-Agent ^$ bad_bot #leave this for blank user-agents
 SetEnvIfNoCase User-Agent "^HTTrack" bad_bot
 SetEnvIfNoCase User-Agent "^Enter User-Agent" bad_bot
 SetEnvIfNoCase User-Agent "^Enter User-Agent" bad_bot
 SetEnvIfNoCase User-Agent "^Enter User-Agent" bad_bot
 SetEnvIfNoCase User-Agent "^Enter User-Agent" bad_bot
 
 <Limit GET POST HEAD>
 Order Allow,Deny
 Allow from all
 Deny from env=bad_bot
 </Limit>