Artwork

内容由HPR Volunteer and Hacker Public Radio提供。所有播客内容(包括剧集、图形和播客描述)均由 HPR Volunteer and Hacker Public Radio 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal
Player FM -播客应用
使用Player FM应用程序离线!

HPR4293: HTTrack website copier software

 
分享
 

Manage episode 461179697 series 32765
内容由HPR Volunteer and Hacker Public Radio提供。所有播客内容(包括剧集、图形和播客描述)均由 HPR Volunteer and Hacker Public Radio 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal

This show has been flagged as Clean by the host.

The Wayback Machine by The Internet Archive is a very good resource for web sites no longer existing or older revisions of them.

However, sometimes I have also found it is nice and useful to have my own copy of a web site. It means I have control over the copy, it can be accessed offline and no world wide wait for the page to load.

My most typical use case if for web sites that I am manager of myself. For one or another reason, I want to keep a snapshot of the site. I have also used it for fact based sites which I want to always have access to, like a reference book. One of my recent use cases was a magazine that has closed down and announced the web site will also soon be terminated. Although it is available in the Wayback machine, I wanted to have a copy myself for a short period of time.

The software I use for this HTTrack. This software is available for Windows, Android, Linux and unix-like systems. It is at least for some platforms available with a graphical user interface. I have myself only used HTTrack with the terminal interface on Linux. HTTrack is a free and open source software.

In its simplest way to operate, it is just to type "httrack" followed by the url to the start page of the site to be copied.

In many cases this works well, I get a perfect copy. In other cases, it works less well. First of all, of course, I do not copy very big websites, both for the amount of time it takes and the disc space. What is stated in the robot textfile can also matter for the result. Another issue can be the folder structure of the site, HTTrack may not find all folders in its default setup, for example how images are stored. I have myself also got issues when menues and links not works normally where I instead have to right click to open the link.

The HTTrack web site has quite a lot of information in the documentation and it also has a forum. And in the terminal, there is also good help about all additional available commands. I have in general for my usage found the simple first attempt to copy sites gives perfect or good enough result directly without need to research details.

So, when I want to preserve snapshot of earlier releases of my own sites or when I want to have an offline and preserved copy of an important site, I consider HTTrack to be an easy to use and yet powerful tool. I am aware other similar tools exist, but this is the one I currently use.

HTTrack website copier website: https://www.httrack.com/

Provide feedback on this episode.

  continue reading

859集单集

Artwork
icon分享
 
Manage episode 461179697 series 32765
内容由HPR Volunteer and Hacker Public Radio提供。所有播客内容(包括剧集、图形和播客描述)均由 HPR Volunteer and Hacker Public Radio 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal

This show has been flagged as Clean by the host.

The Wayback Machine by The Internet Archive is a very good resource for web sites no longer existing or older revisions of them.

However, sometimes I have also found it is nice and useful to have my own copy of a web site. It means I have control over the copy, it can be accessed offline and no world wide wait for the page to load.

My most typical use case if for web sites that I am manager of myself. For one or another reason, I want to keep a snapshot of the site. I have also used it for fact based sites which I want to always have access to, like a reference book. One of my recent use cases was a magazine that has closed down and announced the web site will also soon be terminated. Although it is available in the Wayback machine, I wanted to have a copy myself for a short period of time.

The software I use for this HTTrack. This software is available for Windows, Android, Linux and unix-like systems. It is at least for some platforms available with a graphical user interface. I have myself only used HTTrack with the terminal interface on Linux. HTTrack is a free and open source software.

In its simplest way to operate, it is just to type "httrack" followed by the url to the start page of the site to be copied.

In many cases this works well, I get a perfect copy. In other cases, it works less well. First of all, of course, I do not copy very big websites, both for the amount of time it takes and the disc space. What is stated in the robot textfile can also matter for the result. Another issue can be the folder structure of the site, HTTrack may not find all folders in its default setup, for example how images are stored. I have myself also got issues when menues and links not works normally where I instead have to right click to open the link.

The HTTrack web site has quite a lot of information in the documentation and it also has a forum. And in the terminal, there is also good help about all additional available commands. I have in general for my usage found the simple first attempt to copy sites gives perfect or good enough result directly without need to research details.

So, when I want to preserve snapshot of earlier releases of my own sites or when I want to have an offline and preserved copy of an important site, I consider HTTrack to be an easy to use and yet powerful tool. I am aware other similar tools exist, but this is the one I currently use.

HTTrack website copier website: https://www.httrack.com/

Provide feedback on this episode.

  continue reading

859集单集

所有剧集

×
 
Loading …

欢迎使用Player FM

Player FM正在网上搜索高质量的播客,以便您现在享受。它是最好的播客应用程序,适用于安卓、iPhone和网络。注册以跨设备同步订阅。

 

快速参考指南

边探索边听这个节目
播放