Archive Software
Why do I care about Archive Software?
- ArchiveBox
- I have set this up personally
- I realized it cannot use a logged-in session to scrape, so another solution is required for things like paid Substacks or content accessed through a VPN
- Feed in URLs, get them archived
- Won't be able to save the current page you are on
- ArchiveBox Exporter
- Anchorage
- CLI tool to backup to ArchiveBox or Internet Archive
- Save page with single file
- I have this installed on my browsers
- I still can't automate scraping logged-in content
- DiskerNet
- The index does not work nicely
- Launch Chromium and index every page you go to
- Does not work with Brave or Firefox
- I would prefer a Chrome extension
- ArchiveWeb.page
- Extension that will save pages to browser storage
- Has desktop app
- I have no idea what format it is using or how to export
- dweb-mirror
- Just for mirroring Internet Archive stuff
- irchiver
- ONLY SUPPORTS WINDOWS
- Takes images of EVERY PAGE YOU GO ON
- Full-text search
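
For the ArchiveBox workflow noted earlier (feed in URLs, get them archived), here is a minimal sketch of how that could be scripted. It assumes the `archivebox` CLI is installed and that an ArchiveBox data directory has already been initialized; the helper names `prepare_url_list` and `archive_urls` and the `data_dir` path are hypothetical, not part of ArchiveBox. It does not solve the logged-in content problem, since only public URLs are handled this way.

```python
import subprocess
from typing import Iterable, List
from urllib.parse import urlparse


def prepare_url_list(raw_lines: Iterable[str]) -> List[str]:
    """Clean a list of candidate URLs before handing them to ArchiveBox.

    Hypothetical helper: strips whitespace, skips blank lines and
    '#' comments, keeps only http(s) URLs, and drops duplicates
    while preserving the original order.
    """
    seen = set()
    urls = []
    for line in raw_lines:
        url = line.strip()
        if not url or url.startswith("#"):
            continue
        if urlparse(url).scheme not in ("http", "https"):
            continue
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def archive_urls(urls: List[str], data_dir: str) -> None:
    """Pipe URLs into `archivebox add` via stdin, one per line.

    Assumes `data_dir` is an existing ArchiveBox data directory,
    because the CLI is expected to run from inside it.
    """
    if not urls:
        return
    subprocess.run(
        ["archivebox", "add"],
        input="\n".join(urls) + "\n",
        text=True,
        cwd=data_dir,
        check=True,
    )


# Example usage (assumed paths, not run automatically):
#   with open("urls.txt") as f:
#       archive_urls(prepare_url_list(f), data_dir="/path/to/archivebox-data")
```

Keeping the cleanup step separate from the subprocess call means the URL list can be checked without ArchiveBox installed.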