Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What I'm looking for is more of a solution to save a webpage as it is. Kind of like Internet Archive but better for individual pages. The thing with Internet archive is some things stop working, like mobile view etc


Perhaps ArchiveBox[0] will work for you? A self-hosted archiver to save websites in various formats. Has a section on that page for alternatives as well that might work too.

[0]: https://archivebox.io/


Singlefile [1] works pretty well for me for that use case.

It has the added advantage that the file format is just plain HTML, and together with “reader mode” in most browsers, it’s a great way to save long-form text or other mostly static pages for later reference.

It obviously doesn’t work for very dynamic pages, let alone web apps.

[1] https://github.com/gildas-lormeau/SingleFile


You can also use Zotero paired with the Zotero Connector Chrome/Firefox extension for this, I use Zotero as my document archival in addition to my academic citation manager. The Zotero Connector saves any PDF or web page opened in your browser to a local PDF/HTML file, and for HTML under the hood is uses the SingleFile extension to package the whole web page, images and all, into just one file.

It works well - now I'm just looking for a good way to annotate/highlight the local HTML/PDF/ePUB files cross-OS cross-platform. KOReader (https://github.com/koreader/koreader) works pretty well for this with its new hash-based storage option.


In addition to the various tools mentioned elsewhere sometimes saving the page as markdown in Joplin with their page saving tool is the best way for further use


In addition to archivebox and singlefile/singlefilez, I'd recommend scoop[0] and archiveweb/webrecorder[1]. Both create WACZ format archives using a browser which has slightly better fidelity than the way archivebox creates WARCs (using wget). There's also Save Page WE[2][3] which does something similar to singlefile.

0. https://github.com/harvard-lil/scoop

1. https://archiveweb.page/

2. https://addons.mozilla.org/en-US/firefox/addon/save-page-we/

3. https://chrome.google.com/webstore/detail/save-page-we/dhhpe...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: