Getting Started:
Some researchers archive just a sample of their digital work, while others opt for full reproducibility by capturing all the web pages of a site and or software and database files and it is entirely up to you which path you choose to embrace.
Remember: Lots of Copies Keeping Stuff Safe (LOCKSS)
Websites
The ability to archive a website is partially based upon how well a site has been developed, so it is suggested that you review our best practices web development guidelines.
The Internet Archive might capture your website, but there is no guarantee that the crawler will find your site and even if it does there are known fidelity issues. Using the Save Page Now feature, users can point the Internet Archive to their web-page, but unlike the paid subscription, Archive-It, users of Save Page Now have limited control in regards to scoping the crawl.
When users are logged in with their free Archive.org account, SPN-generated archives can be saved to that user’s “My web archive” public gallery of archived pages.
Archive all the web pages linked from an email message? Well, you are in luck because now you can forward that email to “savepagenow@archive.org” and after a few minutes you will get an email back filled with Wayback Machine playback URLs.
Read more about Archive.org's Save Page Now feature.
Features (including saving outbound links)of Save Page Now.
If your project is a website, a Web Archive (WARC) file capture of your website is a standard approach to archiving. Visit each page that you desire capturing by using Conifer. Note: You must play the entire recording of a video or audio file. Download the capture as a WARC file, then test using ReplayWeb.page before including it as a part of your deposit in CUNY Academic Works.
ArchiveWeb, is analogous to Conifer accepts it requires installation: https://archiveweb.page/
HTTRACK (a Windows based tool), downloads HTML (non-archival) pages and associated media.
Monitor the Self-Hostable,Open Source section for new tools that might appear.
Reach out to Stephen Klein if you need assistance.
Always use the Library of Congress' recommended file formats. See DPC's The Global List of Digitally Endangered Species 2021.
Note: still images, audio, or video files may supplement a thesis or capstone, thesis dissertation submission in CUNY Academic Works by combining several supplemental files into a single .zip or .tar file.
Alternatively, upload image, audio, and video files to Internet Archive (archive.org) and organize the URLs for inclusion in your CUNY Academic Works submission. Please see the following for more info.
Applications and Software
See the following suggestions for archiving software, etc.
Some of the following is only relevant to institutions (I):
How to Talk to IT about Digital Preservation (I)
The difference between data backup and data archiving, and why it matters to you (I)
The ultimate guide to starting your digital preservation journey (I)
Digital preservation services at digital scholarship centers (I)
Digital Preservation Storage Criteria