anyone know of articles/blogs and/or has personal experience with:
AI-bot scrapers hindering web archiving / web preservation
let me know!
Okay, as best as I can tell, if I want to give my colleagues a good alternative to drag-n-drop that preserves timestamps, I should recommend
```
robocopy \path\source \path\destination /e /dcopy:t /eta /z
```
Windows #digipres people, does that look right?
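One way to check the result after a copy like this is to compare modification times between the two trees. This is a minimal Python sketch, not part of robocopy: the function name `find_timestamp_mismatches` and the 2-second default tolerance are assumptions made for illustration (2 seconds matches the timestamp granularity of FAT file systems).

```python
from pathlib import Path


def find_timestamp_mismatches(source, destination, tolerance_seconds=2.0):
    """Return relative paths whose modification time differs between the trees.

    Files and directories missing from the destination are reported as well.
    """
    source = Path(source)
    destination = Path(destination)
    mismatches = []
    # rglob("*") yields both files and directories below the source root,
    # so directory timestamps are checked too.
    for src_entry in sorted(source.rglob("*")):
        relative = src_entry.relative_to(source)
        dst_entry = destination / relative
        if not dst_entry.exists():
            mismatches.append(str(relative))
            continue
        delta = abs(src_entry.stat().st_mtime - dst_entry.stat().st_mtime)
        if delta > tolerance_seconds:
            mismatches.append(str(relative))
    return mismatches
```

Calling `find_timestamp_mismatches(r"\path\source", r"\path\destination")` returns an empty list when every file and folder under the source kept its modification time.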
Brilliant floppy disk badges at the #BDCAM25 Conference #DigitalPreservation #DigiPres
The Xiph.Org Foundation has for many years hosted a collection of #lossless audio and video test clips to support the compression research community.
Unfortunately, that server may be going away soon. I'm looking at various options to keep the data online, but I wondered if it would be appropriate to upload a copy to @internetarchive. The current collection is around 20 TB.
If so, what would be a good format? For clips originating in yuv, ffv1 or lossless vp9 compression is probably a good choice. (Uncompressed is usually best for benchmarking, but people can extract on their own.) Downloading multi-GB files is easier than it used to be. However, the highest-quality versions of some clips are directories of png, tiff, or exr files. Not sure what to do with those. Tar them up? Will archive.org fall over if I upload 20,000 files as part of the same object?
#Wikidata and the sum of all video games − 2024 edition: status update on our endeavour to become the hub of all video game metadata: 110,000 items, 70 new identifier properties, and a lot of video game genres.
https://commonists.wordpress.com/2025/03/24/wikidata-and-the-sum-of-all-video-games-2024-edition/
#DPC members! Join us this Friday for an engaging Audiovisual Special Interest Group (AVSIG) session featuring one of our newest DPC members, RTÉ Archives.
Presentation by Adrienne Warburton (RTÉ Archives) on mass digitization of audio and audiovisual assets and RTÉ's #digitalpreservation work
Friday March 28th, 2-3pm UTC
Online members-only event
Register now: https://www.dpconline.org/events/eventdetail/475/-/audio-visual-special-interest-group
Hi fedi, another question re #SafeguardingResearch:
We got a couple of #Tableau data/visualizations that we need to archive.
But: the download is limited to Image, PDF or PowerPoint.
https://public.tableau.com/app/profile/dutytoserve/vizzes
Update: Currently looking into automating the step 'downloading PDF, with all sheets of the workbook'.
Ideas on how to accomplish this?
Question re #SafeguardingResearch
We encounter 'web applications' that our current method of archiving doesn't preserve.
Things like [we need a better example, this one is already gone (but the data preserved) https://social.coop/@edsu/114206452552797815]
We are mostly using https://github.com/openzim/zimit to create WARC files and combining them into a single ZIM.
(This uses the browsertrix crawler)
Any ideas on how to archive not just the content, but also the functionality of such applications?
#DigiPres #Web #Archiving
I'm working on enabling the #DigiPres parts of our org by making some of the #COPTR Tool Grid apps available securely in a managed #Windows environment - first cab off the rank is a #Java app with bundled #JRE, which won't play well with our env
does anyone have experience using #maven & #jpackage as part of a build pipeline to create OS-native installers? my short-term target is a Windows MSI, but once that's in place the same pipeline should be able to spit out native installers for macOS (which we also use) & Linux (which we don't, yet)
I'm at the point where I'm about to clone the repo & start tinkering but would much rather re-use something already built than figure it out from docs & example code...
#boost4reach pls
A #digipres project I really don't have the skill for (nor the time!) but wish someone could do: improving bagger.
It's like 90% of the way towards being a great GUI application, except that
1. It doesn't retain timestamps when copying files
2. The "save bag" option always generates MD5 manifests, no matter what algorithm was previously used
3. The file selection dialogs are very cumbersome
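As a stopgap for item 2, a SHA-256 payload manifest can be written outside bagger. This is a minimal Python sketch, not bagger's own code: the function names are made up for illustration, and it assumes the standard BagIt layout with payload files under `data/` and one `checksum  path` line per file. It does not update tag manifests or `bag-info.txt`.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Hash a file in chunks so large files do not have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest_lines(bag_dir):
    """Return one 'checksum  data/relative/path' line per payload file."""
    bag_dir = Path(bag_dir)
    lines = []
    for payload_file in sorted((bag_dir / "data").rglob("*")):
        if payload_file.is_file():
            # Manifest paths are relative to the bag root and use forward slashes.
            relative = payload_file.relative_to(bag_dir).as_posix()
            lines.append(f"{sha256_of_file(payload_file)}  {relative}")
    return lines


def write_sha256_manifest(bag_dir):
    """Write manifest-sha256.txt into the bag root and return its path."""
    manifest_path = Path(bag_dir) / "manifest-sha256.txt"
    content = "\n".join(build_manifest_lines(bag_dir)) + "\n"
    manifest_path.write_text(content, encoding="utf-8")
    return manifest_path
```

A bag changed this way would still need its tag manifest regenerated and should be validated with a BagIt tool before anyone relies on it.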
Here's an idle #digipres question: do we know the origins of the 3-2-1 rule? As in, where and when it was first put forward?
I bring it up often in grad school writing, where I'm supposed to cite it, and I always wonder about its ultimate source
Out now, together with @gamestudies.bsky.social:
Social Playing? Überlegungen zu einem vernetzten Spielerlebnis (Social Playing? Reflections on a Networked Gaming Experience)
TL;DR It would be great if we could share our gaming with each other in a directly "playable" form. Digital Humanities and #DigiPres already know how to do it. But the platform economy of the games industry doesn't allow it; instead we get streaming on Twitch, where we are only allowed to film our displays.
Ok #avpres question. We recently digitized an old U-matic tape with a documentary on it. It has only one stereo audio stream, with English on the left channel and Vietnamese on the right. Currently the MKV only lists English for the audio stream. Without splitting the audio stream into two, what is the best way to manage the technical metadata? #digipres
Dang, I'm only two pages into this article, and it's already just straight fire about the state of #digipres practice and literature
Kyna Herzinger, Caroline Daniels, and Heather Fox, “Preservation Not Paralysis: Reflections on Launching a Born-Digital Preservation Program,” Collections 17, no. 4 (December 1, 2021): 347–71, https://doi.org/10.1177/1550190620978221.
David Rosenthal has written up a fantastic blog post from a talk he did on the current state of archival storage - the tech, the pros, the cons, and more. I also like that he points out that archival storage is NOT the same as a data backup, because it might be easy to conflate the two. The post is well worth a read.
(Thank you to a few people who have shared this around!)
https://blog.dshr.org/2025/03/archival-storage.html
#archives #digipres
New blog post!
On Preserving Australian First Nations Digital Cultural Heritage
https://kristinamason.com/on-preserving-australian-first-nations-digital-cultural-heritage/
#glam #digipres #digitalhumanities #archives #museums #indigenous
Has anyone encountered Time Machine backups in their acquisitions/processing? And/or has anyone written about Time Machine backups from a #digipres perspective?
Now Released: Mental Health & Wellbeing in the Digital Preservation Community
This survey explores workplace wellbeing among digital preservation practitioners, highlighting:
• Effects of solitary work environments
• Impact of high workload demands
• Challenges of advocacy work
More info on https://www.dpconline.org/news/mental-health-and-wellbeing-report-launch
Straight to the report? Follow this link https://doi.org/10.7207/mhw2025
“While much of the early literature in the field of digital archives is focused on very meticulous #digipres, the growing mass of digital data to be preserved has led many practitioners to streamline their workflows. What’s something you’ve chosen to deprioritize or cut from your workflows? How did you make the decision? What effect has it had on your work, your staff, and your collections?”
Just stumbled over "FOLIO":
https://docs.folio.org/docs/about-folio/
"an open source Library Services Platform (LSP)"
Is anyone familiar with this in practice?
#DigiPres