The Xiph.Org Foundation has for many years hosted a collection of #lossless audio and video test clips to support the compression research community.
Unfortunately, that server may be going away soon. I'm looking at various options to keep the data online, but I wondered if it would be appropriate to upload a copy to @internetarchive The current collection is around 20 TB.
If so, what would be a good format? For clips originating in yuv, ffv1 or lossless vp9 compression is probably a good choice. (Uncompressed is usually best for benchmarking, but people can extract on their own.) Downloading multi-GB files is easier than it used to be. However, the highest quality for some clips are directories of png, tiff, or exr files. Not sure what to do with those. Tar them up? Will archive.org fall over if I upload 20,000 files as part of the same object?