Delete comment from: DSHR's Blog
Professor Wildani and myself often had discussions about what (if anything) could be labeled as "Archive by accident" and if there's value in it/should we care. The net result of the discussions was a resounding "Who knows", and back to the usual problems around identifying high-value data without an oracle lest we become packrats and data hoarders and all the problems that entails.
I remember at a Daghstuhl workshop a few years back (you were there, I believe), talking with folks about doing crude automatic triage, tossing near-duplicates, flagging things for a human to pick at etc, under the assumption that we will inevitably toss things that may be valuable, but we may still wind up with a greater corpus of "useful" stuff with a reduced workload. Potentially an intractable problem, but we can dream :)
Regardless, interesting stuff! Thanks for sharing!
Aug 22, 2018, 12:39:23 PM
Posted to Optical media durability