Research and Development for Digital Cultural Heritage Preservation: A (Virtual and In-Person) Open Forum

The following is a guest post from Joshua Sternfeld, National Endowment for the Humanities and Gail Truman, Truman Technologies. The statements and ideas expressed here are attributed solely to the authors and do not necessarily reflect those of any federal agency or institution.

The collection on the wall from user patentboy on <a href="">Flickr</a>.

The collection on the wall from user patentboy on Flickr.

As the National Digital Stewardship Alliance prepares to add new categories of content to its 2015 National Agenda for Digital Stewardship, including digital art and software, now is the ideal opportunity to assess the state of research and development for the preservation of digital cultural heritage.  In many respects, digital cultural heritage is dependent on some of the same systems, standards and tools used by the entire digital preservation community.  Practitioners in the humanities, arts, and information and social sciences, however, are increasingly beginning to question common assumptions, wondering how the development of cultural heritage-specific standards and best practices would differ from those used in conjunction with other disciplines.  As many in the humanities and arts point out, digital cultural heritage materials encompass a dizzying array of formats, genres, disciplines, and institution repository types, which bring with them unique intellectual and technical challenges for their preservation.  Most would agree that preserving the bits alone is not enough, and that a concerted, continual effort is necessary to steward these materials over the long term.

We might think of the development of digital cultural heritage standards and practices as a two-way street.  On the one hand, a humanistic or artistic perspective may challenge digital preservation norms that often originate from industry leaders in the private sector or disciplines that seem distant, or even antithetical, to the needs of the humanities and arts user communities.  On the other hand, by elevating the needs of this user community – from artists and scholars, to educators, curators, media makers and students – we may be able to influence a combination of public and private interests to support more targeted user-centric development.  For example, we are just now beginning to consider how adjustments to conventional storage architectures, such as use of abstraction and distributed, cloud-based services, may result in radically different means of organizing, sharing and visualizing cultural heritage data.

The humanities and arts can also bring heightened clarity or awareness of practices and concepts — including selection of content, appraisal, and authenticity — inherent to all digital preservation.  As pressure mounts to ingest exponentially increasing amounts of data, repository stewards are facing difficult decisions to streamline the acquisition and preservation of their collections.  By their nature, the humanities encourages critical interrogation of selection practices, even as they move toward automation.  Similarly the appraisal of digital data by preservationists and users alike has exceeded the capacity of human intervention alone, which has necessitated creative solutions to generating metadata, mining and visualizing “big data,” and accessing complex audiovisual and interactive media.

For the 2014 Digital Preservation Conference hosted by NDSA, the two of us, on behalf of the NDSA Arts and Humanities Content Working Group, will lead an open discussion to identify pervasive issues found in digital cultural heritage that in time may lead to standards and practices adopted widely by those working in museums, archives, libraries, arts organizations, universities and beyond.

In many ways, the session will serve as a follow-up to the 2012 Digital Preservation plenary session “Preserving Digital Culture.” During that session, Megan Winget, then at the University of Texas at Austin, characterized the preservation of digital cultural heritage as a series of “wicked problems,” each of which is “novel and unique” and for which no single solution is “right or wrong, but [only] better and worse.”  If there was one message from the session, it was that work in digital cultural heritage requires a creative balance of intellectual, theoretical, technical, social, and aesthetic matters.  Building upon a spate of initiatives, conferences and studies in recent years, this year’s session will pose whether and how we can both embrace the novel properties intrinsic to each work or collection, while investigating the possibility of developing shared practices and standards.

At the heart of the discussion we’ll pose this question:  What elements contribute to a successful research and development project in digital cultural heritage that results in the adoption of standards and practices?  While it may seem obvious that an interdisciplinary project team comprised of members with diverse backgrounds ought to be a given, finding just the right balance – not to mention resolving differences in methodologies, vocabulary, and theories — may seem more elusive.  Expanded adoption of a new standard or practice requires significant buy-in from the community by tapping into an ever-evolving scaffolding of knowledge, data, case studies, education and tools in order to sustain continued growth and investment.  In short, a more organized and concerted effort is needed, which historically has proven difficult in the arts and humanities-related preservation fields.

The second half of the discussion will move toward areas of current or possible future interest.  The recent work underway by a team assembled by the Smithsonian in the area of time-based media and digital art can serve as a model in building a collaborative, on-the-ground framework for research and development.  Similarly, a series of conferences investigating the preservation of software, including Preserving.exe, has revealed the importance of integrating diverse voices from the cultural heritage community.  Other areas open for discussion that may benefit (or have already benefited) from enhanced attention from the humanities and arts communities may include digital forensics, web archiving, mass digitization, sustainability or metadata schema development, to name just a few.

In true humanistic fashion, the forum will likely raise more questions than provide answers.  Nonetheless, as session chairs we hope that a framework for future discussion and action will emerge.  This blog posting, therefore, is intended to serve as an open invitation to the NDSA community and beyond to offer ideas, discussion points, challenges, areas of research and examples that may be submitted in the comments section below, and which will help inform the in-person session in July.  For those unable to attend the conference, the chairs will make any session materials accessible afterwards.


  1. Anne Wootton
    July 5, 2014 at 5:18 pm

    I’m interested in where journalism and news fit into the conception of digital cultural heritage preservation.

    We’ll be discussing this topic in a session on Wednesday 7/23 dedicated to Preserving Born Digital News, and we will attempt to focus on building connections between existing archival communities and resources with different news orgs and their respective digital preservation issues.

    But I doubt we will have time to focus more broadly on what role news plays in digital cultural heritage. This is a tricky question, given that the state of news today is one of flux and evolution. For example, the NDNP is a partnership of the Library of Congress and NEH. What about the small news orgs of today, whether regional newspapers shifting focus to online-first or national news organizations creating digital news interactives and applications?

    My hope is that some of the conclusions and takeaways from Tuesday’s R&D for Digital Cultural Heritage Preservation panel will inform Wednesday’s digital news preservation panel, and our efforts moving forward.

    See blog post on this topic forthcoming on the Signal the week of 7/7.

    See also:

  2. Deborah Kempe
    July 6, 2014 at 4:51 pm

    As a librarian in a museum-affiliated research library charged with establishing a web archiving program, I am very pleased to learn of the forum at the upcoming Digital Preservation Conference. The authors of this blogpost succinctly summarize the sometimes unique perspective of cultural heritage institutions attempting to address the broad need for digital preservation. From my perspective, the recognition of this need has been belated from the community and the consequences of loss of heritage are only now becoming more widely understood.
    Clearly there needs to be a collaborative approach that will bring together multiple stakeholders, in order to encourage individuals and institutions to preserve their artistic and research output, and to find harmonized ways to optimize what is collected and made accessible. It is a daunting task, but there is strength in numbers–I applaud the beginning of an effort at this year’s conference and hope it will lead to constructive next steps.
    Deborah Kempe, Frick Art Reference Library of The Frick Collection

  3. Gail Truman
    July 9, 2014 at 1:08 pm

    @Deborah – Are there specific stumbling blocks and areas you see as imperative to improve on, relating to the digital web archives and content at Frick?

  4. Amar Kapadia
    July 9, 2014 at 8:45 pm

    I’d be interested to see if cloud technologies can help with this problem.

  5. Tom Trimbath, Project Manager HCLE
    July 10, 2014 at 5:21 pm

    Where are there more details about the virtual part of the “(Virtual and In-Person) Open Forum”?

  6. Butch Lazorchak
    July 11, 2014 at 10:13 am

    @Tom, I think this blog post is the virtual part šŸ™‚

  7. Josh Sternfeld
    July 14, 2014 at 3:15 pm

    Indeed, Gail and I intended for the Comments section to be the start of a virtual forum. We are also monitoring a Q&A post over in the Digital Preservation site: We are open to other suggestions for sustaining the discussion!

  8. Lily Pregill
    July 17, 2014 at 8:46 am

    I work with the New York Art Resources Consortium and we are six months into a Mellon-funded project to develop a web archiving program to capture, preserve, and provide access to art historical research materials (not the art objects themselves). You can learn more about our project at:

    The sheer amount of born-digital and web-based materials in our domain make comprehensive collecting for any single institution (or in our case a consortium of three museum libraries) an impossible task due to the realities of resource limitations (data budgets, funding, staff, etc.). As a realist, Iā€™m somewhat comforted by the idea of the archival sliver, but am also interested in the LAM community developing scalable collaborative models to collectively address the need to preserve our cultural heritage. In the Archive-IT community, perhaps a registry of special interest groups could be used to show what seeds are actively collected to minimize duplication of effort. At NYARC, we have been talking with colleagues at other institutions that are also web archiving what we call artist file materials (artist websites, gallery announcements, etc.) and tossing around ideas about how we could track who is collecting what, provide an integrated search across multiple Archive-It partner collections, and perhaps develop a funding model to sustain our joint collecting activities. I would love to hear from the community about other ideas or examples!

    Looking forward to this discussion. Thanks to Gail and Josh for organizing this session.

  9. swati sinha
    October 19, 2014 at 2:30 am

    I would like to know what should be the Ph.D research problem on this preserving cultural heritage in computer vision. So can you help me by suggesting something on that.

Add a Comment

This blog is governed by the general rules of respectful civil discourse. You are fully responsible for everything that you post. The content of all comments is released into the public domain unless clearly stated otherwise. The Library of Congress does not control the content posted. Nevertheless, the Library of Congress may monitor any user-generated content as it chooses and reserves the right to remove content for any reason whatever, without consent. Gratuitous links to sites are viewed as spam and may result in removed comments. We further reserve the right, in our sole discretion, to remove a user's privilege to post content on the Library site. Read our Comment and Posting Policy.

Required fields are indicated with an * asterisk.