Summary drafted by Isabel Quintana
Meeting themes
Workflow
Human versus machine processing
Finding right mix for scalability and sustainability.
Role of data science and AI.
For both human and machine resolution we need an ensemble of information about an entity.
Administrative/provenance metadata
Provenance metadata very important for certain use cases, e.g. where identification of material is part of the scholarly enterprise. Examples: rare materials, archival data.
Not all consumers have the same needs, and tracking metadata provenance brings scalability issues. Needs must be defined by reference to real use cases and functional requirements.
We can keep the information at a basic level; or we can keep complex data but only expose a subset; or we could expose it all and allow the user to take only what they need.
Role of aggregator may be just that: to aggregate, and not necessarily to vouch for the data collected. But this makes recording provenance important.
How can we track this information? We don’t have a well established or widely shared model, infrastructure, or process for doing this.
Intellectual property/business model questions also play into this issue.
Keeping data in sync
Whose responsibility is this? What are the obligations of providers?
Even if one or two databases exchange data, that data does not usually make it back out to all the contributors to that database.
But heavily centralized and regulated processes (e.g. NACO) also impose costs.
Linked data issues
Some skepticism voiced about linked data.
Bridging or elimination of silos brings great potential for enriching knowledge. Importance of communities who do not participate in library authorities but have names that appear in library authorities. See e.g. RelFinder: http://www.visualdataweb.org/relfinder.php
It allows for very rich representation of relationships. What level of alignment is needed for interoperability and reuse?
What is the best way to do versioning?
Need better understanding of what data and services aggregators need to provide.
Local vs shared URIs
Mint at point of need vs requirements for stability.
Issues of trust/veracity of statements if we don’t maintain data locally and just point to another place.
Risks involved in sameAs assertions.
...
Perhaps NISO can shepherd some development of best practices.
Document and share existing or pilot workflows as model for other users. Examples exist among older projects as well.
What are social (e.g. membership) barriers to acceptance of existing authority infrastructure? E.g. SNAC is arguably more democratic than NACO.
How can we get smaller libraries and publishers to do some identity management “without even realizing they are doing it”? In other words, make is as seamless, automated and low-barrier as possible.
Need for sustainable business models.
Consider other community models, e.g. http://archesproject.org, http://vocab.getty.edu.
Potential areas for future work
Project outputs
Draft of white paper that will serve as a public discussion document, and outlining key issues and recommendations for joint action
Survey relevant communities on local authority creation
Compile actual or model workflows. Identify stakeholders and roles, including service providers
Share information on how Harvard is resolving issues with their local authorities, and model an identity management workflow
ISNI workflows and use cases
Workflows from BIBFLOW project
Identify patterns for sharing data
Further analysis of roles of stakeholders, e.g. publishers, service providers, libraries, academic institutions, scholars
Outline of how ISNI and VIAF relate to local authority needs
Possible draft specifications
Minimum viable product specification for identities (low-barrier entity creation)
Draft a document on obligations of data providers
Draft a specification for publishers to supply identity data
Examine CrossRef as a potential model
Application developers’ requirements
Potential collaborations and community actions
Explore opportunities to work with publishers to acquire IDs with their data
Bring issues concerning identities to the FOLIO project’s UX team; encourage collaboration with community partners
Document use cases and requirements for reconciliation as a service
Propose ways to share information on matching algorithms
Sustainability is a consideration
Utah IMLS-funded regional authority control project
Consider how such a service may or may not interact with a larger hub such as ISNI.
Initiate discussion on lowering the barriers to NACO participation
Ways of sharing “how to” information
Other issues for further investigation
Use cases and models for administrative and provenance metadata
Change management
Sustainability and business models
Linked data infrastructure
Privacy
Others?
Next steps and second meeting
Future project discussions will be topic-oriented, with nominated discussion leaders.
- Asynchronous communications preferred, discussion leads will schedule periods for focussed discussion
- Additional participants on Google Group welcome
- Consider spinning off discussions in other forums if appropriate
Partners may consider possibilities for further grant proposals
Project leads in discussions concerning the possibility of holding the second meeting in DC
Projected dates for second meeting in March or April 2017