Session Block 1 – Wednesday, June 3, 10:30-12:00
A1: Secure Virtual Research Environments
- Time: 10:30 - 12:00
- Location: Blegen Hall 130
- Chair: Jen Darragh
- Track: Research Data Management
An overview of the University of Alberta Health Research Data Repository (HRDR) secure virtual research environment
- Presenter: James Doiron, University of Alberta
- Abstract: Located within the Faculty of Nursing at the University of Alberta, Canada, the Health Research Data Repository (HRDR) is a secure virtual research environment (VRE) developed to support the security, confidentiality, access, and management of health related research data. The HRDR's operational phase commenced in January 2013 and at the time of the writing of this abstract thus far has provided support to over forty-five multi-disciplinary and collaborative health related research projects, both quantitative and qualitative in nature, and with an excess of 125 users across local, national, and international institutions accessing these. Project level services provided by the HRDR includes such things as support for grant writing and ethics submissions; data management planning, guidance and training; comprehensive assessments for resource needs including security, project space set-up, access, and analytic software requirements; detailed user orientations; completion of privacy impact assessments; data acquisitions; and secure file transferring (ingests/extracts). Examples of health related research projects that have benefited from these services will be presented. Additionally, a brief overview of the development and current status of the HRDR, including its policies and procedures, technical infrastructure, and cost recovery model will be discussed.
Improving Access to Documentation on Restricted Labor Market Data
- Presenter: Stephanie Jacobs, Cornell University
- Abstract: The Cornell Institute for Social and Economic Research (CISER) and the Institute for Employment Research (IAB) of the German Federal Employment Agency have developed a data service enabling approved researchers to securely access confidential administrative microdata on labor markets. The data files contain detailed information on employment, unemployment benefit receipts, participation in labor market programs and registered job search, and a large number of socio-economic characteristics. Remote access to IAB's Scientific Use Files is available to any researcher approved by IAB, no affiliation with Cornell is required. This presentation demonstrates how researchers can use CED2AR to search through the metadata for IAB's Scientific Use Files (SUF) to locate variables of interest and other documentation essential to formulating and carrying out a research plan. CED2AR, funded by the National Science Foundation (NSF) and developed by the Cornell Node of the NSF Census Research Network (NCRN), is designed to improve the discover-ability of both public and restricted data. The project is based upon leading metadata standards and ingests data from a variety of sources. The addition of IAB SUF to CED2AR enables researchers to search across multiple labor market data series including the US Census Bureau's LEHD.
Sharing Research Data in Academia
- Presenters: Benedikt Fecher (HIIG & DIW, Berlin); Sascha Friesike (HIIG, Berlin); Marcel Hebing (DIW, Berlin); Stephanie Linek (ZBW, Berlin)
- Abstract: Shared research data in academia is associated with considerable benefits. It makes studies reproducible and enables other researchers to ask new questions based on old data. Thereby data sharing in academia makes research more transparent and fosters innovation. However, curating, archiving and making data available for others is far from being the rationale for good scientific practice. The research project "Data Sharing in Academia" http://data-sharing.org aims to identify factors for efficient data re-use. After a systematic review of scientific publications on data sharing and a qualitative analysis as part of the SOEP User Survey 2014, we conducted a quantitative survey of researchers from all disciplines (n ~ 1500). In this presentation we present a generic framework for data sharing in academia and the first results from our survey.
A web-based data management tool for collaborative studies
- Presenters: Dafina Kurti and Alexia Katsanidou, GESIS Leibniz Institute for the Social Sciences
- Abstract: In large-scale collaborative research projects, be it national or cross national data collections, each project phase - study planning, fieldwork, data processing and documentation, and data depositing - require a careful data management and a good coordination among different teams. The web-based data management portal will provide a virtual collaborative work space for researcher teams and different stakeholders of projects. By including modularised management tools, the platform will support research projects according to their requirements, allowing an easy, structured, time saving, and secure communication, workflow and data transfer, no matter in which phase of research life cycle the project is. We will present the final concept of this infrastructure with fully fledged user cases, which was developed based on the evaluation of a) practices in existing data management portals (EVS, ISSP, ESS), b) researcher needs (European Election Study, Eurofund), c) and the recommendations we worked out together with the project managers and principal investigators.
The ch-x "experiment": building experience in good data management practices
- Presenter: Alexandra Stam, Swiss Centre of Expertise in the Social Sciences (FORS)
- Abstract: The Swiss Federal Surveys of Adolescents (ch-x) are long established large-scale surveys conducted amongst 19-21 year old Swiss citizens. They consist of near full coverage of young men drafted to the army (about 40,000), as well as a sample of 2,000 women of the same age. While data service staff at FORS (Swiss Centre of Expertise in the Social Sciences) are usually occupied with the archiving of national survey data, they won the open competition to lead the 2016-17 ch-x edition. This was a great opportunity for FORS, not only to produce and make available fascinating data on youth mobility for secondary use, but also to strengthen staff expertise in key areas of data management. The ch-x project provides a unique opportunity to reflect on data management throughout the life cycle, whether by challenging accepted best practices or by better understanding pressures that prevent good data management. The presentation will address our data management practices during the first phase of the project, from study conception to the finalization of a lengthy paper questionnaire. Of particular interest is the aspect of documentation, and how our experiences of good - and bad - data management practices can benefit the larger community.
First forays into research data dissemination: a tale from the Kansas City Fed
- Presenters: San Cannon and Deng Pan, Federal Reserve Bank of Kansas City
- Abstract: The Federal Reserve System has a long tradition of doing economic research; each of the 12 Reserve Banks and the Board of Governors have research departments that together publish more than 400 working papers and journal articles annually. Unfortunately, there has never been a tradition of regularly making the data from those papers publicly available. A pilot program being undertaken by the Research Division at the Federal Reserve Bank of Kansas City aims to make such data available for research reuse. Working as a pilot participant for a new dissemination platform, we have had to educate economists, build metadata specifications, recruit contributors, collaborate with technology and legal staff, and coordinate and build coalitions across multiple functions at our institution and others. This presentation will outline the challenges faced and obstacles overcome as we worked to create the infrastructure and workflow, as well as starting the paradigm shift needed, to make research data publication a regular part of the research life cycle.
A3: Enabling Public Use of Public Data
- Time: 10:30 - 12:00
- Location: Blegen Hall 135
- Chair: Florio Arguillas
- Track: Data Infrastructure and Applications
Enhancing Dissemination of Statistical Information in Uganda by Uganda Bureau of Statistics
- Presenter: Winny Akullo, Public Procurement and Disposal of Public Assets Authority
- Abstract: A quantitative study was carried out to investigate ways of enhancing dissemination of statistical information in Uganda. The purpose of the study was to seek ways to enhance the dissemination of statistical information by Uganda Bureau of Statistics. The objectives were: to establish the extent to which statistical information is available in the country; establish the challenges UBOS faces in disseminating statistical information; establish the users' level of satisfaction in accessing statistical information; identify challenges users face in accessing statistical information; and propose strategies for enhancing dissemination of statistical information. 119 users of statistical information participated in this study and 17 UBOS staff were purposely selected because they are charged with dissemination of statistical information. Questionnaires and interviews were used to collect data. Data was analyzed using Microsoft Excel and presented in form of frequencies, figures and tables. The study established that the UBOS website was one of main channels used for disseminating statistical information, however, it is inaccessible. Worse still, most of the publications disseminated cannot be found on the website information. Among the recommendations, UBOS should use as many available channels in disseminating statistical information in multiple formats and languages and establish regional resource centers. to increase accessibility.
And Data for all
- Presenter: Mark Mitchell, University of Glasgow
- Abstract: This paper will discuss the experiences of the UBDC at the University of Glasgow in supplying "Open" data generated by the City of Glasgow council to public and academic users. Areas covered in this paper will include the standardisation of metadata, data linkage and visualisation services supplied to both communities and will include real world use cases of how the data has been used and how outreach to the public, specifically, was under taken. In addition to this, the paper will also touch upon the utilisation of additional visualisation techniques such as a fully geo-referenced Minecraft Map of the UK to enable users to interpret data in a different manner from more traditional techniques. Finally, the paper will cover additional examples of enabling the public to engage with data and how this growing field is key for academic and non-academic study to understand how better can be utilised to improve the understanding of the Urban environment and how individuals interact with it.
Finding space in an Open Data world
- Presenter: Margherita Ceraolo, UK Data Service
- Abstract: There is a global momentum toward open data, with national governments and IGOs such as the IMF, World Bank and UN embracing and promoting open data. As part of the UK Data Service's commitment to the principle that data which are publicly funded should be publicly accessible, the Service offers an increasing range of open data including UK census data, qualitative data, survey data-sets and international macrodata. In the open data environment, there is a need for the Service to continue to adapt and find innovative ways to improve and enhance the users' data experience. What are the benefits of accessing open data via the UK Data Service? With this question in mind, we have begun to explore ways in which we can develop our open data offering for the Social Science community. This presentation focuses on the international macrodata provision and illustrates our approach to developing the delivery platform, UKDS.Stat, with the aim of making it an invaluable resource for anyone interested in international socio-economic data. It will describe the specific methods we are considering such as providing APIs, visualisation, integrating social media, and acting as brokers to highlight the impact of our data.
A4: Training Data Users I
- Time: 10:30 - 12:00
- Location: Blegen Hall 150
- Chair: Lynda Kellam
- Track: Data Services Professional Development
Teaching users to work with research data
- Presenter: Sarah King-Hele, UK Data Service, University of Manchester
- Abstract: The UK Data Service is a resource funded to support researchers, students, lecturers and policymakers who depend on high-quality social and economic data. This presentation will discuss the methods we use to teach users about what data we have available and how to get started using the data for research. Our approaches include a rolling series of webinars, face to face presentations and practical workshops. We also provide online materials to help users to get the most out of the service. These training methods allow the users to learn about our range of data and how to use them in a variety of formats so that we can provide support to meet the needs of different kinds of users. We also discuss how this training programme fits in with other methods training and how we can develop it for new kinds of data and a wider range of users.
The Carrot: Outcomes from a Campus-Wide Grant Program for Creating Data-Driven Assignments
- Presenter: Katharin Peter, University of Southern California Libraries
- Abstract: Efforts to embrace big data, data analytics, and data visualization methods often overlook the widespread need to develop foundational data literacy competencies. This presentation will share the results of one university's efforts to promote data literacy through a competitive, campus-wide grant program for faculty implementing data-driven assignments in undergraduate courses. As part of the grant program, 12 faculty from a variety of disciplines received support from instructional designers and data librarians to develop and implement data-driven assignments in support of their course learning outcomes. This presentation will discuss the outcomes of the grant program as well as opportunities and strategies for promoting and supporting the creation of data-driven assignments.
Sustainability of Social Science Data Archives: A Historical Network Perspective
- Presenters: Kristin R. Eschenfelder (University of Wisconsin-Madison), Kalpana Shankar (University College Dublin), Greg Downey (UW-Madison), Rebecca Lin (UW-Madison)
- Abstract: This paper will summarize preliminary results from a study to analyze the history of sustainability in social science data archives (SSDA). The purpose of the study is to draw out what sustainability challenges SSDA have faced and what strategies they have employed to remain sustainable and relevant given massive changes in technologies, users, data types, revenue sources and data markets. The paper will summarize historical analysis of documentation from the 1960s to the early 2000s from ICPSR, the UK Data Archive, the Roper Center for Public Opinion Research. The paper will also include a historical network analysis of interaction among SSDA as represented in full run of IASSIST Quarterly articles from 1960 to the early 2000s. The project's broader goal is to understand how the history of SSDAs can contribute to current conversations on the long term sustainability of other knowledge infrastructures. To this end, the project seeks to address the successes and failures that SSDA experienced in trying to remain relevant and funded, and the longitudinal changes in relationships among SSDA as they have collaborated and competed to support research in the social sciences.
A5: RDM Services
- Time: 10:30 - 12:00
- Location: Blegen Hall 120
- Chair: Laurence Horton
- Track: Research Data Management
The Stakeholder Analysis for The Research Data Management Services for the Public Policy researchers
- Presenter: Jungwon Yang, University of Michigan
- Abstract: Since the National Science Foundation now requires a data management plan for proposed grant applications after January 18, 2011, many academic libraries have started to develop research data management service. One of the emerging issues related to the new service is the role of liaison librarian. Articles have noted that to enhance scholarly productivity, liaison librarians need to participate in the entire life-cycle of the research. Liaison librarians also need to be the team builder among library experts for an effective data management service. Yet, it is not clear how a liaison librarian can identify faculty's needs and who need to be the library team for a faculty's research data management, since the researchers' knowledge of data management varies across their personal experience as well as academic discipline. Moreover, the topic and scope of research will highly affect the decision of which library experts will be needed for the research's data management. Given these circumstances, the stakeholder analyses for the faculty will be useful for determining the scope and degree of library service. I will report how the stakeholder analysis help me to customized data management service for the Public Policy faculty at the University of Michigan.
Developing Research Data Services Vision(s): An Analysis of North American Academic Libraries
- Presenters: Inna Kouper (Indiana U), Mayu Ishida (University of Manitoba), Kathleen Fear (University of Rochester), Sarah Williams (University of Illinois at Urbana Champaign), and Christine Kollen (University of Arizona)
- Abstract: Many libraries are implementing or getting ready to implement research data services (RDS) (see, for example, http://www.acrl.ala.org/acrlinsider/archives/6297). Oftentimes, these initiatives are reactive, responding to pressures originating outside the library, such as national or funder mandates for data management planning and data sharing. To provide effective support for researchers, libraries must be proactive and develop a shared vision of what they are trying to accomplish. Can such a vision supersede institutional differences while still accommodating diversity in implementation? In this presentation we discuss a set of vision statements grounded in an analysis of the drivers of RDS vision as well as libraries' current goals and activities in RDS. We developed these statements based on our examination of documents that advance the need for RDS, such as the funding agencies' requirements, the US Office of Science and Technology Policy memo, and the Canadian Tri-Agency's proposal of a data management plan mandate; a content analysis of North American academic library webpages; and interviews with library deans and other administrators. Finally, we describe how our five institutions are responding to this vision and how our implementations of the vision vary depending on the disciplinary and institutional context.
A Coordinated, Decentralized Approach to Data Management Services: From Education to Everyday
- Presenters: Jon Jeffryes, Alice Motes, Amy Neeser, and Amy West (University of Minnesota)
- Abstract: We will describe the strategies, methods, and outcomes of the University of Minnesota Libraries’ coordinated, decentralized approach to providing data management education to library staff and users. This presentation outlines the educational challenges in navigating organizational structures, disciplinary commonalities/differences, staff training, and researcher training in a large research institution.
Staff are increasingly engaging in data management activities across the libraries system, including collaborative workgroups focused on data management, liaisons’ work with data producers, and library staff’s own research data needs. We will discuss how to coordinate these diffused activities without stifling flexibility or creativity and how to incorporate these practices into routine work. One challenge facing libraries has been disciplinary differences regarding data management and sharing practices. We will discuss what strategies and methods can address the commonalities and disciplinary differences of researchers’ needs.
Finally, we will discuss our approach to staff and researcher training as our data management services have grown and developed. We use scenario-based exercises, webinars, and user-facing workshops to incorporate this into everyday library work to better serve the research community. Through staff education, we are building capacity to train our researchers through workshops, data management consultations, and comprehensive data management plans.