Comprehensive Data Management Planning & Services

2019 Day of Data Agenda

Cornell Day of Data 2019.  Any person. Any study. Any data.

Register today, space is limited!

* Please note, registration preference will be given to members of the Cornell community. If space does become an issue, we encourage you to join us remotely (Zoom details to follow, closer to time of event).

Day of Data Agenda

Time Event
9:00 - 9:15 Registration
9:15 - 9:30 Welcome and Opening Remarks
9:30 - 10:30 Keynote Presentation: David Mimno, Associate Professor Information Science
10:30 - 10:45 Break
10:45 - 11:45 Panel: Data Challenges and Solutions
11:45 - 12:00 Lightning Round Introduction to FAIR services fair
12:00 - 1:15 Networking Lunch (Registration Required)
1:15 - 2:15 Afternoon Session / Workshops Block 1 (Registration Required)
2:15 - 2:30 Break
2:30 - 3:30 Afternoon Session / Workshops Block 2 (Registration Required)
3:30 - 3:45 Break
3:45 - 4:45 Afternoon Session / Workshops Block 3 (Registration Required)

Note: Room details will be added as we finalize workshops and spaces

Keynote Speaker

Head and shoulders picture of Professor David Mimno

David Mimno will share his creative and impactful ways of engaging data, and will help learners & researchers from all kinds of backgrounds identify the potential for their data to impact communities, both local and beyond.

David Mimno is an associate professor in the department of Information Science at Cornell University. He holds a PhD from UMass Amherst and was previously the head programmer at the Perseus Project at Tufts and a researcher at Princeton University. His work is supported by the Sloan foundation and the NSF.  His interests include topic modeling, finding links between machine learning/natural language processing and humanistic questions, and collaboration across disciplines.

Learn more about Dr Mimno's work at

Panel: Data Challenges and Solutions

No matter where you are in your research career, there will always be new and different obstacles to overcome, and new and different ways to overcome them! Our morning panel will be a multidisciplinary conversation about how several researchers faced and worked through their own data management challenges.  Join us to learn new perspectives on addressing data-related challenges in your research; there will be time for questions and discussions following the panelists' presentations.

Panelists include:

  • Drew Margolin: Drew is an assistant professor in the Department of Communication. His research focuses on understanding the role that accountability, credibility, and legitimacy within social networks and communities play in shaping observable discourse.
  • Katherine Muller: Katherine is a post-doctoral associate in the Drinkwater lab in the Horticulture Section of the School of Integrative Plant Science. She studies plant-microbe interactions in legume cover crops used to improve soil fertility.
  • Kelsey Utne: Kelsey is a PhD student in the Department of History. Her research focuses on memorialization and public history in late colonial and early postcolonial South Asia, with a particular interest in war memorials, cemeteries, and cremation grounds.

Lunch and Data FAIR

Lunch is provided for registered attendees (registration is available at this link:; everyone is encouraged to wander through a data services FAIR during lunch and learn about services at Cornell to make your data Findable, Accessible, Interoperable and Reusable.

Afternoon Workshops

Afternoon workshops will be organized along 4 tracks: data exploration and analysis; infrastructure; data management tools and practices; and data discovery. We will also have an option for a birds-of-a-feather / informal discussion groups during the final hour. Registration is required for workshops.

Workshop Schedule

Time Data Exploration and Analysis Infrastructure Data Management Tools and Practices Data Discovery
1:15 - 2:15 Data Visualization Big Data Computing   Navigating Demographic Data with the American Census
2:30 - 3:30 Working with Twitter Data Globus Research Record-Keeping Roper Center
3:45 - 4:45 Birds-of-a-Feather Birds-of-a-Feather Protect Your Research Data Data Sharing and Metadata

Workshop descriptions

1:15-2:15 PM

  • Data Visualization Data Visualization is one of the most important ways to communicate your research findings.  Erika Mudrak (CSCU) will present guidelines and best practices for creating graphs and figures that accurately and effectively convey the results of your research and the message you want to tell.
  • Big Data Computing Representatives from IT @ Cornell, the Center for Advanced Computing (CAC) and the Institute for Biotechnology will discuss various options for big data storage and cluster computing available to Cornell researchers.
  • Navigating Demographic Data with the American Census: past, present, future Are you conducting statistical evaluations of US census data? Or, do you need to know how to access, assess and appropriately use demographic data from public sources? Join this workshop with Warren Brown (CISER), Lynda Kellam (CISER), and Amelia Kallaher (Mann Library) to learn about the US Census and how to access data and digital content from the Decennial Censuses, IPUMS, NHGIS, and the new platform.

2:30-3:30 PM

  • Working with Twitter Data Twitter provides a rich global source of social media content.  Melanie Walsh (Information Science) will discuss how to collect and analyze Twitter data with the Python/command line tool "twarc".
  • Globus Globus is a research data management system that provides fast, reliable and secure file transfers and data sharing, as well as publication and discovery functionality.  Ben Trumbore (Cornell Center for Advanced Computing) will provide an overview of Globus, and how to use it.
  • Research Record-keeping Learn basic good practices, and get an introduction to two tools available to you here at Cornell. The Open Science Framework (OSF) is a free, open platform to support research and enable collaboration; Lab Archives is an online electronic lab notebook.   Erica Johns (Mann Library) and Wendy Kozlowski (Cornell Library) will demo the platforms and discuss how they can be used for research record-keeping and project management.
  • Roper Center The Roper Center for Public Opinion Research at Cornell University is the world's largest archive of public opinion data, with a collection of over 24,000 datasets and a database of more than 720,000 US national polling questions and topline results.  Kathleen Weldon (Roper Center) will provide an overview of the archive and the Center's new user portal, including a review of collaborative grant opportunities for Cornell researchers interested in access to the "hidden treasures" of the Roper collection.

3:45-4:45 PM 

  • Protect your Research Data Data protection isn't a secret at Cornell! Meryl Bursic (Cornell Privacy Office) and Elena Goloborodko (CISER) will discuss the basics of information privacy and cybersecurity, and share campus resources available to help protect your research.
  • Data Sharing and Metadata Many funders and publishers are mandating researchers to provide public access to research results. Come learn about current requirements, options for how to make your data publicly accessible, and best practices around metadata to provide alongside your data so it can be found and understood. (Instructor: Wendy Kozlowski, Cornell Library)
  • Birds-of-a-feather / Informal Discussions: Join fellow students, faculty, and staff to discuss topics that are of interest to you! You'll be able to network and exchange ideas, insights, and experiences. To suggest a topic for one of these sessions please use this birds-of-a-feather box note, or suggest ideas on the day of the workshop.

Register today, space is limited!

Have questions? Contact