GitHub for Data Analysis – September 2021

Event Phone: 1-610-715-0115

We're sorry, but all tickets sales have ended because the event is expired.

There are no upcoming dates for this event.


Cancellation Policy: If you cancel your registration at least two weeks before the course is scheduled to begin, you are entitled to a full refund (minus a processing fee of $50).
In the unlikely event that Statistical Horizons LLC must cancel a seminar, we will do our best to inform you as soon as possible of the cancellation. You would then have the option of receiving a full refund of the seminar fee or a credit towards another seminar. In no event shall Statistical Horizons LLC be liable for any incidental or consequential damages that you may incur because of the cancellation.
A 3-Day Remote Seminar Taught by Aaron Gullickson, Ph.D.

Git is a free, open-source distributed version control system that is used by programmers and data analysts to track project progress efficiently, code without fear of error, and collaborate sanely. Although version control was originally developed for software development, data scientists have adopted its use to facilitate efficient project management and to easily disseminate research materials (such as code) to broader communities.

GitHub, a website that provides online open-access git repositories, has emerged as a leading choice for data analysts and researchers seeking to collaborate and share projects using git. GitHub provides a variety of additional features and workflows that improve the experience of using git.

This seminar will familiarize you with using git through GitHub and demonstrate how to integrate GitHub into a research workflow. The seminar will focus on teaching you the basic git workflow and how to use git and GitHub to simplify research collaboration.

The course will introduce you to the basic workflow of git including how to commit, push, and pull changes to underlying research material and how to create and clone repositories through GitHub. The course will also show you how to create separate branches of code for saner collaboration and how to merge branches using GitHub pull requests.

You will learn both command line tools for working with git and several tools for using git through a GUI interface. We will use RStudio to demonstrate how to use git to manage a project, but the principles of using git apply broadly to any statistical software package that uses scripting.

The seminar will be very hands on and you will learn how to create and manage your own remote repositories through GitHub. You are welcome to bring projects to the course for which you would like to construct GitHub repositories.

Venue: