Data Analysis Challenge

Throughout this course we will work with a variety of datasets as part of a larger data analysis task. This task (explained in the first tutorial) centers on scientific publications from the IEEE Visualization conference. The full dataset is available here: http://vispubdata.org.

Throughout the class we will work on a number of challenges related to this dataset. The first challenge concerns gender equality, an important issue in science as well as in most businesses and enterprises. To analyze questions of gender equality, we will need to be able to identify genders from the author names of scientific publications.

Your Task

  • Form a group of 4 students (we will have done this in class; if you weren't there, search for a team on Slack)