Data Analysis in Social Science 3
Module title | Data Analysis in Social Science 3 |
---|---|
Module code | SSI2007 |
Academic year | 2021/2 |
Credits | 15 |
Module staff | Dr Alexey Bessudnov (Convenor) |
Duration: Term | 1 | 2 | 3 |
---|---|---|---|
Duration: Weeks | 11 |
Number students taking module (anticipated) | 60 |
---|
Module description
Basic knowledge of statistics and data analysis is often not enough for dealing with more complicated problems in the social
sciences, as well as in market research, applied policy analysis, and data-driven journalism. This module introduces you to
more advanced techniques for social data analysis using the statistical programming language R and in particular the tidyverse framework. These techniques are especially useful while working with large and 'messy' data sets. While some statistical theory is covered in this module, the discussion of statistical concepts is generally non-mathematical and intuitive and is based on numerous examples from social sciences. The module assumes familiarity with basic descriptive statistics and linear regression analysis.
SSI1006 (POL/SOC1041) and SSI2005 (POL/SOC2077) are the pre-requisites for this module.
Module aims - intentions of the module
The aim of this module is to introduce you to more advanced quantitative techniques for the analysis of social data. More
specifically, you will learn how to clean, transform, reshape and visualise data in R, a statistical programming language, and
tidyverse, a collection of tidyverse packages. You will also learn the fundamentals of programming in R, such as conditional
statements, loops and functions. After completing this module, you will be able to independently conduct data analysis in R.
Employers in many industries value this skill.
Intended Learning Outcomes (ILOs)
ILO: Module-specific skills
On successfully completing the module you will be able to...
- 1. clean and prepare your data for statistical analysis in R;
- 2. conduct statistical analysis using selected methods at the intermediate level in R;
ILO: Discipline-specific skills
On successfully completing the module you will be able to...
- 3. apply statistical data analysis techniques to social science problems;
- 4. clearly explain the results of statistical analysis in substantive terms and relate them to substantive social science problems;
ILO: Personal and key skills
On successfully completing the module you will be able to...
- 5. report the results of statistical analysis in writing in a way that would be understood by non-specialists; and
- 6. use general-purpose statistical software (such as R) for the analysis of social data at the intermediate level.
Syllabus plan
Whilst the module’s precise content will vary from year to year, it is envisaged that the syllabus will cover some of the following themes:
Data types and structures in R
Data import with readr and data.table
Data manipulation with dplyr
Data visualisation with ggplot2
Iteration
Functions
Reproducible research and effective presentation of statistical results
Learning activities and teaching methods (given in hours of study time)
Scheduled Learning and Teaching Activities | Guided independent study | Placement / study abroad |
---|---|---|
22 | 128 | 0 |
Details of learning activities and teaching methods
Category | Hours of study time | Description |
---|---|---|
Scheduled learning and teaching activity | 22 | 11 x 2 hour lectures / computer lab sessions |
Guided independent study | 78 | Reading and preparation for lectures and lab sessions |
Guided independent study | 50 | Data analysis and writing of the statistical report |
Formative assessment
Form of assessment | Size of the assessment (eg length / duration) | ILOs assessed | Feedback method |
---|---|---|---|
Github assignments | 2 Github assignments | 1-6 | Written feedback via Github |
Mock ELE test | 5 questions on ELE (about 30 minutes) | 1-6 | ELE feedback |
Summative assessment (% of credit)
Coursework | Written exams | Practical exams |
---|---|---|
100 | 0 | 0 |
Details of summative assessment
Form of assessment | % of credit | Size of the assessment (eg length / duration) | ILOs assessed | Feedback method |
---|---|---|---|---|
ELE test at the end of the module | 50 | ELE test (90 minutes) | 1-6 | ELE feedback |
Final statistical report | 50 | 2000 words | 1-6 | Written feedback |
Details of re-assessment (where required by referral or deferral)
Original form of assessment | Form of re-assessment | ILOs re-assessed | Timescale for re-assessment |
---|---|---|---|
Final statistical report | Statistical report (2000 words) | 1-6 | August/September reassessment period |
ELE test | ELE test (90 minutes) | 1-6 | August/September reassessment period |
Indicative learning resources - Basic reading
G.Grolemund, H.Wickham. R for Data Science. O’Reilly (2017). Available at https://r4ds.had.co.nz/
W.Chang, R Graphics Cookbook, 2nd ed., O’Reilly (2019). Available at https://r-graphics.org/
Credit value | 15 |
---|---|
Module ECTS | 7.5 |
Module pre-requisites | SSI1006 (POL/SOC1041) and SSI2005 (POL/SOC2077) |
Module co-requisites | SSI2005 (POL/SOC2077) if not taken before |
NQF level (module) | 5 |
Available as distance learning? | No |
Origin date | 11/12/2019 |
Last revision date | 17/12/2019 |