Aggregate Group Dataset
Whereas EPR focuses on ethnic groups within countries, the Aggregate Group (AG) data define ethnic groups independently of country borders. This makes it possible to examine how geopolitical transformations, such as the collapse of the USSR, have affected ethnic groups over time. The AG data are derived from Version 2018 of the EPR Core, EPR-TEK and GeoEPR datasets. We have made the following adjustments to these datasets:
EPR
- Re-coded so-called “umbrella groups” (see EPR) and groups within the same country that share a TEK linkage as a single group
- Added aggregate group-level variables relating to the political status, territorial size and conflict involvement of the aggregate group as a whole
TEK
- Coded the most relevant TEK linkage for each group
GeoEPR
- Added polygons for “statewide” irrelevant groups, which are not covered by the original GeoEPR dataset (e.g. the Germans or Portuguese)
- Added polygons for groups during periods in which they are coded as politically irrelevant
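The first EPR adjustment can be illustrated with a minimal sketch. The field names (`country`, `group`, `tek_id`) and the sample records are hypothetical and are not taken from the released files. The sketch only shows the idea of collapsing groups within one country that share a TEK linkage into a single group; it does not cover umbrella groups and is not the procedure actually used to build the dataset.

```python
from collections import defaultdict


def merge_by_tek(records):
    """Collapse groups within the same country that share a TEK linkage.

    `records` is an iterable of dicts with the hypothetical keys
    'country', 'group' and 'tek_id' (illustrative names only).
    Returns a dict mapping (country, tek_id) to the sorted list of
    original group names that end up in the same merged group.
    """
    merged = defaultdict(set)
    for rec in records:
        merged[(rec["country"], rec["tek_id"])].add(rec["group"])
    return {key: sorted(names) for key, names in merged.items()}


# Hypothetical sample input, not actual EPR data.
sample = [
    {"country": "A", "group": "Group 1", "tek_id": 10},
    {"country": "A", "group": "Group 2", "tek_id": 10},
    {"country": "B", "group": "Group 1", "tek_id": 10},
]
merged_groups = merge_by_tek(sample)
```

In this toy input, the two groups in country A share a TEK linkage and are merged, while the group in country B stays a separate segment.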
Data
The Aggregate Group-level data consist of the following files:
- EPR-AG_segment_level_dataset.csv: Adjusted group-year dataset containing AG-level variables. Contains yearly observations for each “segment” that belongs to an aggregate group (e.g. Russians in Russia, Ukraine, Kazakhstan, ...)
- EPR-AG_ag_level_dataset.csv: Aggregate group-year dataset. Contains one observation per year for each aggregate group (e.g. Russians across all of Eurasia)
- TEK-AG.csv: List of the most relevant TEK linkage for each EPR group
- GeoEPR-AG.geojson: Combined polygons of each aggregate group over time
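As a starting point, the four files listed above could be read with the Python standard library as sketched below. The file names come from the list above; the function name `load_ag_data`, the dictionary keys, and the assumption that the files are UTF-8 encoded and sit together in one directory are illustrative choices, not part of the dataset documentation.

```python
import csv
import json
from pathlib import Path

# File names as listed above; the dictionary keys are illustrative.
CSV_FILES = {
    "segments": "EPR-AG_segment_level_dataset.csv",
    "aggregate_groups": "EPR-AG_ag_level_dataset.csv",
    "tek": "TEK-AG.csv",
}
GEOJSON_FILE = "GeoEPR-AG.geojson"


def load_ag_data(data_dir):
    """Read the three CSV files and the GeoJSON file from `data_dir`.

    Each CSV is returned as a list of dicts (one per row, all values
    as strings); the GeoJSON file is returned as a parsed dict.
    Assumes UTF-8 encoding, which is not confirmed by the documentation.
    """
    data_dir = Path(data_dir)
    tables = {}
    for key, filename in CSV_FILES.items():
        with open(data_dir / filename, newline="", encoding="utf-8") as fh:
            tables[key] = list(csv.DictReader(fh))
    with open(data_dir / GEOJSON_FILE, encoding="utf-8") as fh:
        tables["polygons"] = json.load(fh)
    return tables
```

A call such as `load_ag_data("path/to/download")` would then return one entry per file. Column names are not listed here, so they should be inspected after loading rather than assumed.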