Home
Articles
Data and methods

Data and methods

Data, and the methods used to analyze them, are the foundation for evidence-based research. Articles in this subject area discuss the value of different types of data collection, and explain important statistical and econometric methods that provide ways to summarize and present information, and to identify and quantify correlation or causality.

- Program evaluation
- Data and methods
Counting on count data models

Quantitative policy evaluation can benefit from a rich set of econometric methods for analyzing count data

Rainer Winkelmann, May 2015

Often, economic policies are directed toward outcomes that are measured as counts. Examples of economic variables that use a basic counting scale are number of children as an indicator of fertility, number of doctor visits as an indicator of health care demand, and number of days absent from work as an indicator of employee shirking. Several econometric methods are available for analyzing such data, including the Poisson and negative binomial models. They can provide useful insights that cannot be obtained from standard linear regression models. Estimation and interpretation are illustrated in two empirical examples.
MoreLess
Measuring disincentives to formal work

Does formal work pay? Synthetic measurements of taxes and benefits can help identify incentives and disincentives to formal work

Michael Weber, December 2015

Evidence from transition economies shows that formal work may not pay, particularly for low-wage earners. Synthetic measurements of work disincentives, such as the formalization tax rate or the marginal effective tax rate, confirm a significant positive correlation between these measurements and the probability of informal work. These measures are especially informative for impacts at lower wage levels, where informality is highest. Policymakers who want to increase formal work can use these measurements to determine optimal labor taxation rates for low-wage earners and reform benefit design.
MoreLess
Measuring employment and unemployment

Should statistical criteria for measuring employment and unemployment be re-examined?

Andrea BrandoliniEliana Viviano, August 2018

Measuring employment and unemployment is essential for economic policy. Internationally agreed measures (e.g. headcount employment and unemployment rates based on standard definitions) enhance comparability across time and space, but changes in real labor markets and policy agendas challenge these traditional conventions. Boundaries between different labor market states are blurred, complicating identification. Individual experiences in each state may vary considerably, highlighting the importance of how each employed or unemployed person is weighted in statistical indices.
MoreLess
- Behavioral and personnel economics
- Data and methods
Can lab experiments help design personnel policies?

Employers can use laboratory experiments to structure payment policies and incentive schemes

Marie Claire Villeval, November 2016

Can a company attract a different type of employee by changing its compensation scheme? Is it sufficient to pay more to increase employees’ motivation? Should a firm provide evaluation feedback to employees based on their absolute or their relative performance? Laboratory experiments can help address these questions by identifying the causal impact of variations in personnel policy on employees’ productivity and mobility. Although they are collected in an artificial environment, the qualitative external validity of findings from the lab is now well recognized.
MoreLess
- Behavioral and personnel economics
- Data and methods
Relative deprivation in the labor market

The choice of reference group crucially determines subjective deprivation and thus affects labor market behavior

Paolo Verme, June 2017

Why do different population groups (e.g. rural vs. urban, youth vs. elderly and men vs. women) experience the same objective labor status differently? One hypothesis is that people are more concerned with relative deprivation than objective deprivation and they value their own status relative to the status of their peers—the reference group. One way to test this hypothesis in the labor market is to measure individual differences in labor status while controlling for characteristics that define population groups. This measure is called “relative labor deprivation” and can help policymakers to better understand how labor claims are generated.
MoreLess
- Data and methods
Using linear regression to establish empirical relationships

Linear regression is a powerful tool for estimating the relationship between one variable and a set of other variables

Marno Verbeek, February 2017

Linear regression is a powerful tool for investigating the relationships between multiple variables by relating one variable to a set of variables. It can identify the effect of one variable while adjusting for other observable differences. For example, it can analyze how wages relate to gender, after controlling for differences in background characteristics such as education and experience. A linear regression model is typically estimated by ordinary least squares, which minimizes the differences between the observed sample values and the fitted values from the model. Multiple tools are available to evaluate the model.
MoreLess
- Labor markets and institutions
- Data and methods
Statistical profiling of unemployed jobseekers

The increasing availability of big data allows for the profiling of unemployed jobseekers via statistical models

Bert van Landeghem Sam Desiere Ludo Struyven , February 2021

Statistical models can help public employment services to identify factors associated with long-term unemployment and to identify at-risk groups. Such profiling models will likely become more prominent as increasing availability of big data combined with new machine learning techniques improve their predictive power. However, to achieve the best results, a continuous dialogue between data analysts, policymakers, and case workers is key. Indeed, when developing and implementing such tools, normative decisions are required. Profiling practices can misclassify many individuals, and they can reinforce but also prevent existing patterns of discrimination.
MoreLess
- Migration and ethnicity
- Data and methods
The use of natural experiments in migration research

Data on rapid, unexpected refugee flows can credibly identify the impact of migration on native workers’ labor market outcomes

Semih Tumen, October 2015

Estimating the causal effect of immigration on the labor market outcomes of native workers has been a major concern in the literature. Because immigrants decide whether and where to migrate, immigrant populations generally consist of individuals with characteristics that differ from those of a randomly selected sample. One solution is to focus on events such as civil wars and natural catastrophes that generate rapid and unexpected flows of refugees into a country unrelated to their personal characteristics, location, and employment preferences. These “natural experiments” yield estimates that find small negative effects on native workers’ employment but not on wages.
MoreLess
- Labor markets and institutions
- Data and methods
Measuring income inequality

Summary measures of inequality differ from one another and give different pictures of the evolution of economic inequality over time

Ija Trapeznikova, July 2019

Economists use various metrics for measuring income inequality. Here, the most commonly used measures—the Lorenz curve, the Gini coefficient, decile ratios, the Palma ratio, and the Theil index—are discussed in relation to their benefits and limitations. Equally important is the choice of what to measure: pre-tax and after-tax income, consumption, and wealth are useful indicators; and different sources of income such as wages, capital gains, taxes, and benefits can be examined. Understanding the dimensions of economic inequality is a key first step toward choosing the right policies to address it.
MoreLess
- Data and methods
The importance of measuring dispersion in firm-level outcomes

Ignoring the large variation in firm-level outcomes can create misunderstandings about the consequences of many policies

Chad Syverson, May 2014

Recent research has revealed enormous variation in performance and growth among firms, which both drives and is driven by large reallocations of inputs and outputs across firms (churning) within industries and markets. These differences in firm-level outcomes and the associated turnover of firms affect many economic policies (both labor- and non-labor-oriented), on both a microeconomic and a macroeconomic scale, and are affected by them. Properly evaluating these policies requires familiarity with the sources and consequences of firm-level variation and within-industry reallocation.
MoreLess

Data and methods

Subject Editor

Quantitative policy evaluation can benefit from a rich set of econometric methods for analyzing count data

Does formal work pay? Synthetic measurements of taxes and benefits can help identify incentives and disincentives to formal work

Should statistical criteria for measuring employment and unemployment be re-examined?

Employers can use laboratory experiments to structure payment policies and incentive schemes

The choice of reference group crucially determines subjective deprivation and thus affects labor market behavior

Linear regression is a powerful tool for estimating the relationship between one variable and a set of other variables

The increasing availability of big data allows for the profiling of unemployed jobseekers via statistical models

Data on rapid, unexpected refugee flows can credibly identify the impact of migration on native workers’ labor market outcomes

Summary measures of inequality differ from one another and give different pictures of the evolution of economic inequality over time

Ignoring the large variation in firm-level outcomes can create misunderstandings about the consequences of many policies