Data and methods

Data, and the methods used to analyze them, are the foundation for evidence-based research. Articles in this subject area discuss the value of different types of data collection, and explain important statistical and econometric methods that provide ways to summarize and present information, and to identify and quantify correlation or causality.

  • Counting on count data models

    Quantitative policy evaluation can benefit from a rich set of econometric methods for analyzing count data

    Rainer Winkelmann, May 2015
    Often, economic policies are directed toward outcomes that are measured as counts. Examples of economic variables that use a basic counting scale are number of children as an indicator of fertility, number of doctor visits as an indicator of health care demand, and number of days absent from work as an indicator of employee shirking. Several econometric methods are available for analyzing such data, including the Poisson and negative binomial models. They can provide useful insights that cannot be obtained from standard linear regression models. Estimation and interpretation are illustrated in two empirical examples.
    MoreLess
  • Measuring disincentives to formal work

    Does formal work pay? Synthetic measurements of taxes and benefits can help identify incentives and disincentives to formal work

    Michael Weber, December 2015
    Evidence from transition economies shows that formal work may not pay, particularly for low-wage earners. Synthetic measurements of work disincentives, such as the formalization tax rate or the marginal effective tax rate, confirm a significant positive correlation between these measurements and the probability of informal work. These measures are especially informative for impacts at lower wage levels, where informality is highest. Policymakers who want to increase formal work can use these measurements to determine optimal labor taxation rates for low-wage earners and reform benefit design.
    MoreLess
  • Measuring employment and unemployment

    Should statistical criteria for measuring employment and unemployment be re-examined?

    Measuring employment and unemployment is essential for economic policy. Internationally agreed measures (e.g. headcount employment and unemployment rates based on standard definitions) enhance comparability across time and space, but changes in real labor markets and policy agendas challenge these traditional conventions. Boundaries between different labor market states are blurred, complicating identification. Individual experiences in each state may vary considerably, highlighting the importance of how each employed or unemployed person is weighted in statistical indices.
    MoreLess
  • Can lab experiments help design personnel policies?

    Employers can use laboratory experiments to structure payment policies and incentive schemes

    Marie Claire Villeval, November 2016
    Can a company attract a different type of employee by changing its compensation scheme? Is it sufficient to pay more to increase employees’ motivation? Should a firm provide evaluation feedback to employees based on their absolute or their relative performance? Laboratory experiments can help address these questions by identifying the causal impact of variations in personnel policy on employees’ productivity and mobility. Although they are collected in an artificial environment, the qualitative external validity of findings from the lab is now well recognized.
    MoreLess
  • Relative deprivation in the labor market

    The choice of reference group crucially determines subjective deprivation and thus affects labor market behavior

    Paolo Verme, June 2017
    Why do different population groups (e.g. rural vs. urban, youth vs. elderly and men vs. women) experience the same objective labor status differently? One hypothesis is that people are more concerned with relative deprivation than objective deprivation and they value their own status relative to the status of their peers—the reference group. One way to test this hypothesis in the labor market is to measure individual differences in labor status while controlling for characteristics that define population groups. This measure is called “relative labor deprivation” and can help policymakers to better understand how labor claims are generated.
    MoreLess
  • Using linear regression to establish empirical relationships

    Linear regression is a powerful tool for estimating the relationship between one variable and a set of other variables

    Marno Verbeek, February 2017
    Linear regression is a powerful tool for investigating the relationships between multiple variables by relating one variable to a set of variables. It can identify the effect of one variable while adjusting for other observable differences. For example, it can analyze how wages relate to gender, after controlling for differences in background characteristics such as education and experience. A linear regression model is typically estimated by ordinary least squares, which minimizes the differences between the observed sample values and the fitted values from the model. Multiple tools are available to evaluate the model.
    MoreLess
  • Statistical profiling of unemployed jobseekers

    The increasing availability of big data allows for the profiling of unemployed jobseekers via statistical models

    Statistical models can help public employment services to identify factors associated with long-term unemployment and to identify at-risk groups. Such profiling models will likely become more prominent as increasing availability of big data combined with new machine learning techniques improve their predictive power. However, to achieve the best results, a continuous dialogue between data analysts, policymakers, and case workers is key. Indeed, when developing and implementing such tools, normative decisions are required. Profiling practices can misclassify many individuals, and they can reinforce but also prevent existing patterns of discrimination.
    MoreLess
  • The use of natural experiments in migration research

    Data on rapid, unexpected refugee flows can credibly identify the impact of migration on native workers’ labor market outcomes

    Semih Tumen, October 2015
    Estimating the causal effect of immigration on the labor market outcomes of native workers has been a major concern in the literature. Because immigrants decide whether and where to migrate, immigrant populations generally consist of individuals with characteristics that differ from those of a randomly selected sample. One solution is to focus on events such as civil wars and natural catastrophes that generate rapid and unexpected flows of refugees into a country unrelated to their personal characteristics, location, and employment preferences. These “natural experiments” yield estimates that find small negative effects on native workers’ employment but not on wages.
    MoreLess
  • Measuring income inequality

    Summary measures of inequality differ from one another and give different pictures of the evolution of economic inequality over time

    Ija Trapeznikova, July 2019
    Economists use various metrics for measuring income inequality. Here, the most commonly used measures—the Lorenz curve, the Gini coefficient, decile ratios, the Palma ratio, and the Theil index—are discussed in relation to their benefits and limitations. Equally important is the choice of what to measure: pre-tax and after-tax income, consumption, and wealth are useful indicators; and different sources of income such as wages, capital gains, taxes, and benefits can be examined. Understanding the dimensions of economic inequality is a key first step toward choosing the right policies to address it.
    MoreLess
  • The importance of measuring dispersion in firm-level outcomes

    Ignoring the large variation in firm-level outcomes can create misunderstandings about the consequences of many policies

    Chad Syverson, May 2014
    Recent research has revealed enormous variation in performance and growth among firms, which both drives and is driven by large reallocations of inputs and outputs across firms (churning) within industries and markets. These differences in firm-level outcomes and the associated turnover of firms affect many economic policies (both labor- and non-labor-oriented), on both a microeconomic and a macroeconomic scale, and are affected by them. Properly evaluating these policies requires familiarity with the sources and consequences of firm-level variation and within-industry reallocation.
    MoreLess
show more