La colección de Documentos de Trabajo del INE tiene como objetivo la difusión de trabajos originales de investigación relacionados con la actividad propia de una oficina de estadística y desarrollados por el personal de INE con la posible colaboración de investigadores de otras instituciones. 
Los trabajos están orientados al desarrollo de aspectos metodológicos o al análisis y estudio de los resultados de las operaciones estadísticas oficiales desde distintas perspectivas.

The collection of INE Working Papers is intended for disseminating original pieces of research related to the activity performed by a statistics office and developed by INE personnel with the possibility of collaboration of researchers from other institutions.
Work is geared towards developing methodological aspects or analysing and studying the results of official statistical transactions from different perspectives.

  • Early Estimates of the Industrial Turnover Index using Statistical Learning AlgorithmsS. Barragán, L. Barreñada, J.F. Calatrava, J.C. Gálvez Sáenz de Cueto,J.M. Martín del Moral, E. Rosa-Pérez and D. Salgado
    • Doc.
      03/2022
      Resumen / Abstract
      We use statistical learning algorithms to improve timeliness of the Spanish Industrial Turnover Index. The main idea is to use a gradient boosting algorithm to make a prediction for every single industrial turnover value not yet collected during the data collection, data editing and estimation phases. Regressors are constructed from the historical unit-level time series, current aggregated turnover moments and quantiles, and aggregated values of related industrial surveys. Accuracy indicators are also computed so that a quantitative trade-off between accuracy and timeliness can be appraised. This mass imputation exercise provides us with a nowcasting proposal which can be readily extended to many similar design-based surveys.
      Palabras clave / Key words
      Machine Learning, Statistical Learning, Industrial Turnover Index, timeliness improvement, missing data imputation
      Documento / Document
  • Sistema de Identificación para unidades estadísticas complejasEsteban Barbado Miguel, Pedro García Segador, Pilar Montero Robles, Valentín Llorente García y Miriam Hernandez Valencia
    • Doc.
      01/2022
      Resumen / Abstract
      En los últimos años, la arquitectura del Directorio Central de Empresas (DIRCE) ha incorporado a los grupos empresariales y las empresas como nuevas unidades estadísticas que son consistentes con el resto de las que ya estaban incluidas en el DIRCE como unidades legales y locales. A nivel global, los grupos de empresas no suelen estar dotados de personalidad jurídica. Tampoco lo son las empresas compuestas por agrupaciones de unidades legales siguiendo métodos puramente estadísticos. Esto implica la no disponibilidad de un identificador legal. Esta situación limita los análisis de la vida de estas unidades a lo largo del tiempo y afectan especialmente a la producción de Indicadores de Demografía Empresarial. Es necesario desarrollar nuevos procesos orientados al diseño de un Sistema de Identificación robusto para unidades estadísticas complejas. En este documento se describe la construcción de este identificador y la gestión dinámica a lo largo del tiempo de identificadores para gruposempresariales y empresas operando en su seno.
      Palabras clave / Key words
      Unidades estadísticas, grupos empresariales, empresas, identificador, seguimiento a lo largo del tiempo
      Documento / Document
  • Propuesta para la Elaboración de un Indicador de Calidad de Vida UrbanaAlex Costa, Antonio Argüeso, Dolors Cotrina y Sergio Porcel
    • Doc.
      02/2022
      Resumen / Abstract

      Versión actualizada (abril 2023): Documento

      Versión anterior (abril 2022): Documento

      La medición multidimensional de la calidad de vida urbana es un objetivo relacionado con tres tendencias de la estadística oficial de las últimas décadas: aproximar el bienestar de la población, acercarse al territorio, hasta llegar a la realidad urbana, y utilizar datos de origen administrativo. Desarrollar una estadística de esta naturaleza supone aumentar la calidad del sistema estadístico, porque genera una información directamente relevante para el diseño y evaluación de las políticas públicas y porque, además, lo hace en un contexto de eficiencia, por el hecho de trabajar, necesariamente si la referencia es de nivel municipal, con registros administrativos.

      Palabras clave / Key words
      indicador de calidad de vida urbana, estandares de la poblacion, entorno urbano, oferta urbana, retos urbanos
      Documento / Document
  • On new data sources for the production of official statisticsD. Salgado and B. Oancea
    • Doc.
      01/2020
      Resumen / Abstract
      In the past years we have witnessed the rise of new data sources for the potential production of official statistics, which, by and large, can be classified as survey, administrative, and digital data. Apart from the differences in their generation and collection, we claim that their lack of statistical metadata, their economic value, and their lack of ownership by data holders pose several entangled challenges lurking the incorporation of new data into the routinely production of official statistics. We argue that every challenge must be duly overcome in the international community to bring new statistical products based on these sources. These challenges can be naturally classified into different entangled issues regarding access to data, statistical methodology, quality, information technologies, and management. We identify the most relevant to be necessarily tackled before new data sources can be definitively considered fully incorporated into the production of official statistics.
      Palabras clave / Key words
      Digital data, administrative data, Big Data, official statistical production
      Documento / Document
  • The ESA 2010 pension table: An integrated view on the functioning of pension systems in SpainSixto Muriel de la Riva, Carlos J. Valero Rodríguez, Andrés García Carreira
    • Doc.
      01/2019
      Resumen / Abstract
      The inexorable impact of the population ageing, the peculiarities of pay-as-you-go pensionschemes of public systems and the increasing role played by private systems in developedsocieties emphasize the need of a harmonized measure of accrued ¿to date pension rights andobligations in them as one the main priorities for the statistical systems. Current national accountsstandards (SNA 2008 and ESA 2010) already include guidelines for the registration in their systemsof all employment related private pension obligations/rights regardless of whether they aresystems with or without constitution of reserves. In addition, they propose the recording of allpension schemes, including contingent obligations/rights accrued in public systems in asupplementary table. The supplementary table on accrued-to-date pension entitlements in socialinsurance will allow us to see the evolution of all pension rights stocks and the flows that motivatetheir variations, regardless the fact they are non-contingent financial assets/liabilities for thehouseholds/pension managers or not. Both the objectives and data compiled in the table presentobvious conceptual difficulties and require a high level of expert knowledge in the financial,insurance and actuarial fields. Thus, in the Spanish case, the close collaboration with externalagencies from various areas has been a basic component of the project, as a clear example ofinter-institutional cooperation towards the highest standards of quality in official statistics. Inaddition, a highly flexible and adaptable SAS® software (PensINE) has been developed by INE forthe actuarial estimation of accrued to date pension obligations/rights in public defined benefitschemes, which brings together a large part of the fruits of this collaboration. Finally, a didacticdissemination of the pension tables results as a tool for analysing the functioning of nationalpensions systems but not as a measure of their future sustainability is a challenging issue that theEuropean Statistical System and other international organizations face nowadays.
      Palabras clave / Key words
      Pensions, national accounts, ageing
      Documento / Document
  • Data organisation and process design based on functional modularity for a standard production processE. Esteban, M. Novás, S. Saldaña, D. Salgado, L.
    • Doc.
      01/2018
      Resumen / Abstract
      We propose to use the principles of functional modularity to cope with the essentialcomplexity of statistical production processes. Moving up in the direction of internationalstatistical production standards (GSBPM and GSIM), data organisation and processdesign under a combination of object-oriented and functional computing paradigms areproposed. The former comprises a standardised key-value pair abstract data model wherekeys are constructed by means of the structural statistical metadata of the productionsystem. The latter makes a profuse usage of the principles of functional modularity(modularity, data abstraction, hierarchy, and layering) to design production steps. Weprovide a proof of concept focusing upon an optimization approach to selective editingapplied to real survey data in standard production conditions at Statistics Spain (INE).Several R packages have been prototyped implementing these ideas. We also sharediverse aspects raising from the practicalities of the implementation.
      Palabras clave / Key words
      Production Architecture, Key-value Pair Data Model, Standardisation, Functional
      Documento / Document
  • A modern vision of official statistical productionD. Salgado
    • Doc.
      03/2016
      Resumen / Abstract
      This work is devoted to defend the claim that the modernisation and industrialisation ofofficial statistical production needs a unified combination of statistics and computerscience in its very principles. We illustrate our vision with concrete proposals undercurrent implementation at Statistics Spain. Following a bottom-up approach we give aprecise formulation of the estimation problem in a finite population, which by usingfunctional modularity principles has allowed us to propose a methodologicalclassification of level-3 production tasks within the Generic Statistical Business ProcessModel. Additionally, in the same spirit we show our attempts to industrialise thestatistical data editing phase by carefully combining rigorous statistical methodologyproposals with a light-weight object-oriented software implementation. Finally, we arguethat the new sources of information for official statistics will underline the need for thisunified combination.
      Palabras clave / Key words
      Modernization, Industrialization, Statistical Production, Statistical methodology, Computer Science
      Documento / Document
  • Process metadata development and implementation under the GSBPM v5.0 at Statistics Spain (INE)D. Salgado, A.I. Sánchez-Luengo
    • Doc.
      02/2016
      Resumen / Abstract
      Statistics Spain (INE) has recently developed and is currently implementing a standard for the documentation of all statistical production processes. This standard is based upon the Generic Statistical Business Process Model (GSBPM) and comprises a third level of sub-processes adapted to our needs. Each sub-process is documented by specifying its inputs, outputs, throughput, tools, documentation, and responsible unit(s). We borrow from computer science general principles such as modularity, abstraction, hierarchy, and layering to cope with the inherent complexity of a statistical production system. Here we offer a general description of the creation of this standard and of its on-going implementation. We include some reflections about the main difficulties towards a modern industrialised statistical production system.
      Palabras clave / Key words
      Process metadata, GSBPM, modernization of official statistics
      Documento / Document
  • Iris: Codificador automático internacional de Causas de muerteJesús Carrillo, Mª del Rosario González
    • Doc.
      01/2016
      Resumen / Abstract
      La Estadística de Defunciones según la causa de muerte es una de las mayores fuentes de información para la investigación epidemiológica y para la toma de decisiones en políticas sanitarias y sociales. La estadística considera la causa básica de defunción. La selección de la causa básica de defunción se basa en las reglas descritas en la Clasificación Internacional de Enfermedades (CIE). Aunque codificadores cualificados realizan la selección de la causa básica, discrepancias en la interpretación de la CIE reducen la homogeneidad de las estadísticas de mortalidad a nivel internacional. El interés por mejorar la calidad de los datos ha llevado a los investigadores a desarrollar sistemas de codificación y selección de la causa básica de defunción. Iris se presenta como un software prometedor, resultado de muchos años de esfuerzo y cooperación internacional, utilizado actualmente por un número creciente de países.
      Palabras clave / Key words
      Estadística de Defunciones según la causa de muerte, causa básica de defunción, Clasificación Internacional de Enfermedades, codificador automático, codificación
      Documento / Document
  • Propuesta de cuenta de producción de los hogares en España en 2010. Estimación de la serie 2003-2010Carlos Angulo, Sara Hernández
    • Doc.
      01/2015
      Resumen / Abstract
      Existen actividades no de mercado realizadas por los hogares que no se tienen en cuenta en la estimación del PIB, como son las relacionadas con la preparación de alimentos, con la limpieza del hogar o con el cuidado de niños y ancianos. En este documento de trabajo se miden y se valoran tales actividades para agregarlas a las cifras de la contabilidad nacional y obtener así una cuenta de producción de los hogares y el PIB extendido con las valoraciones del trabajo doméstico.
      Palabras clave / Key words
      Empleo del tiempo, trabajo no remunerado, producción no de mercado de los hogares, cuentas satélite de los hogares, trabajo doméstico, PIB extendido
      Documento / Document
  • Standardising the editing phase at Statistics Spain: a little step beyond EDIMBUSSilvia Rama, David Salgado
    • Doc.
      05/2014
      Resumen / Abstract
      We propose a slight generalization of the generic EDIMBUS editing and imputationstrategy based on the notion of statistical production function and the inclusion of editingduring data collection therein. Some first consequences are introduced such as theparametrization of the strategy in terms of the amount of cross-sectional informationavailable for the execution of these functions and a minimal set of specification rules forthem (already present in the literature). Also, we pose specific examples of the editingfunction whose goal is the selection of units for interactive editing so as to optimiseresources. The whole proposal fits within the efforts for the modernisation of thestatistical production process conducted at Statistics Spain.
      Palabras clave / Key words
      Editing strategy, EDIMBUS strategy, production function
      Documento / Document
  • Application of the optimization approach to selective editing in the Spanish Industrial Turnover Index and Industrial New Orders Received Index SurveyR. López-Ureña, M. Mancebo, S. Rama, D. Salgado
    • Doc.
      04/2014
      Resumen / Abstract
      We describe in detail the redesign process of the editing and imputation strategy of theSpanish Industrial Turnover Index and Industrial New Orders Received Index survey. Thisprocess incorporates the optimization approach to selective editing in its combinatorialversion, which we show to contain the score function approach for output editing as aparticular case. We also include considerations about editing during data collection and astandardized expression for edits in short-term business statistics. The process embracesfrom the design of the new edits to their implementation in production. As a global result,the rate of selected units for interactive editing (the most resource-consuming directlyimpinging on both costefectiveness and response burden) has been reduced 20percentage points on average without diminishing data quality.
      Palabras clave / Key words
      Selective editing, optimization approach, editing and imputation strategy design
      Documento / Document
  • Additional questions to better measure the self-declared professional status and how to link the mismatches produced in previous series through an econometric modelJavier Orche Galindo, Miguel Ángel García Martínez
    • Doc.
      03/2014
      Resumen / Abstract
      From 2009 onwards, it was decided to include in the Spanish LFS questionnaire some additional questions for workers who self-declared being members of cooperatives, unpaid family workers or self-employed so that the professional status was better measured. Since then, the previously observed mismatch upward in the level on the total number of selfemployed workers was almost completely adjusted. In the new data on professional status, it was also distinguished which of them had changed from self-employment to wage employment due to the supplementary questions. Therefore, after several quarters, it was possible to fit the change in professional status through aneconometric model and a set of significant explanatory variables obtained from the rest of the questionnaire. Finally, we managed to get a good enough model and could be able to set downin the self-employed 2005-2008 series and the corresponding rise (by the same amount) in the wage employment series.
      Palabras clave / Key words
      Labour Force Survey, professional status, self-employment, backcasting, logistic model, imputation, goodness of logistic models
      Documento / Document
  • Comparación de los ingresos del trabajo entre la Encuesta de Condiciones de Vida y las fuentes administrativasPilar Vega, José María Méndez
    • Doc.
      02/2014
      Resumen / Abstract
      En este documento de trabajo se hace un estudio comparativo entre los datos de las rentas del trabajo recogidos mediante entrevista personal en la Encuesta de Condiciones de Vida (ECV) de 2011 y los datos provenientes de fuentes administrativas. Tanto en los ingresos del trabajo por cuenta ajena como por cuenta propia se hace un análisis de los perceptores de ingresos, así como de las diferencias existentes entre los importes de los ingresos que el informante proporciona en la ECV y los que proporcionan las Fuentes Tributarias.
      Palabras clave / Key words
      Encuesta de Condiciones de Vida, ingresos del trabajo, fuentes administrativas.
      Documento / Document
  • Otras facetas de la Encuesta de Empleo del Tiempo 2009-2010Esperanza Vivas, Carlos Angulo, Sara Hernández, Raquel del Val
    • Doc.
      01/2014
      Resumen / Abstract
      Los análisis contenidos en este documento de trabajo abarcan diversos objetivos particulares de la Encuesta de Empleo del Tiempo 2009-2010 que no han tenido cabida en publicaciones anteriores. Así, el primer capítulo describe los lugares donde se desarrolla la actividad humana, el segundo analiza cómo las parejas reparten las responsabilidades del hogar y el tercero proporciona una valoración económica de las actividades productivas no de mercado de los hogares españoles.
      Palabras clave / Key words
      Empleo del tiempo, lugar, distribución de responsabilidades del hogar, trabajo no remunerado, cuenta satélite de los hogares
      Documento / Document
  • Proyecto para la capitalización del gasto en I+D en los nuevos sistemas de cuentas nacionales: estimación de su impacto sobre el PIB y compilación de una cuenta satélite de I+D / Project for the capitalization of expenditure on R in new systems of national accounts: estimating its impact on GDP and compilation of a satellite account of R Alfredo Cristóbal Cristóbal, Mariano Gómez del Moral, Belén González Olmos
    • Doc.
      02/2013
      Resumen / Abstract

      Versión en españolEnglish version

      La medición de las variables y agregados económicos asociados a la actividad de I+D constituye un reto estadístico, especialmente en lo que se refiere a la integración y el análisis de los datos de las estadísticas básicas de I+D en un marco conceptual que permita relacionarlos con las variables y agregados macroeconómicos fundamentales. Así, es necesaria la elaboración de un instrumento que integre conceptual y económicamente la información básica disponible y que lo haga, además, en un marco contable consistente y comparable a escala internacional, como es el Sistema de Cuentas Nacionales.

      The measurement of the variables and economic aggregates associated with R & D is a statistical challenge,especially as it relates to the integration and analysis of data from basic statistics of R & D within a conceptualframework enabling associate variables and key macroeconomic aggregates. Thus, it is necessary to develop atool that integrates conceptual and economically the basic information available and to do so, more over, in aconsistent and comparable accounting framework at international level, such as the System of National Accounts.

      Palabras clave / Key words
      Investigación, desarrollo, formación, bruta, capital, cuenta, satélite / Research and development, gross fixed capital formation, satellite account
      Documento / Document
  • Alternativas en la construcción de un indicador multidimensional de calidad de vida / Alternatives in the construction of a multidimensional quality of life indicatorAntonio Argüeso, Teresa Escudero, José María Méndez, María José Izquierdo
    • Doc.
      01/2013
      Resumen / Abstract

      Versión en españolEnglish version

      La medición multidimensional de la calidad de vida es uno de los aspectos con mayor futuro dentro de la estadística oficial. Distintas iniciativas internacionales animan a la recopilación de informes sobre esta materia y en particular al desarrollo de indicadores que intenten sintetizar la medición en un único indicador. Se presenta un análisis de la evolución de la calidad de vida en España basada en el estudio de nueve dimensiones usando como fuentes diversas encuestas entre las que destaca la encuesta de Condiciones de Vida. Se proponen además dos formas alternativas de sintetizar esa medición con sendos indicadores globales. Finalmente se analizan brevemente los retos a los que se enfrenta la estadística oficial para la medición de la calidad de vida.

      Multidimensional measurement of quality of life is one of the aspects with greater future potential in official statistics. Different international initiatives encourage the compiling of reports on this matter and in particular the development of indicators set out to synthesize measurement in a single indicator. We present an analysis of the trend in the quality of life in Spain based on the study of nine dimensions using as sources various surveys, prominent amongst which is the Survey on Income and Living Conditions (EU- SILC). In addition, two alternative ways of synthesizing that measurement are put forward, each with global indicators. Finally, the challenges official statistics are facing in measuring quality of life are examined briefly.

      Palabras clave / Key words
      Indicadores de calidad de vida, medición multidimensional, condiciones de vida / Quality of life indicators, multidimensional measurement, living conditions
      Documento / Document
  • Uso de fuentes administrativas para la reducción de carga y costes en las encuestas estructurales de empresas (UFAES) / Use Of Administrative Sources To Reduce StatisticalBurden And Costs In Structural Business Surveys(UFAES)Jorge Saralegui, Cristina González, Ignacio Arbués
    • Doc.
      06/2012
      Resumen / Abstract

      Versión en españolA reduced english version

      El uso de fuentes administrativas con fines estadísticos forma parte de la actividad corriente del Instituto Nacional de Estadística español (INE) en diversas áreas. El proyecto UFAES supone un nuevo salto cualitativo en estas actividades, con objetivos orientados a reducir de manera significativa el tamaño muestral de las dos grandes operaciones estructurales de empresas en el INE.

      The use of administrative sources with statistical purposes is part of the current activity of the National Statistics Institute (INE, Spain), in various fields. UFAES project provides a new qualitative impulse to these activities, with objectives oriented to significantly reduce the sample size of the major INE annual structural business surveys.

      Palabras clave / Key words
      Uso, datos, fiscales, encuestas, económicas, reducción, costes, carga, estadística, empresas, administrativas, fines, estadísticos / integration, tax, microdata, enterprise, surveys, indirect, estimation, change, enterprise, structural, variables
  • Implementing a corporate-wide metadata driven production process at INE SpainPedro Revilla, José Luis Maldonado, José Luis Bercebal, Francisco Hernández
    • Doc.
      05/2012
      Resumen / Abstract
      As other national statistical institutes, INE has started the transition from the numerous stovepipe-like chains of production to more integrated production processes. The Generic Statistical Business Process Model (GSBPM) provides a framework for the development of this goal. This paper describes INE experiences developing this new model, based on a single standardized production line for all surveys, supported by metadata systems, generic and standardized tools and corporative databases.
      Palabras clave / Key words
      Process reengineering, Enterprise architecture, European Statistical System
      Documento / Document
  • Implementing a Quality Assurance Framework based on the Code of Practice at the National Statistical Institute of SpainPedro Revilla, Asunción Piñán
    • Doc.
      04/2012
      Resumen / Abstract

      Quality has always been a constant concern at the National Statistical Institute of Spain (INE). Nevertheless, a more systematic approach has been implemented since the LEG on quality recommendations and especially since the adoption of the Code of Practice. This paper describes INE experiences implementing a Quality Assurance Framework based on the Code of Practice and in the Sponsorship on Quality recommendations.

      A quality structure was created, made up of a Quality Unit, a Quality Manager and a Quality Committee. Through this Committee, all INE units are involved in quality, taking decisions that, once approved by the Board of Directors, are adopted throughout the organization. Moreover, implementing a Quality Assurance Framework based on the Code of Practice is an INE project for 2012.

      Calculating the indicators of the Barometer of Quality, implementing a reference metadata system including a quality report, implementing a satisfaction survey, and adopting the GSBPM as a good practice are some of the actions put in practice.

      Palabras clave / Key words
      Quality assurance framework, Code of Practice, European statistics
      Documento / Document
  • Two greedy algorithms for a binary quadratically constrained linear program in survey data editingDavid Salgado, Ignacio Arbués, María Elisa Esteban
    • Doc.
      03/2012
      Resumen / Abstract
      We propose a binary quadratically constrained linear program as an approach to selective editing. In a practice-oriented framework and allowing for some overediting whilst strictly fulfilling accuracy restrictions, we propose two greedy algorithms to find feasible suboptimal solutions. Their running times are quartic and cubic, respectively, in the number of sampling units and linear in the number of restrictions. We present computational evidence from several hundreds of instances randomly generated.
      Palabras clave / Key words
      Combinatorial optimization, quadratic constraint, linear program, greedy algorithm, selective editing.
      Documento / Document
  • Testing the predictive ability of two classes of modelsIgnacio Arbués, Cristina Casaseca, Ramiro Ledo, Silvia Rama
    • Doc.
      02/2012
      Resumen / Abstract
      We propose tests for the null that the best model of a class produces as good forecasts as the best model of another one. Forecasts are evaluated using a loss function. Thus, causality can be tested if only the models in one class use a certain input. This is applied to the unemployment/inflation and industrial orders/production relationships. We find causality for the USA, but neither for France nor Spain.
      Palabras clave / Key words
      Evaluating forecasts, Loss function, Model selection, Causality, Bootstrap, Monte Carlo.
      Documento / Document
  • Analysis of the calendar effects on the Industry Turnover and New Orders Received IndicesSilvia Rama, Ignacio Arbués, María Mancebo, Luis Andrés de las Mozas, Eva María Vicente
    • Doc.
      01/2012
      Resumen / Abstract

      Most economic monthly time series contain calendar effects. It is important to remove the calendar variation to allow an effective assessment of the variation due to other factors.

      Several methods exist which can adjust for trading-day and holiday effects in monthly economic time series. This paper reviews these methods and shows the procedure for determining the calendar adjustment carried out on the Industrial Turnover and New Orders Received Indices.

      Palabras clave / Key words
      Working-day adjustment, dynamic regression, ARIMA, model identification.
      Documento / Document
  • El INE y su producción estadística: una nota histórica sobre los últimos 50 añosMariano Gómez del Moral
    • Doc.
      11/2011
      Resumen / Abstract
      En este artículo se ofrece la historia del INE en los últimos cincuenta años a través de los principales productos estadísticos que han caracterizado su actividad. La descripción toma como puntos de referencia tres de los hitos políticos y económicos más relevantes de la historia de España en ese periodo, que se vinculan con tres clases de necesidades informativas y tres modos diferentes de abordar la producción de los datos que han permitido la cobertura de las mismas. El artículo proporciona igualmente un detalle de las principales líneas estratégicas que han soportado el quehacer de la institución y que le han permitido situarse en el grupo de cabeza de las oficinas de estadística pública europeas.
      Palabras clave / Key words
      Hitos, líneas estratégicas, bien público, marco internacional, planificación
      Documento / Document
  • Modelling irrigation water consumption at the micro data level in the Survey of Production Methods in Agriculture 2009 (Spain)Jorge Saralegui Gil, Fernando Celestino Rey
    • Doc.
      10/2011
      Resumen / Abstract

      Preliminary studies for the second phase of the Agricultural Census 2009 (Survey ofProduction Methods) recommended not to include in the survey forms the quantity of water consumed ( required by the EU regulation) as an specific question, mainly due to the risk of high measurement errors. Therefore it was decided to launch a project of model assisted estimation in several stages, described in the paper:

      I) Theoretical water needs. After several treatments, the theoretical water requirements per crop are estimated based on an agroclimatic model.

      II) Adjustments for irrigation efficiency. In this stage the irrigation water needs per crop is imputed according to the irrigation techniques used by the holding.

      III) Management efficiency. Final estimation of effective consumption is implemented and adjusted to external sources to take into account the management efficiency of irrigation.

      Palabras clave / Key words
      Water used per crop. Theoretical water needs. Evapotranspiration. Irrigation efficiency
      Documento / Document
  • Metodología de estimación de Diplomados en Estadística del Estado en las delegaciones provinciales del INEJulio César Hernández Sánchez, Cristobal Rojas Montoya
    • Doc.
      09/2011
      Resumen / Abstract
      El objetivo es estimar los diplomados en estadística del estado necesarios en cada una de las delegaciones provinciales del INE. La metodología sugiere identificar diferentes componentes en la estimación. Se estimarán cuatro modelos para los bloques de operaciones económicas, demográficas, bienales y censo electoral y padrón, que se acumulan para obtener la primera predicción. Además, se creará un modelo global y ambas predicciones se combinarán.
      Palabras clave / Key words
      DEE, censo electoral, padrón, cargas de trabajo, estimación, delegaciones provinciales del INE
      Documento / Document
  • Integrating administrative data into the LFS data collection. The Spanish experience obtaining the variable INDECIL from administrative sources.Miguel Ángel García Martínez, Javier Orche Galindo
    • Doc.
      08/2011
      Resumen / Abstract
      Information on the level of wages of the main job is compulsory in the LFS since 2009 (year of reference). Asking for income in household surveys is a sensitive issue that can affect the response rates and the confidence of the respondents. It was decided to obtain information from administrative sources. The Spanish LFS does not ask the personal identification number of the respondents. The solution applied in the Spanish LFS was to incorporate the PIDN (personal identification number) from the register of population matching the information for both, personal and location variables and to use this PIDN to link through the Social Security and Tax databases and incorporate the data on salaries needed to calculate the variable requested in the LFS. A general view of all the processes involved, the difficulties that we had to overcome and the main findings obtained in the preparation of the information are described.
      Palabras clave / Key words
      Labour force survey, record linkage, microintegration, combination of administrative data, validation of data sources, best estimation method
      Documento / Document
  • Towards a corporate-wide electronic data collection system at the National Statistical Institute of SpainPedro Revilla, José Luis Maldonado, José Manuel Bercebal
    • Doc.
      07/2011
      Resumen / Abstract
      Electronic collections present new challenges and opportunities in order to improve editing tasks. They offer the possibility of using built-in edits in electronic questionnaires previously not possible in paper or other modes of data collection. This topic covers all issues relating to methods or strategies about editing of data acquired through electronic data collection (CAPI, CATI, CAWI, etc) and the way the respondents can carry out editing when using electronic questionnaires. Other related topics may include comparisons of editing practices between electronic collections and other collection modes, as well as different problems using multimode data collections. Measuring the respondent burden and the quality and reliability of the responses in order to provide valuable information to other survey processes is another issue of interest. Papers describing editing strategies to improve relationship with respondents or the general editing process are also welcome.
      Palabras clave / Key words
      Electronic questionnaries, electronic data reporting, web questionnaries, CAWI, IRIA,
      Documento / Document
  • Sampling coordination of business surveys in the Spanish National Statistics InstituteDolores Lorca, M. Concepción Molina, Gonzalo Parada, Ana Revilla
    • Doc.
      06/2011
      Resumen / Abstract

      The Spanish NSI works in several alternatives in order to reduce the statistical burden in business surveys. One of them is the use of sampling coordination techniques to reducethe overlap between samples of different surveys.

      The use of the same sampling frame for business surveys (the Central Business Register) has allowed to obtain coordinated samples using the Permanent Random Number (PRN)technique. A statistical burden function is defined and used to coordinate the samples obtained each year.

      Palabras clave / Key words
      Statistical burden, sampling coordination, business survey
      Documento / Document
  • Multivariate Wiener-Kolmogorov Filtering by Polynomial MethodsFélix Aparicio-Pérez
    • Doc.
      05/2011
      Resumen / Abstract
      The exact computation of a general multivariate Wiener-Kolmogorov filter is usually done in state-space form. This paper develops a new method to solve the problem within the polynomial framework. To do so, several matrix polynomial techniques are used. To obtain the initial values, some new techniques are also developed. Finally, some extensions and applications are outlined.
      Palabras clave / Key words
      Wiener-Kolgomorov filter, polynomial matrices
      Documento / Document
  • Exploiting auxiliary information: selective editing as a combinatorial optimization problemDavid Salgado
    • Doc.
      04/2011
      Resumen / Abstract
      We formulate selective editing as a combinatorial optimization problem whose solution establishes which sampled units contain influential errors and thus must undergo interactive editing within a generic editing and imputation strategy. This optimization problem arises naturally from considerations on editing resources savings and estimates accuracy control. Cross-sectional auxiliary information is taken into account through linear mixed models assisting the construction of the problem's feasibility region. We provide a general algorithm for the univariate version of this problem, i.e. for editing one single variable. By applying this proposal to each questionnaire variable we illustrate its use upon the Spanish industrial turnover index and industrial new orders received index surveys. A reduction of interactive editing with a controllably increase of estimates error is observed.
      Palabras clave / Key words
      Selective editing, combinatorial optimization, auxiliary information, linear mixed models
      Documento / Document
  • Study of variance estimation methods in the Spanish Labour Force Survey (EPA)Gerardo Azor Martínez, Juan V. Jiménez Llorente, Carlos Pérez Arriero, Juana Porras Puga
    • Doc.
      03/2011
      Resumen / Abstract
      The aim of this paper is to compare different methods for calculating sampling errors in the Spanish Labour Force Survey (EPA). Half sample replication (HSR) is the method currently employed to this end. We compare its results with those obtained with two other more recent techniques, standard delete-one jackknife and Rao-Wu-Yue bootstrap. The paper begins with a brief description of the EPA methodology, and goes on with a theoretical presentation of the above mentioned methods, followed by the coefficient of variation (CV) calculated for the estimates of the most important EPA variables in 2009. Finally, we present a more detailed study for the autonomous community of Galicia. In this NUTS2 the sample has been enlarged in the third quarter of 2009, and this fact allows us to study the changes in the estimates of the variance, in relation to the change in sample size.
      Palabras clave / Key words
      Sampling errors, half sample replication, jackknife, bootstrap
      Documento / Document
  • On the error of backcast estimates using conversion matrices under a change of classificationIgnacio Arbués, Natalia López
    • Doc.
      02/2011
      Resumen / Abstract
      The classifications used by statistical agencies are sometimes updated. Hence, for the sake of comparability, it is necessary to estimate data from past periods according to the new classification. A frequently used method to calculate the estimates is through the use of Conversion Matrices. We present a theoretical analysis of this method and show with a practical example that it is possible to obtain useful estimates of the error.
      Palabras clave / Key words
      Change of Classification, Backcasting
      Documento / Document
  • Linking data from administrative records and the Living Conditions SurveyJosé María Méndez Martín, Pilar Vega Vicente
    • Doc.
      01/2011
      Resumen / Abstract
      The Encuesta de Condiciones de Vida (Living Conditions Survey, LCS) is an annual survey compiled by the Instituto Nacional de Estadística (Spanish National Statistics Institute, INE). Access to administrative records offers a good opportunity to improve the quality of the relevant data and allow the use of a more efficient collection method. This paper offers a comparative analysis of different income components by linking the survey data with available data from the Spanish Tax Agency or Social Security system.
      Palabras clave / Key words
      Living Conditions Survey, administrative records, household income
      Documento / Document
  • Monthly Demographic Now Cast: monthly estimates of migration flows in SpainMiguel Ángel Martínez Vidal, Sixto Muriel de la Riva
    • Doc.
      08/2010
      Resumen / Abstract
      The Monthly Demographic Now Cast has the goal of covering a traditional lack of information about the demographic juncture. Besides, it is a very innovative statistical action, which introduces a new monthly basis in demographic analysis and shows a high level of accuracy immediately after the reference period. Particularly, its results are decisive in providing advanced estimates of current migration flows for the calculation of Spain¿s Population Now Cast.
      Palabras clave / Key words
      Monthly demographic estimations; immigration; emigration; expanding coefficient; registered flows
      Documento / Document
  • Determining the MSE-optimal cross section to forecastIgnacio Arbués
    • Doc.
      07/2010
      Resumen / Abstract
      We address the problem of which subset of time series to select among a given set in order to forecast another series. The forecasts are evaluated in terms of Mean Squared Error. We propose a family of criteria for which weak and strong consistency results are proved. The criteria are compared to some well-known hypothesis tests by means of Monte Carlo experimentation and a real-data example.
      Palabras clave / Key words
      forecasting, model selection, VARMA models
      Documento / Document
  • INE-Spain strategy on population estimates and projections: facing the challenge of the statistical measure of populationMiguel Ángel Martínez Vidal, Sixto Muriel de la Riva
    • Doc.
      06/2010
      Resumen / Abstract
      National Statistics Institute of Spain presents new actions focused on improving the available statistical sources of demographic information and providing accurate population figures and punctual, detailed and consistent information on current demographic evolution, in a context of general concerns about the current and future evolution of the population pyramid.
      Palabras clave / Key words
      Demographic flows; population now cast; demographic projections
      Documento / Document
  • Effects of rotation groups, interviewing modes and interviewers on the LFS estimatesFlorentina Álvarez, Juana Porras
    • Doc.
      05/2010
      Resumen / Abstract
      This paper examines the influence of three factors (rotation groups, method of interview and interviewer effect) on the main estimations in the Labour Force Survey (LFS), by performing probit and homogeneity analysis. It also shows that the influence of the interview method is partially due to the different representation of foreign people in the CATI and CAPI samples. Finally, it highlights the importance of a correct identification of the bias sources and outlines the future plans to improve the standardization in the Spanish LFS.
      Palabras clave / Key words
      Bias Sources, LFS, Rotation Groups, CAPI y CATI, Interviewer effects
      Documento / Document
  • Towards advanced methods for computing life tablesSixto Muriel de la Riva, Margarita Cantalapiedra Malaguilla, Federico López Carrión
    • Doc.
      04/2010
      Resumen / Abstract
      INE Spain puts compiled data and literature in a series of tests and comparative analyses in order to select the best-suited methodologies for computing life tables, trying to reach an optimal use of the available data on deaths, better international comparability of mortality indicators and an enhanced approach to measuring mortality over small territorial areas.
      Palabras clave / Key words
      Mortality tables; risks of dying; regional mortality tables; different methods to build mortality tables
      Documento / Document
  • Changes of classification in continuous statistics: Calculation of retrospective series. Application to the Quarterly Labour Cost SurveyAmelia Fresneda Pacheco, María Ramos Charbonnier
    • Doc.
      03/2010
      Resumen / Abstract
      A change in the system of classifications of a statistic produces new estimations that are not comparable to the previous ones. Frequently it is desirable to provide temporal series of the results with a temporal horizon enough to allow its analysis, and that's why it is necessary to elaborate retrospective series in the new system. This document tries to explain the procedure applied to adjust the estimations about Labour Costs in Spain to the new National Classification of Economic Activities (CNAE09), that is the spanish version of the EU Classification of Economic Activities, NACE rev 2
      Palabras clave / Key words
      Backcasting, Calibration, Post-stratified estimator, Stratified Randon Sampling, National Classification of Economic Activities, Quarterly Labour Cost Survey
      Documento / Document
  • A Class of stochastic optimization problems with application to selective data editingIgnacio Arbués, Margarita González, Pedro Revilla
    • Doc.
      02/2010
      Resumen / Abstract
      We present a new class of stochastic optimization problems where the solutions belong to an infinite-dimensional space of random variables. We prove existence of the solutions and show that under convexity conditions, a duality method can be used. The search for a good selective editing strategy is stated as a problem in this class, setting the expected workload as the objective function to minimize and imposing quality constraints.
      Palabras clave / Key words
      Stochasting Programming, Optimization in Banach Spaces, Selective Editing, Score Function
      Documento / Document
  • Elaboración de un indicador sintético de medio ambiente. Resultados derivados de la Encuesta de Hogares y Medio Ambiente 2008Carmen Teijeiro Breijo, Carlos Angulo Martín
    • Doc.
      01/2010
      Resumen / Abstract
      El objetivo de la elaboración del indicador de Medio Ambiente es posicionar a los hogares y personas en función de su grado de sensibilización con los problemas medioambientales, teniendo en cuenta tanto su comportamiento colectivo como individual. El indicador propuesto pretende sintetizar de modo más manejable la información multidimensional recogida en la Encuesta de Hogares y Medio Ambiente 2008, que ha realizado el INE y permite establecer comparaciones por características socioeconómicas y territoriales.
      Palabras clave / Key words
      Indicador sintético, análisis multivariante, medio ambiente
      Documento / Document