Sampling and Estimation from Finite Populations. Yves Tille

Чтение книги онлайн.

Читать онлайн книгу Sampling and Estimation from Finite Populations - Yves Tille страница 11

Sampling and Estimation from Finite Populations - Yves Tille

Скачать книгу

      The almost 20 years I spent in Neuchâtel were dotted with multiple adventures. I am particularly grateful to Philippe Eichenberger and Jean‐Pierre Renfer, who successively headed the Statistical Methods Section of the Federal Statistical Office. Their trust and professionalism helped to establish a fruitful exchange between the Institute of Statistics of the University of Neuchâtel and the Swiss Federal Statistical Office.

      I am also very grateful to the PhD students that I have had the pleasure of mentoring so far. Each thesis is an adventure that teaches both supervisor and doctoral student. Thank you to Alina Matei, Lionel Quality, Desislava Nedyalkova, Erika Antal, Matti Langel, Toky Randrianasolo, Eric Graf, Caren Hasler, Matthieu Wilhelm, Mihaela Guinand‐Anastasiade, and Audrey‐Anne Vallée who trusted me and whom I had the pleasure to supervise for a few years.

       Yves Tillé

      Neuchâtel, 2018

      This book contains teaching material that I started to develop in 1994. All chapters have indeed served as a support for teaching, a course, training, a workshop or a seminar. By grouping this material, I hope to present a coherent and modern set of results on the sampling, estimation, and treatment of nonresponses, in other words, on all the statistical operations of a standard sample survey.

      In producing this book, my goal is not to provide a comprehensive overview of survey sampling theory, but rather to show that sampling theory is a living discipline, with a very broad scope. If, in several chapters demonstrations have been discarded, I have always been careful to refer the reader to bibliographical references. The abundance of very recent publications attests to the fertility of the 1990s in this area. All the developments presented in this book are based on the so‐called “design‐based” approach. In theory, there is another point of view based on population modeling. I intentionally left this approach aside, not out of disinterest, but to propose an approach that I deem consistent and ethically acceptable to the public statistician.

      I would like to thank all the people who, in one way or another, helped me to make this book: Laurence Broze, who entrusted me with my first sampling course at the University Lille 3, Carl Särndal, who encouraged me on several occasions, and Yves Berger, with whom I shared an office at the Université Libre de Bruxelles for several years and who gave me a multitude of relevent remarks. My thanks also go to Antonio Canedo who taught me to use LaTeX, to Lydia Zaïd who has corrected the manuscript several times, and to Jean Dumais for his many constructive comments.

      I wrote most of this book at the École Nationale de la Statistique et de l'Analyse de l'Information. The warm atmosphere that prevailed in the statistics department gave me a lot of support. I especially thank my colleagues Fabienne Gaude, Camelia Goga, and Sylvie Rousseau, who meticulously reread the manuscript, and Germaine Razé, who did the work of reproduction of the proofs. Several exercises are due to Pascal Ardilly, Jean‐Claude Deville, and Laurent Wilms. I want to thank them for allowing me to reproduce them. My gratitude goes particularly to Jean‐Claude Deville for our fruitful collaboration within the Laboratory of Survey Statistics of the Center for Research in Economics and Statistics. The chapters on the splitting method and balanced sampling also reflect the research that we have done together.

       Yves Tillé

      Bruz, 2001

      1.1 Introduction

      Scientific truth is often presented as the consensus of a scientific community at a specific point in time. The history of a scientific discipline is the story of these consensuses and especially of their changes. Since the work of Thomas Samuel Kuhn (1970), we have considered that science develops around paradigms that are, according to Kuhn (1970, p. 10), “models from which spring particular coherent traditions of scientific research.” These models have two characteristics: “Their achievement was sufficiently unprecedented to attract an enduring group of adherents away from competing modes of scientific activity. Simultaneously, it was sufficiently open‐ended to leave all sorts of problems for the redefined group of practitioners to resolve.” (Kuhn, 1970, p. 10).

      Many authors have proposed a chronology of discoveries in survey theory that reflect the major controversies that have marked its development (see among others Hansen & Madow, 1974; Hansen et al., 1983; Owen & Cochran, 1976; Sheynin, 1986; Stigler, 1986). Bellhouse (1988a) interprets this timeline as a story of the great ideas that contributed to the development of survey sampling theory. Statistics is a peculiar science. With mathematics for tools, it allows the methodology of the other disciplines to be finalized. Because of the close correlation between a method and the multiplicity of its fields of action, statistics is based on a multitude of different ideas from the various disciplines in which it is applied.

      The theory of survey sampling plays a preponderant role in the development of statistics. However, the use of sampling techniques has been accepted only very recently. Among the controversies that have animated this theory, we find some of the classical debates of mathematical statistics, such as the role of modeling and a discussion of estimation techniques. Sampling theory was torn between the major currents of statistics and gave rise to multiple approaches: design‐based, model‐based, model‐assisted, predictive, and Bayesian.

      The development of statistics (etymologically, from German: analysis of data about the state) is inseparable from the emergence of modern states in the 19th century. One of the most outstanding personalities in the official statistics of the 19th century is the Belgian Adolphe Quételet (1796–1874). He knew of Laplace's method and maintained a correspondence with him. According to Stigler (1986, pp. 164–165), Quételet was initially attracted to the idea of using partial data. He even tried to apply Laplace's method to estimate the population of the Netherlands in 1824 (which Belgium was a part of until 1830). However, it seems that he then rallied to a note from Keverberg (1827) which severely criticized the use of partial data in the name of precision and accuracy:

      In my opinion, there is only one way to arrive at an exact knowledge of the population and the elements

Скачать книгу