Probability with R. Jane M. Horgan

Чтение книги онлайн.

Читать онлайн книгу Probability with R - Jane M. Horgan страница 12

Probability with R - Jane M. Horgan

Скачать книгу

target="_blank" rel="nofollow" href="#u35053f73-9b3c-5b00-8eab-afd259d2a886">Part I, “The R Language” now contains:new and improved R procedures, and an introduction to packages and interfaces (Chapter 1);examples on apps to illustrate outliers, to calculate statistics in a data frame and statistics appropriate to skewed data (Chapter 2);an introduction to linear regression, with a discussion of its importance as a tool in machine learning. We show how to obtain the line of best fit with the training set, and how to use the testing set to examine the suitability of the model. We also include extra graphing facilities (Chapter 3).

      2 In Part II, “Fundamentals of Probability”:Chapter 4 has been extended with extra examples on password recognition and new R functions to address hash table collision, server overload and the general allocation problem;The concept of “independence” has now been extended from pairs to multiply variables (Chapter 6);Chapter 7 contains new material on machine learning, notably the use of Bayes' theorem to develop spam filters.

      3 Part III “Discrete Distributions” now includes:an introduction to bivariate discrete distributions, and programming techniques to handle large conditional matrices (Chapter 9);an algorithm to simulate the Markov property of the geometric distribution (Chapter 10);an extension of the reliability model of Chapter 8 to the general reliability model (Chapter 11);an update of the lottery rules (Chapter 12);an extended range of Poisson applications such as network failures, website hits, and virus attacks (Chapter 13).

      4 In Part IV “Continuous Distributions”:Chapters 16 and 17 have been reorganized. Chapter 17 now concentrates entirely on queues while Chapter 16 is extended to deal with the applications of the exponential distribution to lifetime models.

      5 Part V “Tailing Off”has extra exercises on recent applications of computing.

      6 We have added three new appendices: Appendix A gives the data set used in Part I, Appendix B derives the coefficients of the line of best fit and Appendix F contains new proofs of the Markov and Chebyshev inequalities. The original appendices A, B, and C have been relabeled C, D, and E.

      7 A separate index containing R commands and functions has been added.

      All errors in the first edition have hopefully been corrected. I apologize in advance for any new errors that may escape my notice in this edition; should they arise, they will be corrected in the companion website.

      Jane M. Horgan

       Dublin City University

       Ireland

       2019

      This book is offered as a first introduction to probability, and its application to computer disciplines. It has grown from a one‐semester course delivered over the past several years to students reading for a degree in computing at Dublin City University. Students of computing seem to be able happily to think about Database, Computer Architecture, Language Design, Software Engineering, Operating Systems, and then to freeze up when it comes to “Probability,” and to wonder what it might have to do with computing. Convincing undergraduates of the relevance of probability to computing is one of the objectives of this book.

      One reason for writing this has been my inability to find a good text in which probability is applied to problems in computing at the appropriate level. Most existing texts on probability seem to be overly rigorous, too mathematical for the typical computing student. While some computer students may be adept at mathematics, there are many who resist the subject. In this book, we have largely replaced the mathematical approach to probability by one of simulation and experimentation, taking advantage of the powerful graphical and simulation facilities of the statistical system R, which is freely available, and downloadable, from the web. The text is designed for students who have taken a first course in mathematics, involving just a little calculus, as is usual in most degree courses in computing. Mathematical derivations in the main text are kept to a minimum: when we think it necessary, algebraic details are provided in the appendices. To emphasize our attitude to the simulation and experimentation approach, we have chosen to incorporate instructions in R throughout the text, rather than put them back to an appendix.

      Features of the book which distinguish it from other texts in probability include

       R is used not only as a tool for calculation and data analysis, but also to illustrate the concepts of probability, to simulate distributions, and to explore by experimentation different scenarios in decision‐making. The R books currently available skim over the concepts of probability, and concentrate on using it for statistical inference and modelling.

       Recognizing that the student better understands definitions, generalizations and abstractions after seeing the applications, almost all new ideas are introduced and illustrated by real, computer‐related, examples, covering a wide range of computer science applications.

      Although we have addressed in the first instance computer scientists, we believe that this book should also be suitable for students of engineering and mathematics.

      There are in all five parts to the book, starting in Part I with an introduction to R. This presents the procedures of R needed to summarize and provide graphical displays of statistical data. An introduction to programming in R is also included. Not meant to be a manual, this part is intended only to get the student started. As we progress, more procedures of R are introduced as the need arises.

      Part II sets the foundations of probability, and introduces the functions available in R for examining them. R is used not only for calculating probabilities involving unwieldy computations but also for obtaining probabilities through simulation. Probability events and sample spaces are illustrated with the usual gambling experiments, as well as inspection of integrated‐circuit chips, and observation of randomness in computer programming. A discussion of the “Intel Chip Fiasco” leads on to the “balls and bins” problem, which in turn is applied to assigning jobs to processors. It is shown how Bayes' Theorem has important applications in modern‐day computer science such as machine learning and machine translation. Methods to assess reliability of a computer containing many systems, which in turn contain many components, are considered.

      Part III deals with discrete random variables. Nearly every chapter opens with a sequence of examples, designed to motivate

Скачать книгу