The Minimum Description Length Principle

MIT Press, 2007 - 703 pages

The minimum description length (MDL) principle is a powerful method of inductive inference, the basis of statistical modeling, pattern recognition, and machine learning. It holds that the best explanation, given a limited set of observed data, is the one that permits the greatest compression of the data. MDL methods are particularly well suited for model selection, prediction, and estimation problems in situations where the models under consideration can be arbitrarily complex and overfitting the data is a serious concern.

This extensive, step-by-step introduction to the MDL Principle provides a comprehensive reference (with an emphasis on conceptual issues) that is accessible to graduate students and researchers in statistics, pattern classification, machine learning, and data mining, to philosophers interested in the foundations of statistics, and to researchers in other applied sciences that involve model selection, including biology, econometrics, and experimental psychology. Part I provides a basic introduction to MDL and an overview of the concepts in statistics and information theory needed to understand MDL. Part II treats universal coding, the information-theoretic notion on which MDL is built, and Part III gives a formal treatment of MDL theory as a theory of inductive inference based on universal coding. Part IV provides a comprehensive overview of the statistical theory of exponential families with an emphasis on their information-theoretic properties. The text includes a number of summaries, paragraphs offering the reader a "fast track" through the material, and boxes highlighting the most important concepts.
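The compression idea can be made concrete with the crude two-part code treated in Chapter 5: score each candidate model by the bits needed to describe the model itself plus the bits needed to describe the data with the model's help, and select the model minimizing the total. The Python sketch below illustrates this for choosing a polynomial degree; the particular coding scheme (a Gaussian code for the residuals and a (k/2) log2 n cost for k parameters) and the toy data are simplifying assumptions for illustration, not the book's exact construction.

    import numpy as np

    def two_part_code_length(x, y, degree):
        # Crude two-part code length (in bits) for a polynomial model:
        #   L(H):   cost of describing the fitted parameters
        #   L(D|H): cost of describing the data given the model (Gaussian residual code)
        n = len(y)
        k = degree + 1                        # number of polynomial coefficients
        coeffs = np.polyfit(x, y, degree)     # maximum-likelihood fit
        residuals = y - np.polyval(coeffs, x)
        sigma2 = max(residuals.var(), 1e-12)  # ML noise variance, floored for stability
        # L(D|H): negative log2-likelihood of the data under the fitted Gaussian model
        data_bits = 0.5 * n * np.log2(2 * np.pi * np.e * sigma2)
        # L(H): encoding k real parameters to precision ~1/sqrt(n) costs about (k/2) log2 n bits
        model_bits = 0.5 * k * np.log2(n)
        return model_bits + data_bits

    # Noisy quadratic data: the two-part score should prefer a low-degree fit
    # over a wiggly high-degree one that merely memorizes the noise.
    rng = np.random.default_rng(0)
    x = np.linspace(-1.0, 1.0, 100)
    y = 1.0 - 2.0 * x + 3.0 * x**2 + rng.normal(scale=0.3, size=x.size)
    best_degree = min(range(11), key=lambda d: two_part_code_length(x, y, d))
    print("degree selected by two-part MDL:", best_degree)

On data like this the total code length should bottom out at or near degree 2: higher-degree fits reduce the residual bits only slightly while paying roughly an extra (1/2) log2 n bits per additional coefficient.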


Table of Contents

1   Learning Regularity and Compression   3
2   Probabilistic and Statistical Preliminaries   41
3   Information-Theoretic Preliminaries   79
4   Information-Theoretic Properties of Statistical Models   109
5   Crude Two-Part Code MDL   131

Part II   Universal Coding   165
6   Universal Coding with Countable Models   171
7   Normalized Maximum Likelihood   207
8   Bayes   231
9   Prequential Plug-in   257
10  Two-Part   271
11  NML with Infinite Complexity   295
12  Linear Regression   335

Part III   Refined MDL   403
14  MDL Model Selection   409
15  MDL Prediction and Estimation   459
16  MDL Consistency and Convergence   501
17  MDL in Context   523

Part IV   Additional Background   597
18  The Exponential or Maximum Entropy Families   599
19  Information-Theoretic Properties of Exponential Families   623


About the Author (2007)

Peter D. Grunwald is a researcher at CWI, the National Research Institute for Mathematics and Computer Science, Amsterdam, the Netherlands. He is also affiliated with EURANDOM, the European Research Institute for the Study of Stochastic Phenomena, Eindhoven, the Netherlands.

Bibliographic Information