"Computational Learning Theory (COLT) is a reasearch field devoted to studying the design and analysis of algorithms for making predictions about the future based on past experiences. The emphasis in COLT is on rigorous mathematical analysis. As a field with roots in theoretical computer science, COLT is largely concerned with computational and data efficiency. Much of the work in COLT can be traced to Valiant's seminal paper on "A theory of the learnable" (1984) as well as Gold's "Language identification in the limit" (1967). The annual Conference on Computational Learning Theory began in 1988; the European Conference on Computational Learning Theory and the Workshop on Algorithmic Learning Theory were formed soon after. COLT has strongly encouraged interaction with other fields that work on problems of prediction such as applied machine learning, statistics, information theory, pattern recognition and statistical physics, as well as other areas of computer science such as artificial intelligence, complexity theory and cryptography."
Freund and Schapire
BENVENISTE, A., P. PRIOURET and M. M?TIVIER, 1990. Adaptive algorithms and stochastic approximations - all 2 versions ». Springer-Verlag New York, Inc. New York, NY, USA. [Cited by 417] (24.19/year)
BERTSEKAS, D.P., 1987. Dynamic programming: deterministic and stochastic models. Prentice-Hall, Inc. Upper Saddle River, NJ, USA. [Cited by 708] (34.99/year)
BERTSEKAS, D.P., 1995. Dynamic Programming and Optimal Control - all 5 versions ». Athena Scientific. [Cited by 998] (81.57/year)
BERTSEKAS, D.P. and J.N. TSITSIKLIS, 1996. Neuro-Dynamic Programming - all 5 versions ». Athena Scientific. [Cited by 1452] (129.23/year)
CSISZAR, I. and J.G. KORNER, 1982. Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, Inc. Orlando, FL, USA. [Cited by 733] (29.05/year)
EISENBERG, B. and R.L. RIVEST, 1990. On the sample complexity of pac-learning using random and chosen examples. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA. [Cited by 21] (1.22/year)
GOLDBERG, D.E., 1989. Genetic Algorithms in Search, Optimization and Machine Learning - all 8 versions ». Addison-Wesley Longman Publishing Co., Inc. Boston, MA, USA. [Cited by 19220] (1,054.00/year)
HANCOCK, T.R., 1990. Identifying ?-formula decision trees with queries. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA. [Cited by 24] (1.39/year)
HARRISON, M.A.A. and M.A. HARRISON, 1978. Introduction to Formal Language Theory - all 2 versions ». Addison-Wesley Longman Publishing Co., Inc. Boston, MA, USA. [Cited by 560] (19.15/year)
KOHONEN, T., 1989. Self-organization and associative memory - all 6 versions ». Springer-Verlag New York, Inc. New York, NY, USA. [Cited by 4739] (259.88/year)
LAIRD, P.D., 1988. Learning from good and bad data - all 2 versions ». Kluwer Academic Publishers Norwell, MA, USA. [Cited by 73] (3.80/year)
LAVRAC, N. and S. DZEROSKI, 1993. Inductive Logic Programming: Techniques and Applications - all 2 versions ». Routledge New York, NY, 10001. [Cited by 457] (32.10/year)
LITTLE, R.J.A. and D.B. RUBIN, 1986. Statistical analysis with missing data - all 8 versions ». John Wiley & Sons, Inc. New York, NY, USA. [Cited by 4184] (197.03/year)
LLOYD, J.W., 1987. Foundations of logic programming - all 7 versions ». Springer-Verlag New York, Inc. New York, NY, USA. [Cited by 3150] (155.67/year)
MAASS, W., 1991. On-line learning with an oblivious environment and the power of randomization. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA. [Cited by 20] (1.23/year)
MAASS, W. and G. TUR?N, 1994. How fast can a threshold gate learn?. MIT Press Cambridge, MA, USA. [Cited by 33] (2.49/year)
MACHTEY, M. and P. YOUNG, 1978. An Introduction to the General Theory of Algorithms - all 3 versions ». Elsevier Science Inc. New York, NY, USA. [Cited by 150] (5.13/year)
NEWELL, A., 1990. Unified theories of cognition - all 2 versions ». Harvard University Press Cambridge, MA, USA. [Cited by 2070] (120.10/year)
OSHERSON, D.N., M. STOB and S. WEINSTEIN, 1986. Systems that learn: an introduction to learning theory for cognitive and computer scientists - all 2 versions ». MIT Press Cambridge, MA, USA. [Cited by 123] (5.79/year)
RISSANEN, J., 1989. Stochastic Complexity in Statistical Inquiry Theory - all 2 versions ». World Scientific Publishing Co., Inc. River Edge, NJ, USA. [Cited by 1226] (67.23/year)
ROGERS, H., 1987. Theory of recursive functions and effective computability - all 3 versions ». MIT Press Cambridge, MA, USA. [Cited by 1551] (76.65/year)
SACKS, G.E., 1990. Higher recursion theory - all 2 versions ». Springer-Verlag New York, Inc. New York, NY, USA. [Cited by 105] (6.09/year)
SCHRIJVER, A., 1986. Theory of linear and integer programming - all 5 versions ». John Wiley & Sons, Inc. New York, NY, USA. [Cited by 2282] (107.46/year)
SCHWEFEL, H.P., 1981. Numerical Optimization of Computer Models. John Wiley & Sons, Inc. New York, NY, USA. [Cited by 911] (34.72/year)
WEGENER, I., 1987. The complexity of Boolean functions - all 7 versions ». John Wiley & Sons, Inc. New York, NY, USA. [Cited by 516] (25.50/year)
WIDROW, B. and S.D. STEARNS, 1985. Adaptive signal processing - all 5 versions ». Prentice-Hall, Inc. Upper Saddle River, NJ, USA. [Cited by 2768] (124.49/year)