Outline of machine learning

The following outline is provided as an overview of and topical guide to machine learning.

Machine learning – subfield of computer science[1] (more specifically, soft computing) that evolved from the study of pattern recognition and computational learning theory in artificial intelligence.[1] In 1959, Arthur Samuel defined machine learning as a "field of study that gives computers the ability to learn without being explicitly programmed".[2] Machine learning explores the study and construction of algorithms that can learn from data and make predictions on it.[3] Such algorithms build a model from an example training set of input observations and use it to make data-driven predictions or decisions, expressed as outputs, rather than following strictly static program instructions.
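
To make the idea of building a model from a training set concrete, here is a minimal sketch in Python. It assumes the simplest possible model, a straight line fitted by least squares, and the training data and the function names `fit_line` and `predict` are hypothetical, chosen only for illustration.

```python
from statistics import mean


def fit_line(xs, ys):
    """Learn (slope, intercept) from example input/output pairs by least squares."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict(model, x):
    """Apply the learned model to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical training set: the outputs happen to follow y = 2x + 1.
train_x = [0.0, 1.0, 2.0, 3.0]
train_y = [1.0, 3.0, 5.0, 7.0]

model = fit_line(train_x, train_y)   # the rule is learned, not hard-coded
prediction = predict(model, 10.0)    # prediction for an input not in the training set
print(model, prediction)
```

The relationship between input and output is never written into the program; it is recovered from the examples, which is the contrast with static program instructions drawn above.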

1 What type of thing is machine learning?

An academic discipline
A branch of science
- An applied science
  - A subfield of computer science
  - A branch of artificial intelligence
  - A subfield of soft computing

2 Branches of machine learning

2.1 Subfields of machine learning

Computational learning theory – studying the design and analysis of machine learning algorithms.[4]
Grammar induction
Meta learning

2.2 Cross-disciplinary fields involving machine learning

Adversarial machine learning
Predictive analytics
Quantum machine learning
Robot learning
- Developmental robotics

3 Applications of machine learning

Biomedical informatics
Computer vision
Customer relationship management
Data mining
Email filtering
Inverted pendulum – balance and equilibrium system.
Natural language processing (NLP)
- Automatic summarization
- Automatic taxonomy construction
- Dialog system
- Grammar checker
- Language recognition
  - Handwriting recognition
  - Optical character recognition
  - Speech recognition
- Machine translation
- Question answering
- Speech synthesis
- Text mining
  - Term frequency–inverse document frequency (tf–idf)
- Text simplification
Pattern recognition
- Facial recognition system
- Handwriting recognition
- Image recognition
- Optical character recognition
- Speech recognition
Recommendation system
- Collaborative filtering
- Content-based filtering
- Hybrid recommender systems (Collaborative and content-based filtering)
Search engine
- Search engine optimization

4 Machine learning hardware

Graphics processing unit
Tensor processing unit
Vision processing unit

5 Machine learning tools

Comparison of deep learning software
- Comparison of deep learning software/Resources

5.1 Machine learning frameworks

Proprietary machine learning frameworks
- Amazon Machine Learning
- Microsoft Azure Machine Learning Studio
- DistBelief – replaced by TensorFlow
Open source machine learning frameworks
- Apache Singa
- Caffe
- H2O
- PyTorch
- mlpack
- TensorFlow
- Torch
- CNTK
- Accord.Net

5.2 Machine learning libraries

Deeplearning4j
Theano
Scikit-learn

5.3 Machine learning algorithms

Almeida–Pineda recurrent backpropagation
ALOPEX
Backpropagation
Bootstrap aggregating
CN2 algorithm
Constructing skill trees
Dehaene–Changeux model
Diffusion map
Dominance-based rough set approach
Dynamic time warping
Error-driven learning
Evolutionary multimodal optimization
Expectation–maximization algorithm
FastICA
Forward–backward algorithm
GeneRec
Genetic Algorithm for Rule Set Production
Growing self-organizing map
HEXQ
Hyper basis function network
IDistance
K-nearest neighbors algorithm
Kernel methods for vector output
Kernel principal component analysis
Leabra
Linde–Buzo–Gray algorithm
Local outlier factor
Logic learning machine
LogitBoost
Manifold alignment
Minimum redundancy feature selection
Mixture of experts
Multiple kernel learning
Non-negative matrix factorization
Online machine learning
Out-of-bag error
Prefrontal cortex basal ganglia working memory
PVLV
Q-learning
Quadratic unconstrained binary optimization
Query-level feature
Quickprop
Radial basis function network
Randomized weighted majority algorithm
Reinforcement learning
Repeated incremental pruning to produce error reduction (RIPPER)
Rprop
Rule-based machine learning
Skill chaining
Sparse PCA
State–action–reward–state–action
Stochastic gradient descent
Structured kNN
T-distributed stochastic neighbor embedding
Temporal difference learning
Wake-sleep algorithm
Weighted majority algorithm (machine learning)

6 Machine learning methods

Instance-based algorithm
- K-nearest neighbors algorithm (KNN)
- Learning vector quantization (LVQ)
- Self-organizing map (SOM)
Regression analysis
- Logistic regression
- Ordinary least squares regression (OLSR)
- Linear regression
- Stepwise regression
- Multivariate adaptive regression splines (MARS)
Regularization algorithm
- Ridge regression
- Least Absolute Shrinkage and Selection Operator (LASSO)
- Elastic net
- Least-angle regression (LARS)
Classifiers
- Probabilistic classifier
  - Naive Bayes classifier
- Binary classifier
- Linear classifier
- Hierarchical classifier

6.1 Dimensionality reduction

Canonical correlation analysis (CCA)
Factor analysis
Feature extraction
Feature selection
Independent component analysis (ICA)
Linear discriminant analysis (LDA)
Multidimensional scaling (MDS)
Non-negative matrix factorization (NMF)
Partial least squares regression (PLSR)
Principal component analysis (PCA)
Principal component regression (PCR)
Projection pursuit
Sammon mapping
t-distributed stochastic neighbor embedding (t-SNE)

6.2 Ensemble learning

AdaBoost
Boosting
Bootstrap aggregating (Bagging)
Ensemble averaging – process of creating multiple models and combining them to produce a desired output, as opposed to creating just one model. Frequently an ensemble of models performs better than any individual model, because the various errors of the models "average out."
Gradient boosted decision tree (GBDT)
Gradient boosting machine (GBM)
Random Forest
Stacked Generalization (blending)
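
The "averaging out" of errors described under ensemble averaging can be sketched as follows. This is an illustration under stated assumptions, not a real training pipeline: each of the ten "models" is simulated as the true target plus independent Gaussian noise, and the names `mse`, `model_predictions` and `ensemble_prediction` are hypothetical.

```python
import random


def mse(predictions, targets):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
targets = [0.1 * i for i in range(200)]

# Assumption: ten simulated models, each equal to the target plus its own
# independent noise. Real models would be trained, not simulated.
n_models = 10
model_predictions = [
    [t + rng.gauss(0.0, 1.0) for t in targets] for _ in range(n_models)
]

# Ensemble averaging: the combined prediction is the mean of the individual ones.
ensemble_prediction = [sum(column) / n_models for column in zip(*model_predictions)]

individual_mses = [mse(p, targets) for p in model_predictions]
ensemble_mse = mse(ensemble_prediction, targets)
print(min(individual_mses), ensemble_mse)
```

The benefit depends on the errors being largely independent. If all models make the same mistakes, averaging them changes nothing.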

6.3 Meta learning

Inductive bias
Metadata

6.4 Reinforcement learning

Q-learning
State–action–reward–state–action (SARSA)
Temporal difference learning (TD)
Learning Automata

6.5 Supervised learning

Supervised learning
- AODE
- Artificial neural network
- Association rule learning algorithms
- Apriori algorithm
- Eclat algorithm
- Case-based reasoning
- Gaussian process regression
- Gene expression programming
- Group method of data handling (GMDH)
- Inductive logic programming
- Instance-based learning
- Lazy learning
- Learning Automata
- Learning Vector Quantization
- Logistic Model Tree
- Minimum message length (decision trees, decision graphs, etc.)
- Nearest Neighbor Algorithm
- Analogical modeling
- Probably approximately correct learning (PAC) learning
- Ripple down rules, a knowledge acquisition methodology
- Symbolic machine learning algorithms
- Support vector machines
- Random Forests
- Ensembles of classifiers
- Bootstrap aggregating (bagging)
- Boosting (meta-algorithm)
- Ordinal classification
- Information fuzzy networks (IFN)
- Conditional Random Field
- ANOVA
- Quadratic classifiers
- k-nearest neighbor
- Boosting
- SPRINT
- Bayesian networks
- Naive Bayes
- Hidden Markov models
- Hierarchical hidden Markov model
Bayesian statistics
- Bayesian knowledge base
- Naive Bayes
- Gaussian Naive Bayes
- Multinomial Naive Bayes
- Averaged One-Dependence Estimators (AODE)
- Bayesian Belief Network (BBN)
- Bayesian Network (BN)
Decision tree algorithms
- Decision tree
- Classification and regression tree (CART)
- Iterative Dichotomiser 3 (ID3)
- C4.5 algorithm
- C5.0 algorithm
- Chi-squared Automatic Interaction Detection (CHAID)
- Decision stump
- Conditional decision tree
- ID3 algorithm
- Random forest
- SLIQ
Linear classifier
- Fisher's linear discriminant
- Linear regression
- Logistic regression
- Multinomial logistic regression
- Naive Bayes classifier
- Perceptron
- Support vector machine

6.6 Unsupervised learning

Unsupervised learning
- Expectation-maximization algorithm
- Vector Quantization
- Generative topographic map
- Information bottleneck method
Artificial neural networks
- Feedforward neural network
- Extreme learning machine
- Convolutional neural network
- Recurrent neural network
- Long short-term memory (LSTM)
- Logic learning machine
- Self-organizing map
Association rule learning
- Apriori algorithm
- Eclat algorithm
- FP-growth algorithm
Hierarchical clustering
- Single-linkage clustering
- Conceptual clustering
Cluster analysis
- BIRCH
- DBSCAN
- Expectation-maximization (EM)
- Fuzzy clustering
- Hierarchical Clustering
- K-means clustering
- K-medians
- Mean-shift
- OPTICS algorithm
Anomaly detection
- k-nearest neighbors classification (k-NN)
- Local outlier factor

6.7 Semi-supervised learning

Active learning – special case of semi-supervised learning in which a learning algorithm is able to interactively query the user (or some other information source) to obtain the desired outputs at new data points.[5][6]
Generative models
Low-density separation
Graph-based methods
Co-training
Transduction
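
The interactive querying that defines active learning, the first entry above, can be sketched in Python. This is a toy illustration under strong assumptions: the data are one-dimensional and perfectly separable by a single threshold, the `oracle` function stands in for the user, and its hidden threshold of 0.625 is hypothetical. All names are chosen only for illustration.

```python
def oracle(x):
    """Stand-in for the user or other information source (hypothetical threshold)."""
    return 1 if x >= 0.625 else 0


def active_learn_threshold(pool, query, budget):
    """Estimate a 1-D decision threshold, querying labels only where uncertain.

    Assumes labels are 0 below some threshold and 1 at or above it.
    Returns (estimated_threshold, number_of_queries_used).
    """
    pool = sorted(pool)
    low, high = -1, len(pool)   # last index known to be 0, first index known to be 1
    queries = 0
    while high - low > 1 and queries < budget:
        mid = (low + high) // 2  # the point whose label is least determined so far
        label = query(pool[mid])
        queries += 1
        if label == 1:
            high = mid
        else:
            low = mid
    if low < 0:                  # every queried point was labelled 1
        return pool[0], queries
    if high >= len(pool):        # every queried point was labelled 0
        return pool[-1], queries
    return (pool[low] + pool[high]) / 2, queries


pool = [i / 100 for i in range(101)]   # 101 unlabelled points in [0, 1]
estimate, queries_used = active_learn_threshold(pool, oracle, budget=20)
print(estimate, queries_used)
```

Labelling the whole pool would take 101 queries. Choosing each query where the learner is most uncertain locates the boundary with only a handful, which is the motivation for active learning when labels are expensive.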

6.8 Deep learning

Deep belief networks
Deep Convolutional neural networks
Deep Recurrent neural networks
Hierarchical temporal memory
Generative Adversarial Networks
Deep Boltzmann Machine (DBM)
Stacked Auto-Encoders

6.9 Other machine learning methods and problems

Anomaly detection
Association rules
Bias-variance dilemma
Classification
- Multi-label classification
Clustering
Data Pre-processing
Empirical risk minimization
Feature engineering
Feature learning
Learning to rank
Occam learning
Online machine learning
PAC learning
Regression
Reinforcement Learning
Semi-supervised learning
Statistical learning
Structured prediction
- Graphical models
  - Bayesian network
  - Conditional random field (CRF)
  - Hidden Markov model (HMM)
Unsupervised learning
VC theory

7 Machine learning research

List of artificial intelligence projects
List of datasets for machine learning research

8 History of machine learning

Timeline of machine learning

9 Machine learning projects

DeepMind
Google Brain

10 Machine learning organizations

Knowledge Engineering and Machine Learning Group

10.1 Machine learning conferences and workshops

Artificial Intelligence and Security (AISec) (co-located workshop with CCS)
Conference on Neural Information Processing Systems (NIPS)
ECML PKDD
International Conference on Machine Learning (ICML)

11 Machine learning publications

11.1 Books on machine learning

11.2 Machine learning journals

Machine Learning
Journal of Machine Learning Research (JMLR)
Neural Computation

12 Persons influential in machine learning

Alberto Broggi
Andrei Knyazev
Andrew McCallum
Andrew Ng
Anuraag Jain
Armin B. Cremers
Ayanna Howard
Barney Pell
Ben Goertzel
Ben Taskar
Bernhard Schölkopf
Brian D. Ripley
Christopher G. Atkeson
Corinna Cortes
Demis Hassabis
Douglas Lenat
Eric Xing
Ernst Dickmanns
Geoffrey Hinton – co-inventor of the backpropagation and contrastive divergence training algorithms
Hans-Peter Kriegel
Hartmut Neven
Heikki Mannila
Ian Goodfellow – inventor of generative adversarial networks[7]
Jacek M. Zurada
Jaime Carbonell
Jeremy Slovak
Jerome H. Friedman
John D. Lafferty
John Platt – invented SMO and Platt scaling
Julie Beth Lovins
Jürgen Schmidhuber
Karl Steinbuch
Katia Sycara
Leo Breiman – invented bagging and random forests
Lise Getoor
Luca Maria Gambardella
Léon Bottou
Marcus Hutter
Mehryar Mohri
Michael Collins
Michael I. Jordan
Michael L. Littman
Nando de Freitas
Ofer Dekel
Oren Etzioni
Pedro Domingos
Peter Flach
Pierre Baldi
Pushmeet Kohli
Ray Kurzweil
Rayid Ghani
Ross Quinlan
Salvatore J. Stolfo
Sebastian Thrun
Selmer Bringsjord
Sepp Hochreiter
Shane Legg
Siraj Raval
Stephen Muggleton
Steve Omohundro
Tom M. Mitchell
Trevor Hastie
Vasant Honavar
Vladimir Vapnik – co-inventor of the SVM and VC theory
Yann LeCun – invented convolutional neural networks
Yasuo Matsuyama
Yoshua Bengio
Zoubin Ghahramani

13 See also

Outline of artificial intelligence
- Outline of computer vision
- Outline of natural language processing
Outline of robotics
Accuracy paradox
Action model learning
Activation function
Activity recognition
ADALINE
Adaptive neuro fuzzy inference system
Adaptive resonance theory
Additive smoothing
Adjusted mutual information
Aika (software)
AIVA
AIXI
AlchemyAPI
AlexNet
Algorithm selection
Algorithmic inference
Algorithmic learning theory
AlphaGo
AlphaGo Zero
Alternating decision tree
Apprenticeship learning
Causal Markov condition
Competitive learning
Concept learning
Decision tree learning
Distribution learning theory
Eager learning
End-to-end reinforcement learning
Error tolerance (PAC learning)
Explanation-based learning
Feature
GloVe
Hyperparameter
IBM Machine Learning Hub
Inferential theory of learning
Learning automata
Learning classifier system
Learning rule
Learning with errors
M-Theory (learning framework)
Machine learning control
Machine learning in bioinformatics
Margin
Markov chain geostatistics
Markov chain Monte Carlo (MCMC)
Markov information source
Markov logic network
Markov model
Markov random field
Markovian discrimination
Maximum-entropy Markov model
Multi-armed bandit
Multi-task learning
Multilinear subspace learning
Multimodal learning
Multiple-instance learning
Never-Ending Language Learning
Offline learning
Parity learning
Population-based incremental learning
Predictive learning
Preference learning
Proactive learning
Proximal gradient methods for learning
Semantic analysis
Similarity learning
Sparse dictionary learning
Stability (learning theory)
Statistical learning theory
Statistical relational learning
Tanagra
Transfer learning
Variable-order Markov model
Version space learning
Waffles
Weka
Loss function
- Loss functions for classification
- Mean squared error (MSE)
- Mean squared prediction error (MSPE)
- Taguchi loss function
Low-energy adaptive clustering hierarchy

13.1 Other

Anne O'Tate
Ant colony optimization algorithms
Anthony Levandowski
Anti-unification (computer science)
Apache Flume
Apache Giraph
Apache Mahout
Apache SINGA
Apache Spark
Apache SystemML
Aphelion (software)
Arabic Speech Corpus
Archetypal analysis
Arthur Zimek
Artificial ants
Artificial bee colony algorithm
Artificial development
Artificial immune system
Astrostatistics
Averaged one-dependence estimators
Bag-of-words model
Balanced clustering
Ball tree
Base rate
Bat algorithm
Baum–Welch algorithm
Bayesian hierarchical modeling
Bayesian interpretation of kernel regularization
Bayesian optimization
Bayesian structural time series
Bees algorithm
Behavioral clustering
Bernoulli scheme
Bias–variance tradeoff
Biclustering
Binarization of consensus partition matrices
Binary classification
Bing Predicts
Bio-inspired computing
Biogeography-based optimization
Biplot
Bondy's theorem
Bongard problem
Bradley–Terry model
BrownBoost
Brown clustering
Burst error
CBCL (MIT)
CIML community portal
CMA-ES
CURE data clustering algorithm
Cache language model
Calibration (statistics)
Canonical correspondence analysis
Canopy clustering algorithm
Cascading classifiers
Category utility
CellCognition
Cellular evolutionary algorithm
Chi-square automatic interaction detection
Chromosome (genetic algorithm)
Classifier chains
Cleverbot
Clonal selection algorithm
Cluster-weighted modeling
Clustering high-dimensional data
Clustering illusion
CoBoosting
Cobweb (clustering)
Cognitive computer
Cognitive robotics
Collostructional analysis
Common-method variance
Complete-linkage clustering
Computer-automated design
Concept class
Concept drift
Conference on Artificial General Intelligence
Conference on Knowledge Discovery and Data Mining
Confirmatory factor analysis
Confusion matrix
Congruence coefficient
Connect (computer system)
Consensus clustering
Constrained clustering
Constrained conditional model
Constructive cooperative coevolution
Correlation clustering
Correspondence analysis
Cortica
Coupled pattern learner
Cross-entropy method
Cross-validation (statistics)
Crossover (genetic algorithm)
Cuckoo search
Cultural algorithm
Cultural consensus theory
Curse of dimensionality
DADiSP
DARPA LAGR Program
Darkforest
Dartmouth workshop
DarwinTunes
Data Mining Extensions
Data exploration
Data pre-processing
Data stream clustering
Dataiku
Davies–Bouldin index
Decision boundary
Decision list
Decision tree model
Deductive classifier
DeepArt
DeepDream
Deep Web Technologies
Defining length
Dendrogram
Dependability state model
Detailed balance
Determining the number of clusters in a data set
Detrended correspondence analysis
Developmental robotics
Diffbot
Differential evolution
Discrete phase-type distribution
Discriminative model
Dissociated press
Distributed R
Dlib
Document classification
Documenting Hate
Domain adaptation
Doubly stochastic model
Dual-phase evolution
Dunn index
Dynamic Bayesian network
Dynamic Markov compression
Dynamic topic model
Dynamic unobserved effects model
EDLUT
ELKI
Edge recombination operator
Effective fitness
Elastic map
Elastic matching
Elbow method (clustering)
Emergent (software)
Encog
Entropy rate
Erkki Oja
Eurisko
European Conference on Artificial Intelligence
Evaluation of binary classifiers
Evolution strategy
Evolution window
Evolutionary Algorithm for Landmark Detection
Evolutionary algorithm
Evolutionary art
Evolutionary music
Evolutionary programming
Evolvability (computer science)
Evolved antenna
Evolver (software)
Evolving classification function
Expectation propagation
Exploratory factor analysis
F1 score
FLAME clustering
Factor analysis of mixed data
Factor graph
Factor regression model
Factored language model
Farthest-first traversal
Fast-and-frugal trees
Feature Selection Toolbox
Feature hashing
Feature scaling
Feature vector
Firefly algorithm
First-difference estimator
First-order inductive learner
Fish School Search
Fisher kernel
Fitness approximation
Fitness function
Fitness proportionate selection
Fluentd
Folding@home
Formal concept analysis
Forward algorithm
Fowlkes–Mallows index
Frederick Jelinek
Frrole
Functional principal component analysis
GATTO
GLIMMER
Gary Bryce Fogel
Gaussian adaptation
Gaussian process
Gaussian process emulator
Gene prediction
General Architecture for Text Engineering
Generalization error
Generalized canonical correlation
Generalized filtering
Generalized iterative scaling
Generalized multidimensional scaling
Generative adversarial network
Generative model
Genetic algorithm
Genetic algorithm scheduling
Genetic algorithms in economics
Genetic fuzzy systems
Genetic memory (computer science)
Genetic operator
Genetic programming
Genetic representation
Geographical cluster
Gesture Description Language
Geworkbench
Glossary of artificial intelligence
Glottochronology
Golem (ILP)
Google matrix
Grafting (decision trees)
Gramian matrix
Grammatical evolution
Granular computing
GraphLab
Graph kernel
Gremlin (programming language)
Growth function
HUMANT (HUManoid ANT) algorithm
Hammersley–Clifford theorem
Harmony search
Hebbian theory
Hidden Markov random field
Hidden semi-Markov model
Hierarchical hidden Markov model
Higher-order factor analysis
Highway network
Hinge loss
Holland's schema theorem
Hopkins statistic
Hoshen–Kopelman algorithm
Huber loss
IRCF360
Ian Goodfellow
Ilastik
Ilya Sutskever
Immunocomputing
Imperialist competitive algorithm
Inauthentic text
Incremental decision tree
Induction of regular languages
Inductive bias
Inductive probability
Inductive programming
Influence diagram
Information Harvesting
Information fuzzy networks
Information gain in decision trees
Information gain ratio
Inheritance (genetic algorithm)
Instance selection
Intel RealSense
Interacting particle system
Interactive machine translation
International Joint Conference on Artificial Intelligence
International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics
International Semantic Web Conference
Iris flower data set
Island algorithm
Isotropic position
Item response theory
Iterative Viterbi decoding
JOONE
Jabberwacky
Jaccard index
Jackknife variance estimates for random forest
Java Grammatical Evolution
Joseph Nechvatal
Jubatus
Julia (programming language)
Junction tree algorithm
K-SVD
K-means++
K-medians clustering
K-medoids
KNIME
KXEN Inc.
K q-flats
Kaggle
Kalman filter
Katz's back-off model
Keras
Kernel adaptive filter
Kernel density estimation
Kernel eigenvoice
Kernel embedding of distributions
Kernel method
Kernel perceptron
Kernel random forest
Kinect
Klaus-Robert Müller
Kneser–Ney smoothing
Knowledge Vault
Knowledge integration
LIBSVM
LPBoost
Labeled data
LanguageWare
Language Acquisition Device (computer)
Language identification in the limit
Language model
Large margin nearest neighbor
Latent Dirichlet allocation
Latent class model
Latent semantic analysis
Latent variable
Latent variable model
Lattice Miner
Layered hidden Markov model
Learnable function class
Least squares support vector machine
Leave-one-out error
Leslie P. Kaelbling
Linear genetic programming
Linear predictor function
Linear separability
Lingyun Gu
Linkurious
Lior Ron (business executive)
List of genetic algorithm applications
List of metaphor-based metaheuristics
List of text mining software
Local case-control sampling
Local independence
Local tangent space alignment
Locality-sensitive hashing
Log-linear model
Logistic model tree
Low-rank approximation
Low-rank matrix approximations
MATLAB
MIMIC (immunology)
MXNet
Mallet (software project)
Manifold regularization
Margin-infused relaxed algorithm
Margin classifier
Mark V. Shaney
Massive Online Analysis
Matrix regularization
Matthews correlation coefficient
Mean shift
Mean squared error
Mean squared prediction error
Measurement invariance
Medoid
MeeMix
Melomics
Memetic algorithm
Meta-optimization
Mexican International Conference on Artificial Intelligence
Michael Kearns (computer scientist)
MinHash
Mixture model
Mlpy
Models of DNA evolution
Moral graph
Mountain car problem
Movidius
Multi-armed bandit
Multi-label classification
Multi expression programming
Multiclass classification
Multidimensional analysis
Multifactor dimensionality reduction
Multilinear principal component analysis
Multiple correspondence analysis
Multiple discriminant analysis
Multiple factor analysis
Multiple sequence alignment
Multiplicative weight update method
Multispectral pattern recognition
Mutation (genetic algorithm)
MysteryVibe
N-gram
NOMINATE (scaling method)
Native-language identification
Natural Language Toolkit
Natural evolution strategy
Nearest-neighbor chain algorithm
Nearest centroid classifier
Nearest neighbor search
Neighbor joining
Nest Labs
NetMiner
NetOwl
Neural Designer
Neural Engineering Object
Neural Lab
Neural modeling fields
Neural network software
NeuroSolutions
Neuro Laboratory
Neuroevolution
Neuroph
Niki.ai
Noisy channel model
Noisy text analytics
Nonlinear dimensionality reduction
Novelty detection
Nuisance variable
Numenta
One-class classification
ONNX
OpenNLP
Optimal discriminant analysis
Oracle Data Mining
Orange (software)
Ordination (statistics)
Overfitting
PROGOL
PSIPRED
Pachinko allocation
PageRank
Parallel metaheuristic
Parity benchmark
Part-of-speech tagging
Particle swarm optimization
Path dependence
Pattern language (formal languages)
Peltarion Synapse
Perplexity
Persian Speech Corpus
Picas (app)
Pietro Perona
Pipeline Pilot
Piranha (software)
Pitman–Yor process
Plate notation
Polynomial kernel
Pop music automation
Population process
Portable Format for Analytics
Predictive Model Markup Language
Predictive state representation
Preference regression
Premature convergence
Principal geodesic analysis
Prior knowledge for pattern recognition
Prisma (app)
Probabilistic Action Cores
Probabilistic context-free grammar
Probabilistic latent semantic analysis
Probabilistic soft logic
Probability matching
Probit model
Product of experts
Programming with Big Data in R
Proper generalized decomposition
Pruning (decision trees)
Pushpak Bhattacharyya
Q methodology
Qloo
Quality control and genetic algorithms
Quantum Artificial Intelligence Lab
Queueing theory
Quick, Draw!
R (programming language)
Rada Mihalcea
Rademacher complexity
Radial basis function kernel
Rand index
Random indexing
Random projection
Random subspace method
Ranking SVM
RapidMiner
Rattle GUI
Raymond Cattell
Reasoning system
Regularization perspectives on support vector machines
Relational data mining
Relationship square
Relevance vector machine
Relief (feature selection)
Renjin
Repertory grid
Representer theorem
Reward-based selection
Richard Zemel
Right to explanation
RoboEarth
Robust principal component analysis
RuleML Symposium
Rule induction
Rules extraction system family
SAS (software)
SNNS
SPSS Modeler
SUBCLU
Sample complexity
Sample exclusion dimension
Santa Fe Trail problem
Savi Technology
Schema (genetic algorithms)
Search-based software engineering
Selection (genetic algorithm)
Self-Service Semantic Suite
Semantic folding
Semantic mapping (statistics)
Semidefinite embedding
Sense Networks
Sensorium Project
Sequence labeling
Sequential minimal optimization
Shattered set
Shogun (toolbox)
Silhouette (clustering)
SimHash
SimRank
Similarity measure
Simple matching coefficient
Simultaneous localization and mapping
Sinkov statistic
Sliced inverse regression
SmartMatch
Snakes and Ladders
Soft independent modelling of class analogies
Soft output Viterbi algorithm
Solomonoff's theory of inductive inference
SolveIT Software
Spectral clustering
Spike-and-slab variable selection
Statistical machine translation
Statistical parsing
Statistical semantics
Stefano Soatto
Stephen Wolfram
Stochastic block model
Stochastic cellular automaton
Stochastic diffusion search
Stochastic grammar
Stochastic matrix
Stochastic universal sampling
Stress majorization
String kernel
Structural equation modeling
Structural risk minimization
Structured sparsity regularization
Structured support vector machine
Subclass reachability
Sufficient dimension reduction
Sukhotin's algorithm
Sum of absolute differences
Sum of absolute transformed differences
Swarm intelligence
Switching Kalman filter
Symbolic regression
Synchronous context-free grammar
Syntactic pattern recognition
TD-Gammon
TIMIT
Teaching dimension
Teuvo Kohonen
Textual case-based reasoning
Theory of conjoint measurement
Thomas G. Dietterich
Thurstonian model
Topic model
Tournament selection
Training, test, and validation sets
Transiogram
Trax Image Recognition
Trigram tagger
Truncation selection
Tucker decomposition
UIMA
UPGMA
Ugly duckling theorem
Uncertain data
Uniform convergence in probability
Unique negative dimension
Universal portfolio algorithm
User behavior analytics
VC dimension
VIGRA
Validation set
Vapnik–Chervonenkis theory
Variable-order Bayesian network
Variable kernel density estimation
Variable rules analysis
Variational message passing
Varimax rotation
Vector quantization
Vicarious (company)
Viterbi algorithm
Vowpal Wabbit
WACA clustering algorithm
WPGMA
Ward's method
Weasel program
Whitening transformation
Winnow (algorithm)
Win–stay, lose–switch
Witness set
Wolfram Language
Wolfram Mathematica
Writer invariant
XGBoost
Yooreeka
Zeroth (software)

14 Further reading

Trevor Hastie, Robert Tibshirani and Jerome H. Friedman (2001). The Elements of Statistical Learning, Springer. ISBN 0-387-95284-5.
Pedro Domingos (September 2015). The Master Algorithm, Basic Books. ISBN 978-0-465-06570-7.
Mehryar Mohri, Afshin Rostamizadeh, Ameet Talwalkar (2012). Foundations of Machine Learning, The MIT Press. ISBN 978-0-262-01825-8.
Ian H. Witten and Eibe Frank (2011). Data Mining: Practical machine learning tools and techniques, Morgan Kaufmann, 664 pp. ISBN 978-0-12-374856-0.
David J. C. MacKay (2003). Information Theory, Inference, and Learning Algorithms, Cambridge: Cambridge University Press. ISBN 0-521-64298-1.
Richard O. Duda, Peter E. Hart, David G. Stork (2001). Pattern classification (2nd edition), Wiley, New York. ISBN 0-471-05669-3.
Christopher Bishop (1995). Neural Networks for Pattern Recognition, Oxford University Press. ISBN 0-19-853864-2.
Vladimir Vapnik (1998). Statistical Learning Theory, Wiley-Interscience. ISBN 0-471-03003-1.
Ray Solomonoff (1957). An Inductive Inference Machine, IRE Convention Record, Section on Information Theory, Part 2, pp. 56–62.
Ray Solomonoff. "An Inductive Inference Machine". A privately circulated report from the 1956 Dartmouth Summer Research Conference on AI.

15 References

[1] http://www.britannica.com/EBchecked/topic/1116194/machine-learning
[2] Phil Simon (March 18, 2013). Too Big to Ignore: The Business Case for Big Data. Wiley. p. 89. ISBN 978-1-118-63817-0.
[3] Ron Kohavi; Foster Provost (1998). "Glossary of terms". Machine Learning. 30: 271–274.
[4] http://www.learningtheory.org/
[5] Settles, Burr (2010), "Active Learning Literature Survey" (PDF), Computer Sciences Technical Report 1648. University of Wisconsin–Madison, retrieved 2014-11-18.
[6] Rubens, Neil; Elahi, Mehdi; Sugiyama, Masashi; Kaplan, Dain (2016). "Active Learning in Recommender Systems". In Ricci, Francesco; Rokach, Lior; Shapira, Bracha. Recommender Systems Handbook (2 ed.). Springer US. doi:10.1007/978-1-4899-7637-6. ISBN 978-1-4899-7637-6.
[7] https://en.wikipedia.org/wiki/Generative_adversarial_network#cite_note-GANs-1

16 External links

Data Science: Data to Insights from MIT (machine learning)
Popular online course by Andrew Ng, at Coursera. It uses GNU Octave. The course is a free version of Stanford University's actual course taught by Ng, which is available for free at see.stanford.edu/Course/CS229.
mloss is an academic database of open-source machine learning software.