Description: Survey of ideas, methods, and tools for analyzing large data sets. Topics from supervised and unsupervised learning include penalized regression and classification, support vector machines, kernel methods, model selection, matrix factorizations and completion, graphical models, clustering, boosting and ensemble learning. STAT 444 will have applied assignments and exams focusing on data analysis. Cross-list: STAT 640. Mutually Exclusive: Cannot register for STAT 444 if student has credit for STAT 640.