Normal view MARC view ISBD view

Data science solutions with Python : fast and scalable models using Keras, Pyspark Mllib, H2O, XGBoost, and scikit-Learn / Tshepo Chris Nokeri

By: Tshepo, Chris Nokeri.

Contributor(s): Ohio Library and Information Network.

Publisher: [United States] : Apress, ©2022Description: 119 p.Content type: text Media type: computer Carrier type: online resourceISBN: 9781484277614.Subject(s): Machine learning | Python (Computer program language)Genre/Form: Print books.

Contents:

Chapter 1: Understanding Machine Learning and Deep Learning -- Chapter 2: Big Data Frameworks and ML and DL Frameworks -- Chapter 3: The Parametric Method Linear Regression -- Chapter 4: Survival Regression Analysis.-Chapter 5:The Non-Parametric Method - Classification -- Chapter 6:Tree-based Modelling and Gradient Boosting -- Chapter 7: Artificial Neural Networks -- Chapter 8: Cluster Analysis using K-Means -- Chapter 9: Dimension Reduction Principal Components Analysis -- Chapter 10: Automated Machine Learning

Summary: Apply supervised and unsupervised learning to solve practical and real-world big data problems. This book teaches you how to engineer features, optimize hyperparameters, train and test models, develop pipelines, and automate the machine learning (ML) process. The book covers an in-memory, distributed cluster computing framework known as PySpark, machine learning framework platforms known as scikit-learn, PySpark MLlib, H2O, and XGBoost, and a deep learning (DL) framework known as Keras. The book starts off presenting supervised and unsupervised ML and DL models, and then it examines big data frameworks along with ML and DL frameworks. Author Tshepo Chris Nokeri considers a parametric model known as the Generalized Linear Model and a survival regression model known as the Cox Proportional Hazards model along with Accelerated Failure Time (AFT). Also presented is a binary classification model (logistic regression) and an ensemble model (Gradient Boosted Trees). The book introduces DL and an artificial neural network known as the Multilayer Perceptron (MLP) classifier. A way of performing cluster analysis using the K-Means model is covered. Dimension reduction techniques such as Principal Components Analysis and Linear Discriminant Analysis are explored. And automated machine learning is unpacked. This book is for intermediate-level data scientists and machine learning engineers who want to learn how to apply key big data frameworks and ML and DL frameworks. You will need prior knowledge of the basics of statistics, Python programming, probability theories, and predictive analytics. What You Will Learn Understand widespread supervised and unsupervised learning, including key dimension reduction techniques Know the big data analytics layers such as data visualization, advanced statistics, predictive analytics, machine learning, and deep learning Integrate big data frameworks with a hybrid of machine learning frameworks and deep learning frameworks Design, build, test, and validate skilled machine models and deep learning models Optimize model performance using data transformation, regularization, outlier remedying, hyperparameter optimization, and data split ratio alteration

average rating: 0.0 (0 votes)

Holdings ( 1 )
Title notes

Current location	Call number	Status	Date due	Barcode	Item holds
On Shelf	QA76.9 .B45 2022 (Browse shelf)	Available		AU00000000018405

Total holds: 0

Browsing Alfaisal University Shelves , Shelving location: On Shelf Close shelf browser

Previous								Next
Previous	QA76.9.A94 P37 2017 Augmented human : how technology is shaping the new reality /	QA76.9 .A94 P43 2017 Augmented reality : where we will all live /	QA76.9.A94 P47 2021 Augmented reality : unboxing tech's next big thing /	QA76.9 .B45 2022 Data science solutions with Python : fast and scalable models using Keras, Pyspark Mllib, H2O, XGBoost, and scikit-Learn /	QA76.9.B45 A73 2022 Human-centered data science : an introduction /	QA76.9.B45 C38 2021 Big data mining and complexity /	QA76.9.B45 C54 2017 Big Data : How the Information Revolution Is Transforming Our Lives.	Next

Includes index

Available to OhioLINK libraries

Apply supervised and unsupervised learning to solve practical and real-world big data problems. This book teaches you how to engineer features, optimize hyperparameters, train and test models, develop pipelines, and automate the machine learning (ML) process. The book covers an in-memory, distributed cluster computing framework known as PySpark, machine learning framework platforms known as scikit-learn, PySpark MLlib, H2O, and XGBoost, and a deep learning (DL) framework known as Keras. The book starts off presenting supervised and unsupervised ML and DL models, and then it examines big data frameworks along with ML and DL frameworks. Author Tshepo Chris Nokeri considers a parametric model known as the Generalized Linear Model and a survival regression model known as the Cox Proportional Hazards model along with Accelerated Failure Time (AFT). Also presented is a binary classification model (logistic regression) and an ensemble model (Gradient Boosted Trees). The book introduces DL and an artificial neural network known as the Multilayer Perceptron (MLP) classifier. A way of performing cluster analysis using the K-Means model is covered. Dimension reduction techniques such as Principal Components Analysis and Linear Discriminant Analysis are explored. And automated machine learning is unpacked. This book is for intermediate-level data scientists and machine learning engineers who want to learn how to apply key big data frameworks and ML and DL frameworks. You will need prior knowledge of the basics of statistics, Python programming, probability theories, and predictive analytics. What You Will Learn Understand widespread supervised and unsupervised learning, including key dimension reduction techniques Know the big data analytics layers such as data visualization, advanced statistics, predictive analytics, machine learning, and deep learning Integrate big data frameworks with a hybrid of machine learning frameworks and deep learning frameworks Design, build, test, and validate skilled machine models and deep learning models Optimize model performance using data transformation, regularization, outlier remedying, hyperparameter optimization, and data split ratio alteration

Alfaisal Library

Data science solutions with Python : fast and scalable models using Keras, Pyspark Mllib, H2O, XGBoost, and scikit-Learn / Tshepo Chris Nokeri

By: Tshepo, Chris Nokeri.

Contributor(s): Ohio Library and Information Network.

Browsing Alfaisal University Shelves , Shelving location: On Shelf Close shelf browser