Build and deploy high-performance deep learning models using C++ for real-time applications where speed and efficiency matter.
Free with your book: DRM-free PDF version + access to Packt's next-gen Reader*
Deep learning systems often struggle to meet performance demands in real-time and production environments. This book shows you how to build high-performance deep learning systems in C++, enabling efficient and scalable artificial intelligence (AI) in resource-constrained environments where performance matters.
You’ll start by setting up a complete C++ deep learning environment and implementing core neural networks from scratch. As you progress, you’ll build advanced architectures, including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory Networks (LSTMs), Generative Adversarial Networks (GANs), and Transformers, using C++, CUDA, and PyTorch’s C++ API. The book then focuses on model quantization and compression. It will guide you through the model deployment process in production with robust monitoring and explainability. You’ll also explore distributed training and techniques for real-time inference in performance-critical domains.
By the end of this book, you’ll be able to design, optimize, and deploy deep learning systems in C++ that are production-ready, scalable, and efficient across multiple industries.
*Email sign-up and proof of purchase required
This book is for ML engineers, deep learning practitioners, and data scientists with a C++ background who want to build or learn about high-performance deep learning models. It also serves developers transitioning from Python-based frameworks looking for real-time deployment solutions in industries like finance, autonomous systems, and healthcare.
"synopsis" may belong to another edition of this title.
Bill Chen is a machine learning engineer at Meta specializing in deep learning, CUDA, and C++. He holds a PhD in Bioinformatics from the University of Kentucky and has worked in both production and instructional roles in applied AI. He has taught at the NVIDIA Deep Learning Institute, earned the NVIDIA-Certified Associate: Generative AI Multimodal credential, and served as part-time machine learning faculty at UCSC Silicon Valley Extension. His work includes Facebook group search modeling and surgical duration prediction. In this book, he combines industry experience and teaching to guide readers in building high-performance deep learning systems in C++.
Vikash Gupta Ph.D., is a Senior Solutions Architect at Amazon Web Services (AWS), based in Seattle, Washington. He earned his Ph.D. in Computational Biology from INRIA, France, where his research centered on neuroimaging and statistical modeling. At AWS, he applies deep learning and artificial intelligence to advance medical imaging technologies, contributing to open-source initiatives such as the MONAI framework for healthcare. He also served as a research scientist at The Ohio State University and as an Assistant Professor at Mayo Clinic. He has authored more than 60 peer-reviewed publications.
"About this title" may belong to another edition of this title.
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 53476654-n
Seller: California Books, Miami, FL, U.S.A.
Condition: New. Seller Inventory # I-9781835880029
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 53476654
Seller: PBShop.store US, Wood Dale, IL, U.S.A.
PAP. Condition: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781835880029
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781835880029
Quantity: Over 20 available
Seller: BargainBookStores, Grand Rapids, MI, U.S.A.
Paperback or Softback. Condition: New. Deep Learning with C++: Design and deploy neural networks using CUDA for high-performance AI in C++. Book. Seller Inventory # BBS-9781835880029
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: New. Seller Inventory # 53476654-n
Quantity: Over 20 available
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: As New. Unread book in perfect condition. Seller Inventory # 53476654
Quantity: Over 20 available
Seller: preigu, Osnabrück, Germany
Taschenbuch. Condition: Neu. Deep Learning with C++ | Design and deploy neural networks using CUDA for high-performance AI in C++ | Bill Chen (u. a.) | Taschenbuch | Englisch | 2026 | Packt Publishing | EAN 9781835880029 | Verantwortliche Person für die EU: Libri GmbH, Europaallee 1, 36244 Bad Hersfeld, gpsr[at]libri[dot]de | Anbieter: preigu Print on Demand. Seller Inventory # 135244848
Quantity: 5 available
Seller: AHA-BUCH GmbH, Einbeck, Germany
Taschenbuch. Condition: Neu. nach der Bestellung gedruckt Neuware - Printed after ordering - Build and deploy high-performance deep learning models using C++ for real-time applications where speed and efficiency matter.Free with your book: DRM-free PDF version + access to Packt's next-gen Reader\*Key Features:Build deep learning models in C++ with PyTorch C++ API and CUDAImplement CNNs, RNNs, LSTMs, GANs, and Transformers in C++ for real-world applicationsOptimize and deploy machine learning models to production with scalable C++ pipelinesBook Description:Deep learning systems often struggle to meet performance demands in real-time and production environments. This book shows you how to build high-performance deep learning systems in C++, enabling efficient and scalable artificial intelligence (AI) in resource-constrained environments where performance matters.You'll start by setting up a complete C++ deep learning environment and implementing core neural networks from scratch. As you progress, you'll build advanced architectures, including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory Networks (LSTMs), Generative Adversarial Networks (GANs), and Transformers, using C++, CUDA, and PyTorch's C++ API. The book then focuses on model quantization and compression. It will guide you through the model deployment process in production with robust monitoring and explainability. You'll also explore distributed training and techniques for real-time inference in performance-critical domains.By the end of this book, you'll be able to design, optimize, and deploy deep learning systems in C++ that are production-ready, scalable, and efficient across multiple industries.\*Email sign-up and proof of purchase requiredWhat You Will Learn:Set up and use CUDA and PyTorch's C++ API for deep learningImplement CNNs, RNNs, LSTMs, GANs, Transformers, and LLMs in C++Leverage CUDA for high-performance model trainingPerform model compression using quantization, pruning, and distillationDeploy and monitor models in production using C++ toolsApply explainability techniques such as LIME, SHAP, and Grad-CAMWho this book is for:This book is for ML engineers, deep learning practitioners, and data scientists with a C++ background who want to build or learn about high-performance deep learning models. It also serves developers transitioning from Python-based frameworks looking for real-time deployment solutions in industries like finance, autonomous systems, and healthcare.Table of ContentsIntroduction to Deep Learning with C++ and Environment SetupData Preparation and Preprocessing in C++CUDA for GPU Acceleration in Deep Learning with C++Building a Basic Neural Network in C++Multilayer Perceptrons in C++Convolutional Neural Networks in C++Recurrent Neural Networks and Long Short-Term Memory Networks in C++Generative Networks, Autoencoders, and Large Language Models in C++Transformers and Large Language Model Fine-tuning in C++Deploying and Optimizing Models for InferenceDebugging and Retraining Deployed ModelsMonitoring Deployed ModelsExplainability and Transparency in Deep Learning Models. Seller Inventory # 9781835880029
Quantity: 2 available