Muutke küpsiste eelistusi

Hands-On Generative Adversarial Networks with PyTorch 2.x: Generate awesome image, audio, text, and 3D models using Python 2nd Revised edition [Pehme köide]

  • Formaat: Paperback / softback, 315 pages, kõrgus x laius: 235x191 mm
  • Ilmumisaeg: 31-Jul-2024
  • Kirjastus: Packt Publishing Limited
  • ISBN-10: 1837637148
  • ISBN-13: 9781837637140
  • Formaat: Paperback / softback, 315 pages, kõrgus x laius: 235x191 mm
  • Ilmumisaeg: 31-Jul-2024
  • Kirjastus: Packt Publishing Limited
  • ISBN-10: 1837637148
  • ISBN-13: 9781837637140
With examples that cover computer vision, NLP and computer graphics, this book aims to become the practical guide for beginners in machine learning to understand the fundamental principles of GANs and learn the efficient applications of PyTorch library. The explanations of underlying mathematics behind the models also deliver in-depth knowledge to the advanced readers who wish to use GANs and PyTorch professionally.

Key Features

Explain the fundamental structures and principles of more than 22 GAN models Showcase various image/audio/text synthesis and image-to-image/text-to-image translation tasks with GANs More than just GAN: enriched with presentation of traditional approaches as well as similar deep learning techniques

Book DescriptionComputer vision is one of the most actively researched fields in deep learning. Aside from image classification and object recognition, the tasks of image synthesis have become an intriguing way to attract the attention of beginners in machine learning. GANs have been proved to be successful in generating realistic images over the last few years. As PyTorch is becoming the most popular open-source library in deep learning, a practical guide for building GAN models with PyTorch can be very useful for readers. In this book, we start with a quick example of building a simple GAN model with pure Python and thorough demonstrations of key features in PyTorch. We dive into a classical GAN model for image synthesis while giving clear explanation of its mechanism, before we move on to more interesting applications including image-to-image translation, image restoration, text synthesis, text-to-image translation, audio synthesis and 3D model generation. We also give thorough explanations of underlying mathematical principles, related traditional approaches and useful techniques in deep learning along the course, which may help advanced readers professionally.What you will learn

Build a simple GAN model with pure Python Key features in the latest version of PyTorch 2.x Understand how computational graphs are built for neural networks Perform image synthesis, image style transfer and image restoration Generate images based on label prompts or text descriptions Handle NLP tasks by generating audio and text with GANs Synthesize texture image and produce 3D models Perform adversarial attack and data augmentation with GANs Useful tricks and techniques in deep learning

Who this book is forThis book is for anyone looking to do creative work in machine learning, especially in deep learning. It is required for the reader to have programming experience in Python. Those who are familiar with the concepts of machine learning, deep learning and computer vision may find it much easier to follow our course. Do not worry if you havent used PyTorch before, because we will explain everything you need to know about PyTorch. It is highly recommended to have access to medium-to-high end NVIDIA graphics card to save you much time waiting for the model results.
Table of Contents

Product Information Document
Generative Adversarial Networks Fundamentals
Getting Started with PyTorch
Building your first GAN with PyTorch
Interactively generating images via Conditional GAN
Image-to-image translation and its applications
Image restoration with GANs
Attack and improve classification models with GANs
Image generation from description text
Sequence synthesis with GANs
GANs for computer graphics
VAE, Vision Transformer and Diffusion models
John Hany received his master's degree and bachelor's degree in calculational mathematics at the University of Electronic Science and Technology of China. He majors in pattern recognition and has years of experience in machine learning and computer vision. He has taken part in several practical projects, including intelligent transport systems and facial recognition systems. His current research interests lie in reducing the computation costs of deep neural networks while improving their performance on image classification and detection tasks. He is enthusiastic about open source projects and has contributed to many of them. Shuai Yan is a Computer Vision engineer. As an engineer in the field of deep learning, he has rich work experience, and he has made contributions to many outstanding enterprises dedicated to deep learning. He has in-depth knowledge and rich engineering experience in related fields.