Mastering OpenCV with Practical Computer Vision Projects

4.23 avg rating
( 13 ratings by Goodreads )
 
9781849517829: Mastering OpenCV with Practical Computer Vision Projects

This is the definitive advanced tutorial for OpenCV, designed for those with basic C++ skills. The computer vision projects are divided into easily assimilated chapters with an emphasis on practical involvement for an easier learning curve.

Overview

  • Allows anyone with basic OpenCV experience to rapidly obtain skills in many computer vision topics, for research or commercial use
  • Each chapter is a separate project covering a computer vision problem, written by a professional with proven experience on that topic.
  • All projects include a step-by-step tutorial and full source-code, using the C++ interface of OpenCV.

In Detail

Computer Vision is fast becoming an important technology and is used in Mars robots, national security systems, automated factories, driver-less cars, and medical image analysis to new forms of human-computer interaction. OpenCV is the most common library for computer vision, providing hundreds of complex and fast algorithms. But it has a steep learning curve and limited in-depth tutorials.

Mastering OpenCV with Practical Computer Vision Projects is the perfect book for developers with just basic OpenCV skills who want to try practical computer vision projects, as well as the seasoned OpenCV experts who want to add more Computer Vision topics to their skill set or gain more experience with OpenCV's new C++ interface before migrating from the C API to the C++ API.

Each chapter is a separate project including the necessary background knowledge, so try them all one-by-one or jump straight to the projects you're most interested in.

Create working prototypes from this book including real-time mobile apps, Augmented Reality, 3D shape from video, or track faces & eyes, fluid wall using Kinect, number plate recognition and so on.

Mastering OpenCV with Practical Computer Vision Projects gives you rapid training in nine computer vision areas with useful projects.

What you will learn from this book

  • Perform Face Analysis including simple Face & Eye & Skin Detection, Fisherfaces Face Recognition, 3D Head Orientation, complex Facial Feature Tracking.
  • Do Number Plate Detection and Optical Character Recognition (OCR) using Artificial Intelligence (AI) methods including SVMs and Neural Networks
  • Learn Augmented Reality for desktop and iPhone or iPad using simple artificial markers or complex markerless natural images
  • Generate a 3D object model by moving a plain 2D camera, using 3D Structure from Motion (SfM) camera reprojection methods
  • Redesign desktop real-time computer vision applications to more suitable Android & iOS mobile apps
  • Use simple image filter effects including cartoon, sketch, paint, and alien effects
  • Execute Human-Computer Interaction with an XBox Kinect sensor using the whole body as a dynamic input

Approach

Each chapter in the book is an individual project and each project is constructed with step-by-step instructions, clearly explained code, and includes the necessary screenshots.

Who this book is written for

You should have basic OpenCV and C/C++ programming experience before reading this book, as it is aimed at Computer Science graduates, researchers, and computer vision experts widening their expertise.

"synopsis" may belong to another edition of this title.

About the Author:

Daniel Lélis Baggio

Daniel Lélis Baggio started his work in computer vision through medical image processing at InCor (Instituto do Coração – Heart Institute) in São Paulo, where he worked with intra-vascular ultrasound image segmentation. Since then, he has focused on GPGPU and ported the segmentation algorithm to work with NVIDIA's CUDA. He has also dived into six degrees of freedom head tracking with a natural user interface group through a project called ehci (http://code.google.com/p/ehci/). He now works for the Brazilian Air Force.



Shervin Emami

Shervin Emami (born in Iran) taught himself electronics and hobby robotics during his early teens in Australia. While building his first robot at the age of 15, he learned how RAM and CPUs work. He was so amazed by the concept that he soon designed and built a whole Z80 motherboard to control his robot, and wrote all the software purely in binary machine code using two push buttons for 0s and 1s. After learning that computers can be programmed in much easier ways such as assembly language and even high-level compilers, Shervin became hooked to computer programming and has been programming desktops, robots, and smartphones nearly every day since then. During his late teens he created Draw3D (http://draw3d.shervinemami.info/), a 3D modeler with 30,000 lines of optimized C and assembly code that rendered 3D graphics faster than all the commercial alternatives of the time; but he lost interest in graphics programming when 3D hardware acceleration became available.

In University, Shervin took a subject on computer vision and became highly interested in it; so for his first thesis in 2003 he created a real-time face detection program based on Eigenfaces, using OpenCV (beta 3) for camera input. For his master's thesis in 2005 he created a visual navigation system for several mobile robots using OpenCV (v0.96). From 2008, he worked as a freelance Computer Vision Developer in Abu Dhabi and Philippines, using OpenCV for a large number of short-term commercial projects that included:

  • Detecting faces using Haar or Eigenfaces
  • Recognizing faces using Neural Networks, EHMM, or Eigenfaces
  • Detecting the 3D position and orientation of a face from a single photo using AAM and POSIT
  • Rotating a face in 3D using only a single photo
  • Face preprocessing and artificial lighting using any 3D direction from a single photo
  • Gender recognition
  • Facial expression recognition
  • Skin detection
  • Iris detection
  • Pupil detection
  • Eye-gaze tracking
  • Visual-saliency tracking
  • Histogram matching
  • Body-size detection
  • Shirt and bikini detection
  • Money recognition
  • Video stabilization
  • Face recognition on iPhone
  • Food recognition on iPhone
  • Marker-based augmented reality on iPhone (the second-fastest iPhone augmented reality app at the time).

OpenCV was putting food on the table for Shervin's family, so he began giving back to OpenCV through regular advice on the forums and by posting free OpenCV tutorials on his website (http://www.shervinemami.info/openCV.html). In 2011, he contacted the owners of other free OpenCV websites to write this book. He also began working on computer vision optimization for mobile devices at NVIDIA, working closely with the official OpenCV developers to produce an optimized version of OpenCV for Android. In 2012, he also joined the Khronos OpenVL committee for standardizing the hardware acceleration of computer vision for mobile devices, on which OpenCV will be based in the future.



David Millán Escrivá

David Millán Escrivá was eight years old when he wrote his first program on an 8086 PC with Basic language, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT through the Universitat Politécnica de Valencia with honors in human-computer interaction supported by computer vision with OpenCV (v0.96). He had a final project based on this subject and published it on HCI Spanish congress. He participated in Blender, an open source, 3D-software project, and worked in his first commercial movie Plumiferos - Aventuras voladoras as a Computer Graphics Software Developer.

David now has more than 10 years of experience in IT, with experience in computer vision, computer graphics, and pattern recognition, working on different projects and startups, applying his knowledge of computer vision, optical character recognition, and augmented reality. He is the author of the "DamilesBlog" (http://blog.damiles.com), where he publishes research articles and tutorials about OpenCV, computer vision in general, and Optical Character Recognition algorithms. David has reviewed the book gnuPlot Cookbook by Lee Phillips and published by Packt Publishing.



Khvedchenia Ievgen

Khvedchenia Ievgen is a computer vision expert from Ukraine. He started his career with research and development of a camera-based driver assistance system for Harman International. He then began working as a Computer Vision Consultant for ESG. Nowadays, he is a self-employed developer focusing on the development of augmented reality applications. Ievgen is the author of the Computer Vision Talks blog (http://computer-vision-talks.com), where he publishes research articles and tutorials pertaining to computer vision and augmented reality.



Naureen Mahmood

Naureen Mahmood is a recent graduate from the Visualization department at Texas A&M University. She has experience working in various programming environments, animation software, and microcontroller electronics. Her work involves creating interactive applications using sensor-based electronics and software engineering. She has also worked on creating physics-based simulations and their use in special effects for animation.

Here is her blog - http://howdweknows.blogspot.com/



Jason Saragih

Jason Saragih received his B.Eng degree in mechatronics (with honors) and Ph.D. in computer science from the Australian National University, Canberra, Australia, in 2004 and 2008, respectively. From 2008 to 2010 he was a Postdoctoral fellow at the Robotics Institute of Carnegie Mellon University, Pittsburgh, PA. From 2010 to 2012 he worked at the Commonwealth Scientific and Industrial Research Organization (CSIRO) as a Research Scientist. He is currently a Senior Research Scientist at Visual Features, an Australian tech startup company.

Dr. Saragih has made a number of contributions to the field of computer vision, specifically on the topic of deformable model registration and modeling. He is the author of two non-profit open source libraries that are widely used in the scientific community; DeMoLib and FaceTracker, both of which make use of generic computer vision libraries including OpenCV.

Here is his blog address - http://jsaragih.org/



Roy Shilkrot

Roy Shilkrot is a researcher and professional in the area of computer vision and computer graphics. He obtained a B.Sc. in Computer Science from Tel-Aviv-Yaffo Academic College, and an M.Sc. from Tel-Aviv University. He is currently a PhD candidate in Media Laboratory of the Massachusetts Institute of Technology (MIT) in Cambridge.

Roy has over seven years of experience as a Software Engineer in start-up companies and enterprises. Before joining the MIT Media Lab as a Research Assistant he worked as a Technology Strategist in the Innovation Laboratory of Comverse, a telecom solutions provider. He also dabbled in consultancy, and worked as an intern for Microsoft research at Redmond.

Here is his blog address - http://www.morethantechnical.com/

"About this title" may belong to another edition of this title.

Buy New View Book
List Price: US$ 44.99
US$ 42.99

Convert Currency

Shipping: FREE
From United Kingdom to U.S.A.

Destination, Rates & Speeds

Add to Basket

Top Search Results from the AbeBooks Marketplace

1.

Daniel Lelis Baggio, Khvedchenia Ievgen, Shervin Emami
Published by Packt Publishing Limited, United Kingdom (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 1
Seller:
The Book Depository
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2012. Paperback. Book Condition: New. Language: English . Brand New Book. Each chapter in the book is an individual project and each project is constructed with step-by-step instructions, clearly explained code, and includes the necessary screenshots. You should have basic OpenCV and C/C++ programming experience before reading this book, as it is aimed at Computer Science graduates, researchers, and computer vision experts widening their expertise. Bookseller Inventory # AAW9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 42.99
Convert Currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, Rates & Speeds

2.

Daniel Lelis Baggio, Khvedchenia Ievgen, Shervin Emami
Published by Packt Publishing Limited, United Kingdom (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 1
Seller:
The Book Depository US
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2012. Paperback. Book Condition: New. Language: English . Brand New Book. Each chapter in the book is an individual project and each project is constructed with step-by-step instructions, clearly explained code, and includes the necessary screenshots. You should have basic OpenCV and C/C++ programming experience before reading this book, as it is aimed at Computer Science graduates, researchers, and computer vision experts widening their expertise. Bookseller Inventory # AAW9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 48.56
Convert Currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, Rates & Speeds

3.

Daniel Lelis Baggio
Published by Packt Publishing Limited (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Quantity Available: > 20
Print on Demand
Seller:
Pbshop
(Wood Dale, IL, U.S.A.)
Rating
[?]

Book Description Packt Publishing Limited, 2012. PAP. Book Condition: New. New Book. Shipped from US within 10 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bookseller Inventory # IQ-9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 44.58
Convert Currency

Add to Basket

Shipping: US$ 3.99
Within U.S.A.
Destination, Rates & Speeds

4.

Emami, Shervin
Published by Packt Publishing (2016)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 1
Print on Demand
Seller:
Ria Christie Collections
(Uxbridge, United Kingdom)
Rating
[?]

Book Description Packt Publishing, 2016. Paperback. Book Condition: New. PRINT ON DEMAND Book; New; Publication Year 2016; Not Signed; Fast Shipping from the UK. No. book. Bookseller Inventory # ria9781849517829_lsuk

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 45.77
Convert Currency

Add to Basket

Shipping: US$ 5.17
From United Kingdom to U.S.A.
Destination, Rates & Speeds

5.

Daniel Lelis Baggio
Published by Packt Publishing Limited (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Quantity Available: > 20
Print on Demand
Seller:
Books2Anywhere
(Fairford, GLOS, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, 2012. PAP. Book Condition: New. New Book. Delivered from our UK warehouse in 3 to 5 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bookseller Inventory # LQ-9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 39.15
Convert Currency

Add to Basket

Shipping: US$ 12.01
From United Kingdom to U.S.A.
Destination, Rates & Speeds

6.

Emami, Shervin
Published by Packt Publishing 11/27/2012 (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback or Softback Quantity Available: 10
Seller:
BargainBookStores
(Grand Rapids, MI, U.S.A.)
Rating
[?]

Book Description Packt Publishing 11/27/2012, 2012. Paperback or Softback. Book Condition: New. Mastering Opencv with Practical Computer Vision Projects. Book. Bookseller Inventory # BBS-9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 56.91
Convert Currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, Rates & Speeds

7.

Lélis Baggio, Daniel; Emami, Shervin; Millán Escrivá, David; Ievgen, Khvedchenia; Mahmood, Naureen; Saragih, Jasonl; Shilkrot, Roy
Published by Packt Publishing (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 10
Seller:
Ergodebooks
(RICHMOND, TX, U.S.A.)
Rating
[?]

Book Description Packt Publishing, 2012. Paperback. Book Condition: New. Bookseller Inventory # INGM9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 52.22
Convert Currency

Add to Basket

Shipping: US$ 4.99
Within U.S.A.
Destination, Rates & Speeds

8.

Lélis Baggio, Daniel
Published by Packt Publishing (2017)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: > 20
Print on Demand
Seller:
Murray Media
(North Miami Beach, FL, U.S.A.)
Rating
[?]

Book Description Packt Publishing, 2017. Paperback. Book Condition: New. Never used! This item is printed on demand. Bookseller Inventory # 1849517827

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 57.32
Convert Currency

Add to Basket

Shipping: US$ 1.99
Within U.S.A.
Destination, Rates & Speeds

9.

Daniel Lélis Baggio; Shervin Emami; David Millán Escrivá; Khvedchenia Ievgen; Naureen Mahmood; Jasonl Saragih; Roy Shilkrot
Published by Packt Publishing (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 1
Seller:
Irish Booksellers
(Rumford, ME, U.S.A.)
Rating
[?]

Book Description Packt Publishing, 2012. Paperback. Book Condition: New. book. Bookseller Inventory # M1849517827

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 64.02
Convert Currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, Rates & Speeds

10.

Daniel Lelis Baggio, Khvedchenia Ievgen, Shervin Emami
Published by Packt Publishing Limited, United Kingdom (2012)
ISBN 10: 1849517827 ISBN 13: 9781849517829
New Paperback Quantity Available: 10
Seller:
Book Depository hard to find
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2012. Paperback. Book Condition: New. Language: English . This book usually ship within 10-15 business days and we will endeavor to dispatch orders quicker than this where possible. Brand New Book. Each chapter in the book is an individual project and each project is constructed with step-by-step instructions, clearly explained code, and includes the necessary screenshots. You should have basic OpenCV and C/C++ programming experience before reading this book, as it is aimed at Computer Science graduates, researchers, and computer vision experts widening their expertise. Bookseller Inventory # LIE9781849517829

More Information About This Seller | Ask Bookseller a Question

Buy New
US$ 68.48
Convert Currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, Rates & Speeds

There are more copies of this book

View all search results for this book