Library

Computer Vision

Richard Hartley

Multiple View Geometry in Computer Vision

A basic problem in computer vision is to understand the structure of a real-world scene given several images of it. Techniques for solving this problem are taken from projective geometry and photogrammetry. Here, the authors cover the geometric principles and their algebraic representation in terms of camera projection matrices, the fundamental matrix and the trifocal tensor. The theory and methods of computation of these entities are discussed with real examples, as is their use in the reconstruction of scenes from multiple images. The new edition features an extended introduction covering the key ideas in the book (which itself has been updated with additional examples and appendices) and significant new results that have appeared since the first edition. Comprehensive background material is provided, so readers familiar with linear algebra and basic numerical methods can understand the projective geometry and estimation algorithms presented, and implement the algorithms directly from the book.


Richard Szeliski

Computer Vision: Algorithms and Applications

Computer Vision: Algorithms and Applications explores the variety of techniques used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both in specialized applications such as image search and autonomous navigation and in fun, consumer-level tasks that students can apply to their own personal photos and videos. More than just a source of “recipes,” this exceptionally authoritative and comprehensive textbook/reference takes a scientific approach to the formulation of computer vision problems. These problems are then analyzed using the latest classical and deep learning models and solved using rigorous engineering principles.


Benjamin Planche

Hands-On Computer Vision with TensorFlow 2

A practical guide to building high-performance systems for object detection, segmentation, video processing, smartphone applications, and more.

Key Features: Discover how to build, train, and serve your own deep neural networks with TensorFlow 2 and Keras. Apply modern solutions to a wide range of applications such as object detection and video analysis. Learn how to run your models on mobile devices and web pages and improve their performance.

Book Description: Computer vision solutions are becoming increasingly common, making their way into fields such as health, automobile, social media, and robotics. This book will help you explore TensorFlow 2, the brand-new version of Google's open-source framework for machine learning. You will understand how to benefit from using convolutional neural networks (CNNs) for visual tasks. Hands-On Computer Vision with TensorFlow 2 starts with the fundamentals of computer vision and deep learning, teaching you how to build a neural network from scratch. You will discover the features that have made TensorFlow the most widely used AI library, along with its intuitive Keras interface. You'll then move on to building, training, and deploying CNNs efficiently. Complete with concrete code examples, the book demonstrates how to classify images with modern solutions, such as Inception and ResNet, and extract specific content using You Only Look Once (YOLO), Mask R-CNN, and U-Net.


Hanspeter A. Mallot

Computational Vision

This text provides an introduction to computational aspects of early vision, in particular, color, stereo, and visual navigation. It integrates approaches from psychophysics and quantitative neurobiology, as well as theories and algorithms from machine vision and photogrammetry. When presenting mathematical material, it uses detailed verbal descriptions and illustrations to clarify complex points. The text is suitable for upper-level students in neuroscience, biology, and psychology who have basic mathematical skills and are interested in studying the mathematical modeling of perception.
