Probabilistic Modelling and Stochastic Algorithms for Visual Localisation and Tracking
Author: John MacCormick
Publisher: Springer Science & Business Media
A central problem in computer vision is to track objects as they move and deform in a video sequence. Stochastic algorithms -- in particular, particle filters and the Condensation algorithm -- have dramatically enhanced the state of the art for such visual tracking problems in recent years. This book presents a unified framework for visual tracking using particle filters, including the new technique of partitioned sampling which can alleviate the "curse of dimensionality" suffered by standard particle filters. The book also introduces the notion of contour likelihood: a collection of models for assessing object shape, colour and motion, which are derived from the statistical properties of image features. Because of their statistical nature, contour likelihoods are ideal for use in stochastic algorithms. A unifying theme of the book is the use of statistics and probability, which enable the final output of the algorithms presented to be interpreted as the computer's "belief" about the state of the world. The book will be of use and interest to students, researchers and practitioners in computer vision, and assumes only an elementary knowledge of probability theory.
6th International Gesture Workshop, GW 2005, Berder Island, France, May 18-20, 2005, Revised Selected Papers
Author: Sylvie Gibet
This book constitutes the thoroughly refereed post-proceedings of the 6th International Workshop on Gesture in Human-Computer Interaction and Simulation, GW 2005, held in May 2005. The 22 revised long papers and 14 revised short papers presented together with 2 invited lectures were carefully selected from numerous submissions during two rounds of reviewing and improvement. The papers are organized in topical sections on human perception and production of gesture, sign language representation, sign language recognition, vision-based gesture recognition, gesture analysis, gesture synthesis, gesture and music, and gesture interaction in multimodal systems.
Third International Workshop, AMDO 2004, Palma de Mallorca, Spain, September 22-24, 2004, Proceedings
Author: Francisco J. Perales
This book constitutes the refereed proceedings of the Third International Workshop on Articulated Motion and Deformable Objects, AMDO 2004, held in Palma de Mallorca, Spain in September 2004. The 25 revised full papers presented were carefully reviewed and selected for inclusion in the book. Among the topics addressed are geometric and physical modeling of deformable objects, motion analysis, articulated models and animation, visualization of deformable models, 3D recovery from motion, single or multiple human motion analysis and synthesis, applications of deformable models and motion analysis, face tracking, recovering and recognition models, and virtual and augmented reality systems.
13th International Conference, ACIVS 2011, Ghent, Belgium, August 22-25, 2011, Proceedings
Author: Jaques Blanc-Talon
This book constitutes the refereed proceedings of the 13th International Conference on Advanced Concepts for Intelligent Vision Systems, ACIVS 2011, held in Ghent, Belgium, in August 2011. The 66 revised full papers presented were carefully reviewed and selected from 124 submissions. The papers are organized in topical sections on classification recognition, and tracking, segmentation, images analysis, image processing, video surveillance and biometrics, algorithms and optimization; and 3D, depth and scene understanding.
This book deals with the creation of the algorithmic backbone that enables a computer to perceive humans in a monitored space. This is performed using the same signals that humans process, i.e., audio and video. Computers reproduce the same type of perception using sensors and algorithms in order to detect and track multiple interacting humans, by way of multiple cues, like bodies, faces or speech. This application domain is challenging, because audio and visual signals are cluttered by both background and foreground objects. First, particle filtering is established as the framework for tracking. Then, audio, visual and also audio-visual tracking systems are separately explained. Each modality is analyzed, starting with sensor configuration, detection for tracker initialization and the trackers themselves. Techniques to fuse the modalities are then considered. Instead of offering a monolithic approach to the tracking problem, this book also focuses on implementation by providing MATLAB code for every presented component. This way, the reader can connect every concept with corresponding code. Finally, the applications of the various tracking systems in different domains are studied./a
This book presents the state of the art in online visual tracking, including the motivations, practical algorithms, and experimental evaluations. Visual tracking remains a highly active area of research in Computer Vision and the performance under complex scenarios has substantially improved, driven by the high demand in connection with real-world applications and the recent advances in machine learning. A large variety of new algorithms have been proposed in the literature over the last two decades, with mixed success. Chapters 1 to 6 introduce readers to tracking methods based on online learning algorithms, including sparse representation, dictionary learning, hashing codes, local model, and model fusion. In Chapter 7, visual tracking is formulated as a foreground/background segmentation problem, and tracking methods based on superpixels and end-to-end deep networks are presented. In turn, Chapters 8 and 9 introduce the cutting-edge tracking methods based on correlation filter and deep learning. Chapter 10 summarizes the book and points out potential future research directions for visual tracking. The book is self-contained and suited for all researchers, professionals and postgraduate students working in the fields of computer vision, pattern recognition, and machine learning. It will help these readers grasp the insights provided by cutting-edge research, and benefit from the practical techniques available for designing effective visual tracking algorithms. Further, the source codes or results of most algorithms in the book are provided at an accompanying website.
This book is an updated effort in summarizing the trending topics and new hot research lines in solving dynamic problems using metaheuristics. An analysis of the present state in solving complex problems quickly draws a clear picture: problems that change in time, having noise and uncertainties in their definition are becoming very important. The tools to face these problems are still to be built, since existing techniques are either slow or inefficient in tracking the many global optima that those problems are presenting to the solver technique. Thus, this book is devoted to include several of the most important advances in solving dynamic problems. Metaheuristics are the more popular tools to this end, and then we can find in the book how to best use genetic algorithms, particle swarm, ant colonies, immune systems, variable neighborhood search, and many other bioinspired techniques. Also, neural network solutions are considered in this book. Both, theory and practice have been addressed in the chapters of the book. Mathematical background and methodological tools in solving this new class of problems and applications are included. From the applications point of view, not just academic benchmarks are dealt with, but also real world applications in logistics and bioinformatics are discussed here. The book then covers theory and practice, as well as discrete versus continuous dynamic optimization, in the aim of creating a fresh and comprehensive volume. This book is targeted to either beginners and experienced practitioners in dynamic optimization, since we took care of devising the chapters in a way that a wide audience could profit from its contents. We hope to offer a single source for up-to-date information in dynamic optimization, an inspiring and attractive new research domain that appeared in these last years and is here to stay.
Dealing with visual perception in robots and its applications to manipulation and imitation, this monograph focuses on stereo-based methods and systems for object recognition and 6 DoF pose estimation as well as for marker-less human motion capture.
Third International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2009, Santiago de Compostela, Spain, June 22-26, 2009, Proceedings
Author: Jose Mira
The two-volume set LNCS 5601 and LNCS 5602 constitutes the refereed proceedings of the Third International Work-Conference on the Interplay between Natural and Artificial Computation, IWINAC 2009, held in Santiago de Compostela, Spain, in June 2009. The 108 revised papers presented are thematically divided into two volumes. The first volume includes papers relating the most recent collaborations with Professor Mira and contributions mainly related with theoretical, conceptual and methodological aspects linking AI and knowledge engineering with neurophysiology, clinics and cognition. The second volume contains all the contributions connected with biologically inspired methods and techniques for solving AI and knowledge engineering problems in different application domains.
International Workshop, Dagstuhl Castle, Germany, October 15-20, 2000. Selected Revised Papers
Author: Gregory D. Hager
Category: Technology & Engineering
Robotics is a highly interdisciplinary research topic, that requires integration of methods for mechanics, control engineering, signal processing, planning, gra- ics, human-computer interaction, real-time systems, applied mathematics, and software engineering to enable construction of fully operational systems. The diversity of topics needed to design, implement, and deploy such systems implies that it is almost impossible for individual teams to provide the needed critical mass for such endeavors. To facilitate interaction and progress on sensor-based intelligent robotics inter-disciplinary workshops are necessary through which - depthdiscussioncanbeusedforcrossdisseminationbetweendi?erentdisciplines. The Dagstuhl foundation has organized a number of workshops on Mod- ing and Integration of Sensor Based Intelligent Robot Systems. The Dagstuhl seminars take place over a full week in a beautiful setting in the Saarland in Germany. The setting provides an ideal environment for in-depth presentations and rich interaction between the participants. This volume contains papers presented during the fourth workshop held - tober 15–20, 2000. All papers were submitted by workshop attendees, and were reviewed by at least one reviewer. We wish to thank all of the reviewers for their invaluable help in making this a high-quality selection of papers. We gratefully acknowledge the support of the Schloss Dagstuhl Foundation and the sta? at Springer-Verlag. Without their support the production of this volume would not have been possible.
International Evaluation Workshops CLEAR 2007 and RT 2007, Baltimore, MD, USA, May 8-11, 2007, Revised Selected Papers
Author: Rainer Stiefelhagen
This book constitutes the thoroughly refereed joint post-workshop proceedings of two co-located events: the Second International Workshop on Classification of Events, Activities and Relationships, CLEAR 2007, and the 5th Rich Transcription 2007 Meeting Recognition evaluation, RT 2007, held in succession in Baltimore, MD, USA, in May 2007. The workshops had complementary evaluation efforts; CLEAR for the evaluation of human activities, events, and relationships in multiple multimodal data domains; and RT for the evaluation of speech transcription-related technologies from meeting room audio collections. The 35 revised full papers presented from CLEAR 2007 cover 3D person tracking, 2D face detection and tracking, person and vehicle tracking on surveillance data, vehicle and person tracking aerial videos, person identification, head pose estimation, and acoustic event detection. The 15 revised full papers presented from RT 2007 are organized in topical sections on speech-to-text, and speaker diarization.
Face recognition has been actively studied over the past decade and continues to be a big research challenge. Just recently, researchers have begun to investigate face recognition under unconstrained conditions. Unconstrained Face Recognition provides a comprehensive review of this biometric, especially face recognition from video, assembling a collection of novel approaches that are able to recognize human faces under various unconstrained situations. The underlying basis of these approaches is that, unlike conventional face recognition algorithms, they exploit the inherent characteristics of the unconstrained situation and thus improve the recognition performance when compared with conventional algorithms. Unconstrained Face Recognition is structured to meet the needs of a professional audience of researchers and practitioners in industry. This volume is also suitable for advanced-level students in computer science.
Proceedings of the VIIth Biennial Australian Pattern Recognition Society Conference, DICTA 2003
Author: Changming Sun
Publisher: CSIRO PUBLISHING
Category: Technology & Engineering
Digital Image Computing: Techniques and Applications is the premier biennial conference in Australia on the topics of image processing and image analysis. This seventh edition of the proceedings has seen an unprecedented level of submission, on such diverse areas as: Image processing; Face recognition; Segmentation; Registration; Motion analysis; Medical imaging; Object recognition; Virtual environments; Graphics; Stereo-vision; and Video analysis. These two volumes contain all the 108 accepted papers and five invited talks that were presented at the conference. These two volumes provide the Australian and international imaging research community with a snapshot of current theoretical and practical developments in these areas. They are of value to any engineer, computer scientist, mathematician, statistician or student interested in these matters.
Speech and language technologies continue to grow in importance as they are used to create natural and efficient interfaces between people and machines, and to automatically transcribe, extract, analyze, and route information from high-volume streams of spoken and written information. The workshops on Mathematical Foundations of Speech Processing and Natural Language Modeling were held in the Fall of 2000 at the University of Minnesota's NSF-sponsored Institute for Mathematics and Its Applications, as part of a "Mathematics in Multimedia" year-long program. Each workshop brought together researchers in the respective technologies on the one hand, and mathematicians and statisticians on the other hand, for an intensive week of cross-fertilization. There is a long history of benefit from introducing mathematical techniques and ideas to speech and language technologies. Examples include the source-channel paradigm, hidden Markov models, decision trees, exponential models and formal languages theory. It is likely that new mathematical techniques, or novel applications of existing techniques, will once again prove pivotal for moving the field forward. This volume consists of original contributions presented by participants during the two workshops. Topics include language modeling, prosody, acoustic-phonetic modeling, and statistical methodology.
Nature-inspired algorithms have a great popularity in the current scientific community, being the focused scope of many research contributions in the literature year by year. The rationale behind the acquired momentum by this broad family of methods lies on their outstanding performance evinced in hundreds of research fields and problem instances. This book gravitates on the development of nature-inspired methods and their application to stochastic, dynamic and robust optimization. Topics covered by this book include the design and development of evolutionary algorithms, bio-inspired metaheuristics, or memetic methods, with empirical, innovative findings when used in different subfields of mathematical optimization, such as stochastic, dynamic, multimodal and robust optimization, as well as noisy optimization and dynamic and constraint satisfaction problems.
This comprehensive and state-of-the art approach to video processing gives engineers and students a comprehensive introduction and includes full coverage of key applications: wireless video, video networks, video indexing and retrieval and use of video in speech processing. Containing all the essential methods in video processing alongside the latest standards, it is a complete resource for the professional engineer, researcher and graduate student. Numerous conceptual and numerical examples All the latest standards are thoroughly covered: MPEG-1, MPEG-2, MPEG-4, H.264 and AVC Coverage of the latest techniques in video security "Like its sister volume "The Essential Guide to Image Processing," Professor Bovik’s Essential Guide to Video Processing provides a timely and comprehensive survey, with contributions from leading researchers in the area. Highly recommended for everyone with an interest in this fascinating and fast-moving field." —Prof. Bernd Girod, Stanford University, USA * Edited by a leading person in the field who created the IEEE International Conference on Image Processing, with contributions from experts in their fields. * Numerous conceptual and numerical examples *All the latest standards are thoroughly covered: MPEG-1, MPEG-2, MPEG-4, H.264 and AVC. * Coverage of the latest techniques in video security
Chandrika Kamath describes how techniques from the multi-disciplinary field of data mining can be used to address the modern problem of data overload in science and engineering domains. Starting with a survey of analysis problems in different applications, it identifies the common themes across these domains.
Proceedings of the 6th Computer Science On-line Conference 2017 (CSOC2017)
Author: Radek Silhavy
This book presents new methods and approaches to real-world problems as well as exploratory research that describes novel artificial intelligence applications, including deep learning, neural networks and hybrid algorithms. This book constitutes the refereed proceedings of the Artificial Intelligence Trends in Intelligent Systems Section of the 6th Computer Science On-line Conference 2017 (CSOC 2017), held in April 2017.
Asynchronous System-on-Chip Interconnect describes the use of an entirely asynchronous system-bus for the modular construction of integrated circuits. Industry is just awakening to the benefits of asynchronous design in avoiding the problems of clock-skew and multiple clock-domains, an din parallel with this is coming to grips with Intellectual Property (IP) based design flows which emphasise the need for a flexible interconnect strategy. In this book, John Bainbridge investigates the design of an asynchronous on-chip interconnect, looking at all the stages of the design from the choice of wiring layout, through asynchronous signalling protocols to the higher level problems involved in supporting split transactions. The MARBLE bus (the first asynchronous SoC bus) used in a commercial demonstrator chip containing a mixture of asynchronous and synchronous macrocells is used as a concrete example throughout the book.