Author: Michael W. Berry
This volume contains selected papers that focus on the use of linear algebra, computational statistics, and computer science in the development of algorithms and software systems for text retrieval. Experts in information modeling and retrieval share their perspectives on the design of scalable but precise text retrieval systems, revealing many of the challenges and obstacles that mathematical and statistical models must overcome to be viable for automated text processing. This very useful proceedings is an excellent companion for courses in information retrieval, applied linear algebra, and applied statistics.
Author: Dariush Alimohammadi
Category: Information retrieval
This book is about Information Retrieval (IR), particularly Classical Information Retrieval (CIR). It looks at these topics through their mathematical roots. The mathematical bases of CIR are briefly reviewed, followed by the most important and interesting models of CIR, including Boolean, Vector Space, and Probabilistic. The primary goal of book is to create a context for understanding the principles of CIR by discussing its mathematical bases. This book can be helpful for LIS students who are studying IR but have no knowledge of mathematics. Weakness in math impairs the ability to understand current issues in IR. While LIS students are the main target of this book, it may be of interest to computer science and communications students as well.
Author: Christopher D. Manning,Prabhakar Raghavan,Hinrich Schütze
Publisher: Cambridge University Press
Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.
Proceedings of the 5th International Conference on Computer Science, Applied Mathematics and Applications, ICCSAMA 2017
Author: Nguyen-Thinh Le,Tien van Do,Ngoc Thanh Nguyen,Hoai An Le Thi
These proceedings consist of 19 papers, which have been peer-reviewed by international program committee and selected for the 5th International Conference on Computer Science, Applied Mathematics and Applications (ICCSAMA 2017), which was held on June 30–July 1, 2017 in Berlin, Germany. The respective chapters discuss both theoretical and practical issues in connection with computational methods and optimization methods for knowledge engineering. The broad range of application areas discussed includes network computing, simulation, intelligent and adaptive e-learning, information retrieval, sentiment analysis, autonomous underwater vehicles, social media analysis, natural language processing, biomimetics in organizations, and cash management. In addition to pure content, the book offers many inspiring ideas and suggests new research directions, making it a valuable resource for graduate students, Ph.D. students, and researchers in Computer Science and Applied Mathematics alike.
Author: Gerard Salton
Category: Technology & Engineering
Test results are included which illustrate the effectiveness of the theory.
Author: L. M. Patnaik
Publisher: I. K. International Pvt Ltd
Category: Information storage and retrieval systems
The proceedings features several key-note addresses in the areas of advanced information processing tools. This area has been recognized to be one of the key five technologies poised to shape the modern society in the next decade. It aptly focuses on the tools and techniques for the development of Information Systems. Emphasis is on pattern recognition and image processing, software engineering, mobile ad hoc networks, security aspects in computer networks, signal processing and hardware synthesis, optimization techniques, data mining and information processing.
Author: H. S. Heaps
Publisher: Academic Pr
General concepts; Document data bases for computer search; Question logic and format; Data structures for storage and retrieval; Structure of search programs; Vocabulary characteristics of document data bases; Information theory considerations; Coding and compression of data bases; Example of design of a document retrieval system; Document indexing and term associations; Automatic question modification; Automatic document classification; Concluding remarks.
Advancing Applications and Technologies
Author: Yan, Li
Publisher: IGI Global
As consumer costs for multimedia devices such as digital cameras and Web phones have decreased and diversity in the market has skyrocketed, the amount of digital information has grown considerably. Intelligent Multimedia Databases and Information Retrieval: Advancing Applications and Technologies details the latest information retrieval technologies and applications, the research surrounding the field, and the methodologies and design related to multimedia databases. Together with academic researchers and developers from both information retrieval and artificial intelligence fields, this book details issues and semantics of data retrieval with contributions from around the globe. As the information and data from multimedia databases continues to expand, the research and documentation surrounding it should keep pace as best as possible, and this book provides an excellent resource for the latest developments.
Clustering, Classification, and Retrieval
Author: Michael W. Berry
Publisher: Springer Science & Business Media
Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs. Knowledge extraction or creation from text requires systematic yet reliable processing that can be codified and adapted for changing needs and environments. This book will draw upon experts in both academia and industry to recommend practical approaches to the purification, indexing, and mining of textual information. It will address document identification, clustering and categorizing documents, cleaning text, and visualizing semantic models of text.
Methodologies and Applications : Proceedings of the 15th Toyota Conference
Author: Masatake Mori,T. Mitsui
The 20th century saw tremendous achievements and progress in science and technology. Undoubtedly, computers and computer-related technologies acted as one of vital catalysts for accelerating this progress in the latter half of the century. The contributions of mathematical sciences have been equally profound, and the synergy between mathematics and computer science has played a key role in accelerating the progress of both fields as well as science and engineering. Mathematical sciences will undoubtedly continue to play this vital role in this new century. In particular, mathematical modeling and numerical simulation will continue to be among the essential methodologies for solving massive and complex problems that arise in science, engineering and manufacturing. Underpinning this all from a sound, theoretical perspective will be numerical algorithms. In recognition of this observation, this volume focuses on the following specific topics. (1) Fundamental numerical algorithms (2) Applications of numerical algorithms (3) Emerging technologies. The articles included in this issue by experts on advanced scientific and engineering computations from numerous countries elucidate state-of-the-art achievements in these three topics from various angles and suggest the future directions. Although we cannot hope to cover all the aspects in scientific and engineering computations, we hope that the articles will interest, inform and inspire members of the science and engineering community.
Author: C. J. van Rijsbergen
Publisher: Cambridge University Press
An important work on a new framework for information retrieval: implications for artificial intelligence, natural language processing.
Theory and Implementation
Author: Gerald J. Kowalski,Mark T. Maybury
Publisher: Springer Science & Business Media
Chapter 1 places into perspective a total Information Storage and Retrieval System. This perspective introduces new challenges to the problems that need to be theoretically addressed and commercially implemented. Ten years ago commercial implementation of the algorithms being developed was not realistic, allowing theoreticians to limit their focus to very specific areas. Bounding a problem is still essential in deriving theoretical results. But the commercialization and insertion of this technology into systems like the Internet that are widely being used changes the way problems are bounded. From a theoretical perspective, efficient scalability of algorithms to systems with gigabytes and terabytes of data, operating with minimal user search statement information, and making maximum use of all functional aspects of an information system need to be considered. The dissemination systems using persistent indexes or mail files to modify ranking algorithms and combining the search of structured information fields and free text into a consolidated weighted output are examples of potential new areas of investigation. The best way for the theoretician or the commercial developer to understand the importance of problems to be solved is to place them in the context of a total vision of a complete system. Understanding the differences between Digital Libraries and Information Retrieval Systems will add an additional dimension to the potential future development of systems. The collaborative aspects of digital libraries can be viewed as a new source of information that dynamically could interact with information retrieval techniques.
Applications and Theory
Author: Michael W. Berry,Jacob Kogan
Publisher: John Wiley & Sons
Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives. The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning, and natural language processing can collectively capture, classify, and interpret words and their contexts. As suggested in the preface, text mining is needed when “words are not enough.” This book: Provides state-of-the-art algorithms and techniques for critical tasks in text mining applications, such as clustering, classification, anomaly and trend detection, and stream analysis. Presents a survey of text visualization techniques and looks at the multilingual text classification problem. Discusses the issue of cybercrime associated with chatrooms. Features advances in visual analytics and machine learning along with illustrative examples. Is accompanied by a supporting website featuring datasets. Applied mathematicians, statisticians, practitioners and students in computer science, bioinformatics and engineering will find this book extremely useful.
Author: Charles T. Meadow,Bert R. Boyce,Donald H. Kraft
Category: Business & Economics
"Information retrieval is a communication process that links an information user or seeker to a computer system that contains data bases or to a librarian, museum curator, fingerprint identification specialist, or whoever is in charge of a collection of what we are calling documents. The communication will normally involve the processing of text, strings of words known to both parties in the process that can be used to describe a document's content and other attributes and link it with a need expressed in similar terms. Even non-text entities, such as musical themes, are converted to alphanumeric symbols to allow effective storage, search, and retrieval. This book's purpose is to teach people who will be searching or designing text retrieval systems how the systems work. For designers, it covers problems they will face and reviews currently available solutions to provide a basis for more advanced study. For the searcher its purpose is to describe why such systems work as they do. The book is primarily about computer-based retrieval systems, but the principles apply to nonmechanized ones as well"--Back cover.
20th International Symposium, SPIRE 2013, Jerusalem, Israel, October 7-9, 2013, Proceedings
Author: Oren Kurland,Moshe Lewenstein,Ely Porat
This book constitutes the refereed proceedings of the 20th International Symposium on String Processing and Information Retrieval, SPIRE 2013, held in Jerusalem, Israel, in October 2013. The 18 full papers, 10 short papers were carefully reviewed and selected from 60 submissions. The program also featured 4 keynote speeches. The following topics are covered: fundamentals algorithms in string processing and information retrieval; SP and IR techniques as applied to areas such as computational biology, DNA sequencing, and Web mining.
Author: T. Strzalkowski
Publisher: Springer Science & Business Media
Category: Language Arts & Disciplines
The last decade has been one of dramatic progress in the field of Natural Language Processing (NLP). This hitherto largely academic discipline has found itself at the center of an information revolution ushered in by the Internet age, as demand for human-computer communication and informa tion access has exploded. Emerging applications in computer-assisted infor mation production and dissemination, automated understanding of news, understanding of spoken language, and processing of foreign languages have given impetus to research that resulted in a new generation of robust tools, systems, and commercial products. Well-positioned government research funding, particularly in the U. S. , has helped to advance the state-of-the art at an unprecedented pace, in no small measure thanks to the rigorous 1 evaluations. This volume focuses on the use of Natural Language Processing in In formation Retrieval (IR), an area of science and technology that deals with cataloging, categorization, classification, and search of large amounts of information, particularly in textual form. An outcome of an information retrieval process is usually a set of documents containing information on a given topic, and may consist of newspaper-like articles, memos, reports of any kind, entire books, as well as annotated image and sound files. Since we assume that the information is primarily encoded as text, IR is also a natural language processing problem: in order to decide if a document is relevant to a given information need, one needs to be able to understand its content.
The Concepts and Technology Behind Search
Author: Ricardo Baeza-Yates,Berthier Ribeiro-Neto
Publisher: Addison-Wesley Professional
This is a rigorous and complete textbook for a first course on information retrieval from the computer science perspective. It provides an up-to-date student oriented treatment of information retrieval including extensive coverage of new topics such as web retrieval, web crawling, open source search engines and user interfaces. From parsing to indexing, clustering to classification, retrieval to ranking, and user feedback to retrieval evaluation, all of the most important concepts are carefully introduced and exemplified. The contents and structure of the book have been carefully designed by the two main authors, with individual contributions coming from leading international authorities in the field, including Yoelle Maarek, Senior Director of Yahoo! Research Israel; Dulce Poncele´on IBM Research; and Malcolm Slaney, Yahoo Research USA. This completely reorganized, revised and enlarged second edition of Modern Information Retrieval contains many new chapters and double the number of pages and bibliographic references of the first edition, and a companion website www.mir2ed.org with teaching material. It will prove invaluable to students, professors, researchers, practitioners, and scholars of this fascinating field of information retrieval.
6th International Conference, PPAM 2005, Poznan, Poland, September 11-14, 2005, Revised Selected Papers
Author: Roman Wyrzykowski,Jack Dongarra,Norbert Meyer,Jerzy Wasniewski
Category: Parallel processing (Electronic computers)
This book constitutes the thoroughly refereed post-proceedings of the 6th International Conference on Parallel Processing and Applied Mathematics, PPAM 2005. The book presents 135 papers organized in topical sections on parallel and distributed architectures, parallel and distributed non-numerical algorithms, performance analysis, prediction and optimization, grid programming, tools and environments for clusters and grids, applications of parallel/distributed/grid computing, evolutionary computing with applications, parallel data mining, parallel numerics, and mathematical and computing methods.