Search Engines

Information Retrieval in Practice

Author: Bruce Croft,Donald Metzler,Trevor Strohman

Publisher: Pearson Higher Ed

ISBN: 0133001598

Category: Computers

Page: 552

View: 2340

This is the eBook of the printed book and may not include any media, website access codes, or print supplements that may come packaged with the bound book. Search Engines: Information Retrieval in Practice is ideal for introductory information retrieval courses at the undergraduate and graduate level in computer science, information science and computer engineering departments. It is also a valuable tool for search engine and information retrieval professionals. Written by a leader in the field of information retrieval, Search Engines: Information Retrieval in Practice , is designed to give undergraduate students the understanding and tools they need to evaluate, compare and modify search engines. Coverage of the underlying IR and mathematical models reinforce key concepts. The book’s numerous programming exercises make extensive use of Galago, a Java-based open source search engine.

Search Engines

Information Retrieval in Practice

Author: W. Bruce Croft,Donald Metzler,Trevor Strohman

Publisher: Addison-Wesley Professional

ISBN: 9780136072249

Category: Computers

Page: 520

View: 7414

Search Engines: Information Retrieval in Practice is ideal for introductory information retrieval courses at the undergraduate and graduate level in computer science, information science and computer engineering departments. It is also a valuable tool for search engine and information retrieval professionals. Written by a leader in the field of information retrieval, Search Engines: Information Retrieval in Practice , is designed to give undergraduate students the understanding and tools they need to evaluate, compare and modify search engines. Coverage of the underlying IR and mathematical models reinforce key concepts. The book's numerous programming exercises make extensive use of Galago, a Java-based open source search engine.

Search Engines

Information Retrieval in Practice

Author: W. Bruce Croft,Donald Metzler,Trevor Strohman

Publisher: N.A

ISBN: 9780131364899

Category: Computers

Page: 524

View: 7918

Search Engines: Information Retrieval in Practice introduces the key issues in information retrieval (IR) and shows how they affect the design and implementation of search engines, with mathematical models reinforcing important concepts. This book is ideal for an introductory course on IR at either the undergraduate or master's level or for professionals seeking an authoritative introduction. An extensive set of resources is available to instructors."--BOOK JACKET.

Information Retrieval

Implementing and Evaluating Search Engines

Author: Stefan Büttcher,Charles L. A. Clarke,Gordon V. Cormack

Publisher: MIT Press

ISBN: 0262528878

Category: Computers

Page: 632

View: 843

An introduction to information retrieval, the foundation for modern search engines, that emphasizes implementation and experimentation.

Introduction to Information Retrieval

Author: Christopher D. Manning,Prabhakar Raghavan,Hinrich Schütze

Publisher: Cambridge University Press

ISBN: 1139472100

Category: Computers

Page: N.A

View: 1503

Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

Next Generation Search Engines

Advanced Models for Information Retrieval

Author: Christophe Jouis,Ismail Biskri,Jean-Gabriel Ganascia,Magali Roux

Publisher: IGI Publishing

ISBN: 9781466603301

Category: Computers

Page: 540

View: 5981

"This book is intended for scientists and decision-makers who wish to gain working knowledge about search engines in order to evaluate available solutions and to dialogue with software and data providers"--Provided by publisher.

Interactive Information Retrieval in Digital Environments

Author: Xie, Iris

Publisher: IGI Global

ISBN: 9781599042428

Category: Computers

Page: 376

View: 3487

"This book includes the integration of existing frameworks on user-oriented information retrieval systems across multiple disciplines; the comprehensive review of empirical studies of interactive information retrieval systems for different types of users, tasks, and subtasks; and the discussion of how to evaluate interactive information retrieval systems. "--Provided by publisher.

Learning to Rank for Information Retrieval

Author: Tie-Yan Liu

Publisher: Springer Science & Business Media

ISBN: 9783642142673

Category: Computers

Page: 285

View: 8796

Due to the fast growth of the Web and the difficulties in finding desired information, efficient and effective information retrieval systems have become more important than ever, and the search engine has become an essential tool for many people. The ranker, a central component in every search engine, is responsible for the matching between processed queries and indexed documents. Because of its central role, great attention has been paid to the research and development of ranking technologies. In addition, ranking is also pivotal for many other information retrieval applications, such as collaborative filtering, definition ranking, question answering, multimedia retrieval, text summarization, and online advertisement. Leveraging machine learning technologies in the ranking process has led to innovative and more effective ranking models, and eventually to a completely new research area called “learning to rank”. Liu first gives a comprehensive review of the major approaches to learning to rank. For each approach he presents the basic framework, with example algorithms, and he discusses its advantages and disadvantages. He continues with some recent advances in learning to rank that cannot be simply categorized into the three major approaches – these include relational ranking, query-dependent ranking, transfer ranking, and semisupervised ranking. His presentation is completed by several examples that apply these technologies to solve real information retrieval problems, and by theoretical discussions on guarantees for ranking performance. This book is written for researchers and graduate students in both information retrieval and machine learning. They will find here the only comprehensive description of the state of the art in a field that has driven the recent advances in search engine development.

The Turn

Integration of Information Seeking and Retrieval in Context

Author: Peter Ingwersen,Kalervo Järvelin

Publisher: Springer Science & Business Media

ISBN: 1402038518

Category: Computers

Page: 448

View: 3961

T The Turn analyzes the research of information seeking and retrieval (IS&R) and proposes a new direction of integrating research in these two areas: the fields should turn off their separate and narrow paths and construct a new avenue of research. An essential direction for this avenue is context as given in the subtitle Integration of Information Seeking and Retrieval in Context. Other essential themes in the book include: IS&R research models, frameworks and theories; search and works tasks and situations in context; interaction between humans and machines; information acquisition, relevance and information use; research design and methodology based on a structured set of explicit variables - all set into the holistic cognitive approach. The present monograph invites the reader into a construction project - there is much research to do for a contextual understanding of IS&R. The Turn represents a wide-ranging perspective of IS&R by providing a novel unique research framework, covering both individual and social aspects of information behavior, including the generation, searching, retrieval and use of information. Regarding traditional laboratory information retrieval research, the monograph proposes the extension of research toward actors, search and work tasks, IR interaction and utility of information. Regarding traditional information seeking research, it proposes the extension toward information access technology and work task contexts. The Turn is the first synthesis of research in the broad area of IS&R ranging from systems oriented laboratory IR research to social science oriented information seeking studies.

Human Information Retrieval

Author: Warner

Publisher: MIT Press

ISBN: 0262258501

Category: Computers

Page: 200

View: 2668

Information retrieval in the age of Internet search engines has become part of ordinary discourse and everyday practice: "Google" is a verb in common usage. Thus far, more attention has been given to practical understanding of information retrieval than to a full theoretical account. In Human Information Retrieval, Julian Warner offers a comprehensive overview of information retrieval, synthesizing theories from different disciplines (information and computer science, librarianship and indexing, and information society discourse) and incorporating such disparate systems as WorldCat and Google into a single, robust theoretical framework. There is a need for such a theoretical treatment, he argues, one that reveals the structure and underlying patterns of this complex field while remaining congruent with everyday practice. Warner presents a labor theoretic approach to information retrieval, building on his previously formulated distinction between semantic and syntactic mental labor, arguing that the description and search labor of information retrieval can be understood as both semantic and syntactic in character. Warner's information science approach is rooted in the humanities and the social sciences but informed by an understanding of information technology and information theory. The chapters offer a progressive exposition of the topic, with illustrative examples to explain the concepts presented. Neither narrowly practical nor largely speculative, Human Information Retrieval meets the contemporary need for a broader treatment of information and information systems.

Modern Information Retrieval

The Concepts and Technology Behind Search

Author: Ricardo Baeza-Yates,Berthier Ribeiro-Neto

Publisher: Addison-Wesley Professional

ISBN: 9780321416919

Category: Computers

Page: 913

View: 774

This is a rigorous and complete textbook for a first course on information retrieval from the computer science perspective. It provides an up-to-date student oriented treatment of information retrieval including extensive coverage of new topics such as web retrieval, web crawling, open source search engines and user interfaces. From parsing to indexing, clustering to classification, retrieval to ranking, and user feedback to retrieval evaluation, all of the most important concepts are carefully introduced and exemplified. The contents and structure of the book have been carefully designed by the two main authors, with individual contributions coming from leading international authorities in the field, including Yoelle Maarek, Senior Director of Yahoo! Research Israel; Dulce Poncele´on IBM Research; and Malcolm Slaney, Yahoo Research USA. This completely reorganized, revised and enlarged second edition of Modern Information Retrieval contains many new chapters and double the number of pages and bibliographic references of the first edition, and a companion website www.mir2ed.org with teaching material. It will prove invaluable to students, professors, researchers, practitioners, and scholars of this fascinating field of information retrieval.

Financial Reporting under IFRS

A Topic Based Approach

Author: Wolfgang Dick,Franck Missonier-Piera

Publisher: John Wiley & Sons

ISBN: 0470971622

Category: Business & Economics

Page: 360

View: 2279

The International Financial Reporting Standards are quite different from other sets of accounting standards, and are fundamentally different from US-GAAP, in that they are based on principles, and not on detailed rules. Financial Reporting under IFRS:A topic-based approach offers a global perspective on IFRS by presenting the prescribed rationale and principles and illustrating them through numerous examples from large international companies. It aims to develop the fundamental skills necessary to read and use the information contained in all types of financial statements, through examples, activities, questions and answers. The book is broadly divided into three sections. Section one examines the structure of the Balance Sheet and the Income Statement, their links and the accounting mechanisms used to prepare them. Section two deals with the identification, evaluation and reporting of Balance Sheet items. Section three covers the use of financial statements to analyze a firm’s performance and its risks. Throughout the book special topics are covered, including Derivatives and Hedge accounting (IAS 39), Business Combination (IFRS 3) and Operating Segments (IFRS 8). Financial Reporting under IFRS is ideally suited to the needs of students of accounting and financial reporting, but all users of financial statements, from creditors and investors to suppliers, customers, employees and governments will benefit from its concise, topic-based approach.

Understanding Information Retrieval Systems

Management, Types, and Standards

Author: Marcia J. Bates

Publisher: CRC Press

ISBN: 1466551356

Category: Business & Economics

Page: 752

View: 2977

In order to be effective for their users, information retrieval (IR) systems should be adapted to the specific needs of particular environments. The huge and growing array of types of information retrieval systems in use today is on display in Understanding Information Retrieval Systems: Management, Types, and Standards, which addresses over 20 types of IR systems. These various system types, in turn, present both technical and management challenges, which are also addressed in this volume. In order to be interoperable in a networked environment, IR systems must be able to use various types of technical standards, a number of which are described in this book—often by their original developers. The book covers the full context of operational IR systems, addressing not only the systems themselves but also human user search behaviors, user-centered design, and management and policy issues. In addition to theory and practice of IR system design, the book covers Web standards and protocols, the Semantic Web, XML information retrieval, Web social mining, search engine optimization, specialized museum and library online access, records compliance and risk management, information storage technology, geographic information systems, and data transmission protocols. Emphasis is given to information systems that operate on relatively unstructured data, such as text, images, and music. The book is organized into four parts: Part I supplies a broad-level introduction to information systems and information retrieval systems Part II examines key management issues and elaborates on the decision process around likely information system solutions Part III illustrates the range of information retrieval systems in use today discussing the technical, operational, and administrative issues for each type Part IV discusses the most important organizational and technical standards needed for successful information retrieval This volume brings together authoritative articles on the different types of information systems and how to manage real-world demands such as digital asset management, network management, digital content licensing, data quality, and information system failures. It explains how to design systems to address human characteristics and considers key policy and ethical issues such as piracy and preservation. Focusing on web–based systems, the chapters in this book provide an excellent starting point for developing and managing your own IR systems.

Innovations in Information Retrieval

Perspectives for Theory and Practice

Author: Allen Foster,Pauline Rafferty

Publisher: Facet Publishing

ISBN: 1856046974

Category: Language Arts & Disciplines

Page: 156

View: 2214

The advent of new information retrieval (IR) technologies and approaches to storage and retrieval provide communities with previously unheard of opportunities for mass documentation, digitization, and the recording of information in all its forms. This book introduces and contextualizes these developments and looks at supporting research in IR, the debates, theories and issues. Contributed by an international team of experts, each authored chapter provides a snapshot of changes in the field, as well as the importance of developing innovation, creativity and thinking in IR practice and research. Key discussion areas include: browsing in new information environments classification revisited: a web of knowledge approaches to fiction retrieval research music information retrieval research folksonomies, social tagging and information retrieval digital information interaction as semantic navigation assessing web search machines: a webometric approach. Readership: LIS professionals , researchers and students, and for all those interested in the future of IR.

Information Retrieval

Algorithms and Heuristics

Author: David A. Grossman,Ophir Frieder

Publisher: Springer Science & Business Media

ISBN: 9781402030048

Category: Computers

Page: 332

View: 1559

Interested in how an efficient search engine works? Want to know what algorithms are used to rank resulting documents in response to user requests? The authors answer these and other key information retrieval design and implementation questions. This book is not yet another high level text. Instead, algorithms are thoroughly described, making this book ideally suited for both computer science students and practitioners who work on search-related applications. As stated in the foreword, this book provides a current, broad, and detailed overview of the field and is the only one that does so. Examples are used throughout to illustrate the algorithms. The authors explain how a query is ranked against a document collection using either a single or a combination of retrieval strategies, and how an assortment of utilities are integrated into the query processing scheme to improve these rankings. Methods for building and compressing text indexes, querying and retrieving documents in multiple languages, and using parallel or distributed processing to expedite the search are likewise described. This edition is a major expansion of the one published in 1998. Besides updating the entire book with current techniques, it includes new sections on language models, cross-language information retrieval, peer-to-peer processing, XML search, mediators, and duplicate document detection.

Think Data Structures

Algorithms and Information Retrieval in Java

Author: Allen B Downey

Publisher: "O'Reilly Media, Inc."

ISBN: 1491972343

Category:

Page: N.A

View: 9444

If you're a student studying computer science or a software developer preparing for technical interviews, this practical book will help you learn and review some of the most important ideas in software engineering--data structures and algorithms--in a way that's clearer, more concise, and more engaging than other materials. By emphasizing practical knowledge and skills over theory, author Allen Downey shows you how to use data structures to implement efficient algorithms, and then analyze and measure their performance. You'll explore the important classes in the Java collections framework (JCF), how they're implemented, and how they're expected to perform. Each chapter presents hands-on exercises supported by test code online. Use data structures such as lists and maps, and understand how they work Build an application that reads Wikipedia pages, parses the contents, and navigates the resulting data tree Analyze code to predict how fast it will run and how much memory it will require Write classes that implement the Map interface, using a hash table and binary search tree Build a simple web search engine with a crawler, an indexer that stores web page contents, and a retriever that returns user query results Other books by Allen Downey include Think Java, Think Python, Think Stats, and Think Bayes.

Managing Gigabytes

Compressing and Indexing Documents and Images

Author: Ian H. Witten,Alistair Moffat,Timothy C. Bell

Publisher: Morgan Kaufmann

ISBN: 9781558605701

Category: Business & Economics

Page: 519

View: 932

In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web. * Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding * New sections on content-based index compression and distributed querying, with 2 new data structures for fast indexing * New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2 * New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval * Accompanied by a public domain system called MG which is a fully worked-out operational example of the advanced techniques developed and explained in the book * New appendix on an existing digital library system that uses the MG software

Mining the Web

Discovering Knowledge from Hypertext Data

Author: Soumen Chakrabarti

Publisher: Morgan Kaufmann

ISBN: 9781558607545

Category: Computers

Page: 345

View: 480

The definitive book on mining the Web from the preeminent authority.

Current Challenges in Patent Information Retrieval

Author: Mihai Lupu,Katja Mayer,Noriko Kando,Anthony J. Trippe

Publisher: Springer

ISBN: 3662538172

Category: Computers

Page: 455

View: 5215

This second edition provides a systematic introduction to the work and views of the emerging patent-search research and innovation communities as well as an overview of what has been achieved and, perhaps even more importantly, of what remains to be achieved. It revises many of the contributions of the first edition and adds a significant number of new ones. The first part “Introduction to Patent Searching” includes two overview chapters on the peculiarities of patent searching and on contemporary search technology respectively, and thus sets the scene for the subsequent parts. The second part on “Evaluating Patent Retrieval” then begins with two chapters dedicated to patent evaluation campaigns, followed by two chapters discussing complementary issues from the perspective of patent searchers and from the perspective of related domains, notably legal search. “High Recall Search” includes four completely new chapters dealing with the issue of finding only the relevant documents in a reasonable time span. The last (and with six papers the largest) part on “Special Topics in Patent Information Retrieval” covers a large spectrum of research in the patent field, from classification and image processing to translation. Lastly, the book is completed by an outlook on open issues and future research. Several of the chapters have been jointly written by intellectual property and information retrieval experts. However, members of both communities with a background different to that of the primary author have reviewed the chapters, making the book accessible to both the patent search community and to the information retrieval research community. It also not only offers the latest findings for academic researchers, but is also a valuable resource for IP professionals wanting to learn about current IR approaches in the patent domain.

TREC

Experiment and Evaluation in Information Retrieval

Author: E. Voorhees,Donna K. Harman

Publisher: Mit Press

ISBN: N.A

Category: Computers

Page: 462

View: 9928

Results from twelve years of the Text REtrieval Conference (TREC), documenting test collections, evaluation standards, and current best practices.