Users expect search to be simple: They enter a few terms and expect perfectly-organized, relevant results instantly. But behind this simple user experience, complex machinery is at work. Whether using Elasticsearch, Solr, or another search technology, the solution is never one size fits all. Returning the right search results requires conveying domain knowledge and business rules in the search engine's data structures, text analytics, and results ranking capabilities. Relevant Search demystifies relevance work. Using Elasticsearch, it tells how to return engaging search results to users, helping readers understand and leverage the internals of Lucene-based search engines. The book walks through several real-world problems using a cohesive philosophy that combines text analysis, query building, and score shaping to express business ranking rules to the search engine. It outlines how to guide the engineering process by monitoring search user behavior and shifting the enterprise to a search-first culture focused on humans, not computers. It also shows how the search engine provides a deeply pluggable platform for integrating search ranking with machine learning, ontologies, personalization, domain-specific expertise, and other enriching sources. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
Provides comprehensive coverage of everything that students and practitioners need to know about working in the field of forensic anthropology Forensic anthropology has been plagued by questions of scientific validity and rigor despite its acceptance as a section in the American Academy of Forensic Sciences nearly half a century ago. Critics have viewed it as a laboratory-based applied subfield of biological anthropology, and characterised it as emphasising methodology over theory. This book shows that these views are not only antiquated, but inadequate and inaccurate. Forensic Anthropology: Theoretical Framework and Scientific Basis introduces readers to all of the theoretical and scientific foundations of forensic anthropology — beginning with how it was influenced by the early theoretical approaches of Tyler, Morgan, Spencer and Darwin. It instructs on how modern forensic science relies on an interdisciplinary approach — with research being conducted in the fields of archaeology, physics, geology and other disciplines. This modern approach to theory in forensic anthropology is presented through the introduction and discussion of Foundational, Interpretive and Methodological theories. Sections cover: Bias and Objectivity in Forensic Anthropology Theory and Practice; The Theory and Science Behind Biological Profile and Personal Identification; Scientific Foundation for Interpretations of Antemortem, Perimortem, and Postmortem Processes; and Interdisciplinary Influences, Legal Ramifications and Future Directions. Illustrates important aspects of the theory building process and reflects methods for strengthening the scientific framework of forensic anthropology as a discipline Inspired by the “Application of Theory to Forensic Anthropology” symposium presented at the 67th annual meeting of the American Academy of Forensic Sciences Chapters written by experts in the field who were presenters at the symposium Forensic Anthropology: Theoretical Framework and Scientific Basis is ideal for university courses in anthropological science, forensic science, criminal science and forensic archaeology.
Designing and Building Big Data Systems using the Hadoop Ecosystem
Author: Kerry Koitzsch
Learn advanced analytical techniques and leverage existing tool kits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems that go beyond the basics of classification, clustering, and recommendation. Pro Hadoop Data Analytics emphasizes best practices to ensure coherent, efficient development. A complete example system will be developed using standard third-party components that consist of the tool kits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system. The book also highlights the importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. You'll discover the importance of mix-and-match or hybrid systems, using different analytical components in one application. This hybrid approach will be prominent in the examples. What You'll Learn Build big data analytic systems with the Hadoop ecosystem Use libraries, tool kits, and algorithms to make development easier and more effective Apply metrics to measure performance and efficiency of components and systems Connect to standard relational databases, noSQL data sources, and more Follow case studies with example components to create your own systems Who This Book Is For Software engineers, architects, and data scientists with an interest in the design and implementation of big data analytical systems using Hadoop, the Hadoop ecosystem, and other associated technologies.
Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications, Volume 38, the latest release in this monograph that provides a cohesive and integrated exposition of these advances and associated applications, includes new chapters on Linguistics: Core Concepts and Principles, Grammars, Open-Source Libraries, Application Frameworks, Workflow Systems, Mathematical Essentials, Probability, Inference and Prediction Methods, Random Processes, Bayesian Methods, Machine Learning, Artificial Neural Networks for Natural Language Processing, Information Retrieval, Language Core Tasks, Language Understanding Applications, and more. The synergistic confluence of linguistics, statistics, big data, and high-performance computing is the underlying force for the recent and dramatic advances in analyzing and understanding natural languages, hence making this series all the more important. Provides a thorough treatment of open-source libraries, application frameworks and workflow systems for natural language analysis and understanding Presents new chapters on Linguistics: Core Concepts and Principles, Grammars, Open-Source Libraries, Application Frameworks, Workflow Systems, Mathematical Essentials, Probability, and more
Is your organization rapidly accumulating more information than you know how to manage? This updated edition of Enterprise Search helps you create an enterprise search solution based on more than just technology. Author Martin White shows you how to plan and implement a managed search environment that meets the needs of your business and your employees. You'll learn why it’s absolutely vital to have a dedicated staff manage your search technology and support your users. New material for this second edition includes material on SharePoint 2013 search, managing open source search development, website search, designing the search user, and assessing search performance. Chapters now include a Further Reading section for computer science and information science students. Topics include: 10 critical success factors to assess organizational search maturity Essential skills needed to support a successful search application How to specify and manage open source search development How to manage SharePoint 2013 search Methods to assess the business impact of search Best practices in user interface design The importance of search for websites What to include in a search strategy
If you are a .NET developer who is looking for a simpler way to build services, this is the book for you. It will show you how to write fast, maintainable APIs that are a pleasure to use and maintain starting from the database to the client and everything in-between.
Leverage Elasticsearch to create a robust, fast, and flexible search solution with ease About This Book Boost the searching capabilities of your system through synonyms, multilingual data handling, nested objects and parent-child documents Deep dive into the world of data aggregation and data analysis with ElasticSearch Explore a wide range of ElasticSearch modules that define the behavior of a cluster Who This Book Is For If you are a competent developer and want to learn about the great and exciting world of ElasticSearch, then this book is for you. No prior knowledge of Java or Apache Lucene is needed. What You Will Learn Configure, create, and retrieve data from your indices Use an ElasticSearch query DSL to create a wide range of queries Discover the highlighting and geographical search features offered by ElasticSearch Find out how to index data that is not flat or data that has a relationship Exploit a prospective search to search for queries not documents Use the aggregations framework to get more from your data and improve your client's search experience Monitor your cluster state and health using the ElasticSearch API as well as third-party monitoring solutions Discover how to properly set up ElasticSearch for various use cases In Detail ElasticSearch is a very fast and scalable open source search engine, designed with distribution and cloud in mind, complete with all the goodies that Apache Lucene has to offer. ElasticSearch's schema-free architecture allows developers to index and search unstructured content, making it perfectly suited for both small projects and large big data warehouses, even those with petabytes of unstructured data. This book will guide you through the world of the most commonly used ElasticSearch server functionalities. You'll start off by getting an understanding of the basics of ElasticSearch and its data indexing functionality. Next, you will see the querying capabilities of ElasticSearch, followed by a through explanation of scoring and search relevance. After this, you will explore the aggregation and data analysis capabilities of ElasticSearch and will learn how cluster administration and scaling can be used to boost your application performance. You'll find out how to use the friendly REST APIs and how to tune ElasticSearch to make the most of it. By the end of this book, you will have be able to create amazing search solutions as per your project's specifications. Style and approach This step-by-step guide is full of screenshots and real-world examples to take you on a journey through the wonderful world of full text search provided by ElasticSearch.
Store, search, and analyze your data with ease using Elasticsearch 5.x About This Book Get to grips with the basics of Elasticsearch concepts and its APIs, and use them to create efficient applications Create large-scale Elasticsearch clusters and perform analytics using aggregation This comprehensive guide will get you up and running with Elasticsearch 5.x in no time Who This Book Is For If you want to build efficient search and analytics applications using Elasticsearch, this book is for you. It will also benefit developers who have worked with Lucene or Solr before and now want to work with Elasticsearch. No previous knowledge of Elasticsearch is expected. What You Will Learn See how to set up and configure Elasticsearch and Kibana Know how to ingest structured and unstructured data using Elasticsearch Understand how a search engine works and the concepts of relevance and scoring Find out how to query Elasticsearch with a high degree of performance and scalability Improve the user experience by using autocomplete, geolocation queries, and much more See how to slice and dice your data using Elasticsearch aggregations. Grasp how to use Kibana to explore and visualize your data Know how to host on Elastic Cloud and how to use the latest X-Pack features such as Graph and Alerting In Detail Elasticsearch is a modern, fast, distributed, scalable, fault tolerant, and open source search and analytics engine. You can use Elasticsearch for small or large applications with billions of documents. It is built to scale horizontally and can handle both structured and unstructured data. Packed with easy-to- follow examples, this book will ensure you will have a firm understanding of the basics of Elasticsearch and know how to utilize its capabilities efficiently. You will install and set up Elasticsearch and Kibana, and handle documents using the Distributed Document Store. You will see how to query, search, and index your data, and perform aggregation-based analytics with ease. You will see how to use Kibana to explore and visualize your data. Further on, you will learn to handle document relationships, work with geospatial data, and much more, with this easy-to-follow guide. Finally, you will see how you can set up and scale your Elasticsearch clusters in production environments. Style and approach This comprehensive guide will get you started with Elasticsearch 5.x, so you build a solid understanding of the basics. Every topic is explained in depth and is supplemented with practical examples to enhance your understanding.
Elasticsearch is a distributed search server similar to Apache Solr with a focus on large datasets, schemaless setup, and high availability. Utilizing the Apache Lucene library (also used in Apache Solr), Elasticsearch enables powerful full-text search, as well as autocomplete "morelikethis" search, multilingual functionality, and an extensive search query DSL. This book starts with the creation of a Google-like web search service, enabling you to generate your own search results. You will then learn how an e-commerce website can be built using Elasticsearch. We will discuss various approaches in getting relevant content up the results, such as relevancy based on how well a query matched the text, time-based recent documents, geographically nearer items, and other frequently used approaches. Finally, the book will cover various geocapabilities of Elasticsearch to make your searches similar to real-world scenarios.
LCPD 2013, SUEDL 2013, DataCur 2013, Held in Valletta, Malta, September 22-26, 2013. Revised Selected Papers
Author: Łukasz Bolikowski
This book constitutes the refereed proceedings of the selected workshops co-located with the 17th International Conference on Theory and Practice of Digital Libraries, TPDL 2013, held in Valletta, Malta, in September 2013. The volume is organized in three parts, containing the 26 revised full papers of the three workshops: Linking and Contextualizing Publications and Datasets (LCPD 2013); Supporting Users Exploration of Digital Libraries (SUEDL 2013); Moving beyond technology: iSchools and education in data curation. Is Data Curator a new role? (DataCur 2013).