
Apache Solr Search Patterns PDF

435 Pages·2015·7.359 MB·English

Preview Apache Solr Search Patterns

Table of Contents

Apache Solr Search Patterns
Credits
About the Author
About the Reviewers
www.PacktPub.com
    Support files, eBooks, discount offers, and more
    Why subscribe?
    Free access for Packt account holders
Preface
    What this book covers
    What you need for this book
    Who this book is for
    Conventions
    Reader feedback
    Customer support
    Downloading the example code
    Errata
    Piracy
    Questions

1. Solr Indexing Internals
    The job site problem statement – Solr indexing fundamentals
    Working of analyzers, tokenizers, and filters
    Handling a multilingual search
    Measuring the quality of search results
    The e-commerce problem statement
    The job site problem statement
    Challenges of large-scale indexing
    Using multiple threads for indexing on Solr
    Using the Java binary format of data for indexing
    Using the ConcurrentUpdateSolrServer class for indexing
    Solr configuration changes that can improve indexing performance
    Planning your commit strategy
    Using better hardware
    Distributed indexing
    The SolrCloud solution
    Summary

2. Customizing the Solr Scoring Algorithm
    Relevance calculation
    Building a custom scorer
    Drawbacks of the TF-IDF model
    The information gain model
    Implementing the information gain model
    Options to TF-IDF similarity
    BM25 similarity
    DFR similarity
    Summary

3. Solr Internals and Custom Queries
    Working of a scorer on an inverted index
    Working of OR and AND clauses
    The eDisMax query parser
    Working of the eDisMax query parser
    The minimum should match parameter
    Working of filters
    Using BRS queries instead of DisMax
    Building a custom query parser
    Proximity search using SWAN queries
    Creating a parboiled parser
    Building a Solr plugin for SWAN queries
    Integrating the SWAN plugin in Solr
    Summary

4. Solr for Big Data
    Introduction to big data
    Getting data points using facets
    Field faceting
    Query and range faceting
    Radius faceting for location-based data
    The geofilt filter
    The bounding box filter
    The rectangle filter
    Distance function queries
    Radius faceting
    Data analysis using pivot faceting
    Graphs for analytics
    Getting started with Highcharts
    Displaying Solr data using Highcharts
    Summary

5. Solr in E-commerce
    Designing an e-commerce search
    Handling unclean data
    Handling variations in the product
    Sorting
    Problems and solutions of flash sale searches
    Faceting with the option of multi-select
    Faceting with hierarchical taxonomy
    Faceting with size
    Implementing semantic search
    Optimizations
    Summary

6. Solr for Spatial Search
    Features of spatial search
    Java Topology Suite
    Well-known Text
    The Spatial4j library
    Lucene 4 spatial module
    SpatialRecursivePrefixTreeFieldType
    BBoxField (to be introduced in Solr 4.10)
    Indexing for spatial search
    Searching and filtering on a spatial index
    The bbox query
    Distance sort and relevancy boost
    Advanced concepts
    Quadtree
    Indexing data
    Searching data
    Geohash
    Summary

7. Using Solr in an Advertising System
    Ad system functionalities
    Architecture of an ad distribution system
    Requirements of an ad distribution system
    Schema for a listing ad
    Schema for targeted ads
    Performance improvements
    fieldCache
    fieldValueCache
    documentCache
    filterCache
    queryResultCache
    Application cache
    Garbage collection
    Merging Solr with Redis
    Summary

8. AJAX Solr
    The purpose of AJAX Solr
    The AJAX Solr architecture
    The Manager controller
    The ParameterStore model
    Available parameters
    Exposed parameters
    Using the ParameterHashStore class
    Extending the ParameterStore class
    Widgets
    Working with AJAX Solr
    Talking to AJAX Solr
    Displaying the result
    Adding facets
    Adding pagination
    Adding a tag cloud
    Performance tuning
    Summary

9. SolrCloud
    The SolrCloud architecture
    Centralized configuration
    Setting up SolrCloud
    Test setup for SolrCloud
    Setting up SolrCloud in production
    Setting up the Zookeeper ensemble
    Setting up Tomcat with Solr
    Distributed indexing and search
    Routing documents to a particular shard
    Adding more nodes to the SolrCloud
    Fault tolerance and high availability in SolrCloud
    Advanced sharding with SolrCloud
    Shard splitting
    Deleting a shard
    Moving the existing shard to a new node
    Shard splitting based on split key
    Asynchronous calls
    Migrating documents to another collection
    Sizing and monitoring of SolrCloud
    Using SolrCloud as a NoSQL database
    Summary

10. Text Tagging with Lucene FST
    An overview of FST and text tagging
    Implementation of FST in Lucene
    Text tagging algorithms
    Fuzzy string matching algorithm
    The Levenshtein distance algorithm
    Damerau–Levenshtein distance
    Using Solr for text tagging
    Implementing a text tagger using Solr
    Summary

Index
