nexusstc/Advanced Elasticsearch 7.0: A practical guide to designing, indexing, and querying advanced distributed search engines/0113ade3b5a2fe22003391181bc8f5da.pdf
Advanced Elasticsearch 7.0 : A Practical Guide to Designing, Indexing, and Querying Advanced Distributed Search Engines 🔍
Wai Tak Wong
Packt Publishing, Limited, Packt Publishing, Birmingham, 2019
英语 [en] · PDF · 77.4MB · 2019 · 📘 非小说类图书 · 🚀/lgli/lgrs/nexusstc/zlib · Save
描述
Master the intricacies of Elasticsearch 7.0 and use it to create flexible and scalable search solutions Key Features Master the latest distributed search and analytics capabilities of Elasticsearch 7.0 Perform searching, indexing, and aggregation of your data at scale Discover tips and techniques for speeding up your search query performance Book Description Building enterprise-grade distributed applications and executing systematic search operations call for a strong understanding of Elasticsearch and expertise in using its core APIs and latest features. This book will help you master the advanced functionalities of Elasticsearch and understand how you can develop a sophisticated, real-time search engine confidently. In addition to this, you'll also learn to run machine learning jobs in Elasticsearch to speed up routine tasks. You'll get started by learning to use Elasticsearch features on Hadoop and Spark and make search results faster, thereby improving the speed of query results and enhancing the customer experience. You'll then get up to speed with performing analytics by building a metrics pipeline, defining queries, and using Kibana for intuitive visualizations that help provide decision-makers with better insights. The book will later guide you through using Logstash with examples to collect, parse, and enrich logs before indexing them in Elasticsearch. By the end of this book, you will have comprehensive knowledge of advanced topics such as Apache Spark support, machine learning using Elasticsearch and scikit-learn, and real-time analytics, along with the expertise you need to increase business productivity, perform analytics, and get the very best out of Elasticsearch. What you will learn Pre-process documents before indexing in ingest pipelines Learn how to model your data in the real world Get to grips with using Elasticsearch for exploratory data analysis Understand how to build analytics and RESTful services Use Kibana, Logstash, and Beats for dashboard applications Get up to speed with Spark and Elasticsearch for real-time analytics Explore the basics of Spring Data Elasticsearch, and understand how to index, search, and query in a Spring application Who this book is for This book is for Elasticsearch developers and data engineers who want to take their basic knowledge of Elasticsearch to the next level and use it to build enterprise-grade distributed search applications. Prior experience of working with Elasticsearch will be us ..
备用文件名
lgrsnf/advanced-elasticsearch-7-practical-distributed.pdf
备用文件名
zlib/Computers/Wai Tak Wong/Advanced Elasticsearch 7.0: A practical guide to designing, indexing, and querying advanced distributed search engines_12115629.pdf
备选标题
Advanced Elasticsearch 7. 0: A Practical Guide to Designing, Indexing, and Querying Advanced Distributed Search Engines
备选作者
Wai Wong; Safari, an O'Reilly Media Company
备选作者
Wong, Wai Tak
备用版本
1st edition, Erscheinungsort nicht ermittelbar, 2019
备用版本
United Kingdom and Ireland, United Kingdom
备用版本
1st ed, Birmingham, 2019
备用版本
2019-08-23
元数据中的注释
lg2992087
元数据中的注释
{"isbns":["1789957753","9781789957754"],"last_page":560,"publisher":"Packt Publishing"}
备用描述
Cover
Title Page
Copyright and Credits
Dedication
About Packt
Contributors
Table of Contents
Preface
Section 1: Fundamentals and Core APIs
Chapter 1: Overview of Elasticsearch 7
Preparing your environment
Running Elasticsearch
Basic Elasticsearch configuration
Important system configuration
Talking to Elasticsearch
Using Postman to work with the Elasticsearch REST API
Elasticsearch architectural overview
Elastic Stack architecture
Elasticsearch architecture
Between the Elasticsearch index and the Lucene index
Key concepts
Mapping concepts across SQL and Elasticsearch
Mapping
Analyzer
Standard analyzer
API conventions
New features
New features to be discussed
New features with description and issue number
Breaking changes
Aggregations changes
Analysis changes
API changes
Cluster changes
Discovery changes
High-level REST client changes
Low-level REST client changes
Indices changes
Java API changes
Mapping changes
ML changes
Packaging changes
Search changes
Query DSL changes
Settings changes
Scripting changes
Migration between versions
Summary
Chapter 2: Index APIs
Index management APIs
Basic CRUD APIs
Index settings
Index templates
Index aliases
Reindexing with zero downtime
Grouping multiple indices
Views on a subset of documents
Miscellaneous
Monitoring indices
Indices stats
Indices segments, recovery, and share stores
Index persistence
Advanced index management APIs
Split index
Shrink index
Rollover index
Summary
Chapter 3: Document APIs
The Elasticsearch document life cycle
What is a document?
The document life cycle
Single document management APIs
Sample documents
Indexing a document
Retrieving a document by identifier
Updating a document
Removing a document by identifier
Multi-document management APIs
Retrieving multiple documents
Bulk API
Update by query API
Delete by query API
Reindex API
Copying documents
Migration from a multiple mapping types index
Summary
Chapter 4: Mapping APIs
Dynamic mapping
Mapping rules
Dynamic templates
Meta fields in mapping
Field datatypes
Static mapping for the sample document
Mapping parameters
Refreshing mapping changes for static mapping
Typeless APIs working with old custom index types
Summary
Chapter 5: Anatomy of an Analyzer
An analyzer's components
Character filters
The html_strip filter
The mapping filter
The pattern_replace filter
Tokenizers
Token filters
Built-in analyzers
Custom analyzers
Normalizers
Summary
Chapter 6: Search APIs
Indexing sample documents
Search APIs
URI search
Request body search
The sort parameter
The scroll parameter
The search_after parameter
The rescore parameter
The _name parameter
The collapse parameter
The highlighting parameter
Other search parameters
Query DSL
Full text queries
The match keyword
The query string keyword
The intervals keyword
Term-level queries
Compound queries
The script query
The multi-search API
Other search-related APIs
The _explain API
The _validate API
The _count API
The field capabilities API
Profiler
Suggesters
Summary
Section 2: Data Modeling, Aggregations Framework, Pipeline, and Data Analytics
Chapter 7: Modeling Your Data in the Real World
The Investor Exchange Cloud
Modeling data and the approaches
Data denormalization
Using an array of objects datatype
Nested object mapping datatypes
Join datatypes
Parent ID query
has_child query
has_parent query
Practical considerations
Summary
Chapter 8: Aggregation Frameworks
ETF historical data preparation
Aggregation query syntax
Matrix aggregations
Matrix stats
Metrics aggregations
avg
weighted_avg
cardinality
value_count
sum
min
max
stats
extended_stats
top_hit
percentiles
percentile_ranks
median_absolute_deviation
geo_bound
geo_centroid
scripted_metric
Bucket aggregations
histogram
date_histogram
auto_date_histogram
ranges
date_range
ip_range
filter
filters
term
significant_terms
significant_text
sampler
diversified_sampler
nested
reverse_nested
global
missing
composite
adjacency_matrix
parent
children
geo_distance
geohash_grid
geotile_grid
Pipeline aggregations
Sibling family
avg_bucket
max_bucket
min_bucket
sum_bucket
stats_bucket
extended_stats_bucket
percentiles_bucket
Parent family
cumulative_sum
derivative
bucket_script
bucket_selector
bucket_sort
serial_diff
Moving average aggregation
simple
linear
ewma
holt
holt_winters
Moving function aggregation
max
min
sum
stdDev
unweightedAvg
linearWeightedAvg
ewma
holt
holtWinters
Post filter on aggregations
Summary
Chapter 9: Preprocessing Documents in Ingest Pipelines
Ingest APIs
Accessing data in pipelines
Processors
Conditional execution in pipelines
Handling failures in pipelines
Summary
Chapter 10: Using Elasticsearch for Exploratory Data Analysis
Business analytics
Operational data analytics
Sentiment analysis
Summary
Section 3: Programming with the Elasticsearch Client
Chapter 11: Elasticsearch from Java Programming
Overview of Elasticsearch Java REST client
The Java low-level REST client
The Java low-level REST client workflow
REST client initialization
Performing requests using a REST client
Handing responses
Testing with Swagger UI
New features
The Java high-level REST client
The Java high-level REST client workflow
REST client initialization
Performing requests using the REST client
Handling responses
Testing with Swagger UI
New features
Spring Data Elasticsearch
Summary
Chapter 12: Elasticsearch from Python Programming
Overview of the Elasticsearch Python client
The Python low-level Elasticsearch client
Workflow for the Python low-level Elasticsearch client
Client initialization
Performing requests
Handling responses
The Python high-level Elasticsearch library
Illustrating the programming concept
Initializing a connection
Performing requests
Handling responses
The query class
The aggregations class
Summary
Section 4: Elastic Stack
Chapter 13: Using Kibana, Logstash, and Beats
Overview of the Elastic Stack
Running the Elastic Stack with Docker
Running Elasticsearch in a Docker container
Running Kibana in a Docker container
Running Logstash in a Docker container
Running Beats in a Docker container
Summary
Chapter 14: Working with Elasticsearch SQL
Overview
Getting started
Elasticsearch SQL language
Reserved keywords
Data type
Operators
Functions
Aggregate
Grouping
Date-time
Full-text search
Mathematics
String
Type conversion
Conditional
System
Elasticsearch SQL query syntax
New features
Elasticsearch SQL REST API
Elasticsearch SQL JDBC
Upgrading Elasticsearch from a basic to a trial license
Workflow of Elasticsearch SQL JDBC
Testing with Swagger UI
Summary
Chapter 15: Working with Elasticsearch Analysis Plugins
What are Elasticsearch plugins?
Plugin management
Working with the ICU Analysis plugin
Examples
Working with the Smart Chinese Analysis plugin
Examples
Working with the IK Analysis plugin
Examples
Configuring a custom dictionary in the IK Analysis plugin
Summary
Section 5: Advanced Features
Chapter 16: Machine Learning with Elasticsearch
Machine learning with Elastic Stack
Machine learning APIs
Machine learning jobs
Sample data
Running a single-metric job
Creating index patterns
Creating a new machine learning job
Examining the result
Machine learning using Elasticsearch and scikit-learn
Summary
Chapter 17: Spark and Elasticsearch for Real-Time Analytics
Overview of ES-Hadoop
Apache Spark support
Real-time analytics using Elasticsearch and Apache Spark
Building a virtual environment to run the sample ES-Hadoop project
Running the sample ES-Hadoop project
Running the sample ES-Hadoop project using a prepared Docker image
Source code
Summary
Chapter 18: Building Analytics RESTful Services
Building a RESTful web service with Spring Boot
Project program structure
Running the program and examining the APIs
Main workflow anatomy
Building the analytic model
Performing daily update data
Getting the registered symbols
Building the scheduler
Integration with the Bollinger Band
Building a Java Spark ML module for k-means anomaly detection
Source code
Testing Analytics RESTful services
Testing the build-analytics-model API
Testing the get-register-symbols API
Working with Kibana to visualize the analytics results
Summary
Other Books You May Enjoy
Index
Title Page
Copyright and Credits
Dedication
About Packt
Contributors
Table of Contents
Preface
Section 1: Fundamentals and Core APIs
Chapter 1: Overview of Elasticsearch 7
Preparing your environment
Running Elasticsearch
Basic Elasticsearch configuration
Important system configuration
Talking to Elasticsearch
Using Postman to work with the Elasticsearch REST API
Elasticsearch architectural overview
Elastic Stack architecture
Elasticsearch architecture
Between the Elasticsearch index and the Lucene index
Key concepts
Mapping concepts across SQL and Elasticsearch
Mapping
Analyzer
Standard analyzer
API conventions
New features
New features to be discussed
New features with description and issue number
Breaking changes
Aggregations changes
Analysis changes
API changes
Cluster changes
Discovery changes
High-level REST client changes
Low-level REST client changes
Indices changes
Java API changes
Mapping changes
ML changes
Packaging changes
Search changes
Query DSL changes
Settings changes
Scripting changes
Migration between versions
Summary
Chapter 2: Index APIs
Index management APIs
Basic CRUD APIs
Index settings
Index templates
Index aliases
Reindexing with zero downtime
Grouping multiple indices
Views on a subset of documents
Miscellaneous
Monitoring indices
Indices stats
Indices segments, recovery, and share stores
Index persistence
Advanced index management APIs
Split index
Shrink index
Rollover index
Summary
Chapter 3: Document APIs
The Elasticsearch document life cycle
What is a document?
The document life cycle
Single document management APIs
Sample documents
Indexing a document
Retrieving a document by identifier
Updating a document
Removing a document by identifier
Multi-document management APIs
Retrieving multiple documents
Bulk API
Update by query API
Delete by query API
Reindex API
Copying documents
Migration from a multiple mapping types index
Summary
Chapter 4: Mapping APIs
Dynamic mapping
Mapping rules
Dynamic templates
Meta fields in mapping
Field datatypes
Static mapping for the sample document
Mapping parameters
Refreshing mapping changes for static mapping
Typeless APIs working with old custom index types
Summary
Chapter 5: Anatomy of an Analyzer
An analyzer's components
Character filters
The html_strip filter
The mapping filter
The pattern_replace filter
Tokenizers
Token filters
Built-in analyzers
Custom analyzers
Normalizers
Summary
Chapter 6: Search APIs
Indexing sample documents
Search APIs
URI search
Request body search
The sort parameter
The scroll parameter
The search_after parameter
The rescore parameter
The _name parameter
The collapse parameter
The highlighting parameter
Other search parameters
Query DSL
Full text queries
The match keyword
The query string keyword
The intervals keyword
Term-level queries
Compound queries
The script query
The multi-search API
Other search-related APIs
The _explain API
The _validate API
The _count API
The field capabilities API
Profiler
Suggesters
Summary
Section 2: Data Modeling, Aggregations Framework, Pipeline, and Data Analytics
Chapter 7: Modeling Your Data in the Real World
The Investor Exchange Cloud
Modeling data and the approaches
Data denormalization
Using an array of objects datatype
Nested object mapping datatypes
Join datatypes
Parent ID query
has_child query
has_parent query
Practical considerations
Summary
Chapter 8: Aggregation Frameworks
ETF historical data preparation
Aggregation query syntax
Matrix aggregations
Matrix stats
Metrics aggregations
avg
weighted_avg
cardinality
value_count
sum
min
max
stats
extended_stats
top_hit
percentiles
percentile_ranks
median_absolute_deviation
geo_bound
geo_centroid
scripted_metric
Bucket aggregations
histogram
date_histogram
auto_date_histogram
ranges
date_range
ip_range
filter
filters
term
significant_terms
significant_text
sampler
diversified_sampler
nested
reverse_nested
global
missing
composite
adjacency_matrix
parent
children
geo_distance
geohash_grid
geotile_grid
Pipeline aggregations
Sibling family
avg_bucket
max_bucket
min_bucket
sum_bucket
stats_bucket
extended_stats_bucket
percentiles_bucket
Parent family
cumulative_sum
derivative
bucket_script
bucket_selector
bucket_sort
serial_diff
Moving average aggregation
simple
linear
ewma
holt
holt_winters
Moving function aggregation
max
min
sum
stdDev
unweightedAvg
linearWeightedAvg
ewma
holt
holtWinters
Post filter on aggregations
Summary
Chapter 9: Preprocessing Documents in Ingest Pipelines
Ingest APIs
Accessing data in pipelines
Processors
Conditional execution in pipelines
Handling failures in pipelines
Summary
Chapter 10: Using Elasticsearch for Exploratory Data Analysis
Business analytics
Operational data analytics
Sentiment analysis
Summary
Section 3: Programming with the Elasticsearch Client
Chapter 11: Elasticsearch from Java Programming
Overview of Elasticsearch Java REST client
The Java low-level REST client
The Java low-level REST client workflow
REST client initialization
Performing requests using a REST client
Handing responses
Testing with Swagger UI
New features
The Java high-level REST client
The Java high-level REST client workflow
REST client initialization
Performing requests using the REST client
Handling responses
Testing with Swagger UI
New features
Spring Data Elasticsearch
Summary
Chapter 12: Elasticsearch from Python Programming
Overview of the Elasticsearch Python client
The Python low-level Elasticsearch client
Workflow for the Python low-level Elasticsearch client
Client initialization
Performing requests
Handling responses
The Python high-level Elasticsearch library
Illustrating the programming concept
Initializing a connection
Performing requests
Handling responses
The query class
The aggregations class
Summary
Section 4: Elastic Stack
Chapter 13: Using Kibana, Logstash, and Beats
Overview of the Elastic Stack
Running the Elastic Stack with Docker
Running Elasticsearch in a Docker container
Running Kibana in a Docker container
Running Logstash in a Docker container
Running Beats in a Docker container
Summary
Chapter 14: Working with Elasticsearch SQL
Overview
Getting started
Elasticsearch SQL language
Reserved keywords
Data type
Operators
Functions
Aggregate
Grouping
Date-time
Full-text search
Mathematics
String
Type conversion
Conditional
System
Elasticsearch SQL query syntax
New features
Elasticsearch SQL REST API
Elasticsearch SQL JDBC
Upgrading Elasticsearch from a basic to a trial license
Workflow of Elasticsearch SQL JDBC
Testing with Swagger UI
Summary
Chapter 15: Working with Elasticsearch Analysis Plugins
What are Elasticsearch plugins?
Plugin management
Working with the ICU Analysis plugin
Examples
Working with the Smart Chinese Analysis plugin
Examples
Working with the IK Analysis plugin
Examples
Configuring a custom dictionary in the IK Analysis plugin
Summary
Section 5: Advanced Features
Chapter 16: Machine Learning with Elasticsearch
Machine learning with Elastic Stack
Machine learning APIs
Machine learning jobs
Sample data
Running a single-metric job
Creating index patterns
Creating a new machine learning job
Examining the result
Machine learning using Elasticsearch and scikit-learn
Summary
Chapter 17: Spark and Elasticsearch for Real-Time Analytics
Overview of ES-Hadoop
Apache Spark support
Real-time analytics using Elasticsearch and Apache Spark
Building a virtual environment to run the sample ES-Hadoop project
Running the sample ES-Hadoop project
Running the sample ES-Hadoop project using a prepared Docker image
Source code
Summary
Chapter 18: Building Analytics RESTful Services
Building a RESTful web service with Spring Boot
Project program structure
Running the program and examining the APIs
Main workflow anatomy
Building the analytic model
Performing daily update data
Getting the registered symbols
Building the scheduler
Integration with the Bollinger Band
Building a Java Spark ML module for k-means anomaly detection
Source code
Testing Analytics RESTful services
Testing the build-analytics-model API
Testing the get-register-symbols API
Working with Kibana to visualize the analytics results
Summary
Other Books You May Enjoy
Index
备用描述
Master the intricacies of Elasticsearch 7.0 and use it to create flexible and scalable search solutionsKey FeaturesMaster the latest distributed search and analytics capabilities of Elasticsearch 7.0Perform searching, indexing, and aggregation of your data at scaleDiscover tips and techniques for speeding up your search query performanceBook DescriptionBuilding enterprise-grade distributed applications and executing systematic search operations call for a strong understanding of Elasticsearch and expertise in using its core APIs and latest features. This book will help you master the advanced functionalities of Elasticsearch and understand how you can develop a sophisticated, real-time search engine confidently. In addition to this, you'll also learn to run machine learning jobs in Elasticsearch to speed up routine tasks.You'll get started by learning to use Elasticsearch features on Hadoop and Spark and make search results faster, thereby improving the speed of query results and enhancing the customer experience. You'll then get up to speed with performing analytics by building a metrics pipeline, defining queries, and using Kibana for intuitive visualizations that help provide decision-makers with better insights. The book will later guide you through using Logstash with examples to collect, parse, and enrich logs before indexing them in Elasticsearch.By the end of this book, you will have comprehensive knowledge of advanced topics such as Apache Spark support, machine learning using Elasticsearch and scikit-learn, and real-time analytics, along with the expertise you need to increase business productivity, perform analytics, and get the very best out of Elasticsearch.What you will learnPre-process documents before indexing in ingest pipelinesLearn how to model your data in the real worldGet to grips with using Elasticsearch for exploratory data analysisUnderstand how to build analytics and RESTful servicesUse Kibana, Logstash, and Beats for dashboard applicationsGet up to speed with Spark and Elasticsearch for real-time analyticsExplore the basics of Spring Data Elasticsearch, and understand how to index, search, and query in a Spring applicationWho this book is forThis book is for Elasticsearch developers and data engineers who want to take their basic knowledge of Elasticsearch to the next level and use it to build enterprise-grade distributed search applications. Prior experience of working with Elasticsearch will be useful to get the most out of this book.
备用描述
Advanced Elasticsearch 7.0, will help the readers to leverage new features and Core APIs of Elasticsearch to perform advanced search operations. This book covers data modeling, aggregations, pipeline processing, and data Analytics using Elasticsearch
开源日期
2021-04-20
🚀 快速下载
成为会员以支持书籍、论文等的长期保存。为了感谢您对我们的支持,您将获得高速下载权益。❤️
🐢 低速下载
由可信的合作方提供。 更多信息请参见常见问题解答。 (可能需要验证浏览器——无限次下载!)
- 低速服务器(合作方提供) #1 (稍快但需要排队)
- 低速服务器(合作方提供) #2 (稍快但需要排队)
- 低速服务器(合作方提供) #3 (稍快但需要排队)
- 低速服务器(合作方提供) #4 (稍快但需要排队)
- 低速服务器(合作方提供) #5 (无需排队,但可能非常慢)
- 低速服务器(合作方提供) #6 (无需排队,但可能非常慢)
- 低速服务器(合作方提供) #7 (无需排队,但可能非常慢)
- 低速服务器(合作方提供) #8 (无需排队,但可能非常慢)
- 低速服务器(合作方提供) #9 (无需排队,但可能非常慢)
- 下载后: 在我们的查看器中打开
所有选项下载的文件都相同,应该可以安全使用。即使这样,从互联网下载文件时始终要小心。例如,确保您的设备更新及时。
外部下载
-
对于大文件,我们建议使用下载管理器以防止中断。
推荐的下载管理器:JDownloader -
您将需要一个电子书或 PDF 阅读器来打开文件,具体取决于文件格式。
推荐的电子书阅读器:Anna的档案在线查看器、ReadEra和Calibre -
使用在线工具进行格式转换。
推荐的转换工具:CloudConvert和PrintFriendly -
您可以将 PDF 和 EPUB 文件发送到您的 Kindle 或 Kobo 电子阅读器。
推荐的工具:亚马逊的“发送到 Kindle”和djazz 的“发送到 Kobo/Kindle” -
支持作者和图书馆
✍️ 如果您喜欢这个并且能够负担得起,请考虑购买原版,或直接支持作者。
📚 如果您当地的图书馆有这本书,请考虑在那里免费借阅。
下面的文字仅以英文继续。
总下载量:
“文件的MD5”是根据文件内容计算出的哈希值,并且基于该内容具有相当的唯一性。我们这里索引的所有影子图书馆都主要使用MD5来标识文件。
一个文件可能会出现在多个影子图书馆中。有关我们编译的各种数据集的信息,请参见数据集页面。
有关此文件的详细信息,请查看其JSON 文件。 Live/debug JSON version. Live/debug page.