Elasticsearch Optimization

Elasticsearch is a distributed search and analytics engine that can efficiently store and index data to support fast searches. It can handle structured, unstructured, numerical, or geospatial data. Elasticsearch offers the speed and flexibility to handle data in many use cases, such as adding search to apps and websites, storing logs and metrics, and using machine learning to model data in real time. Kibana enables users to interactively explore, visualize, and share insights from data. It allows searching, observing, and protecting data, as well as analyzing data through visualizations and dashboards. Logstash is an open-source data collection engine that can dynamically unify data from various sources and normalize it for downstream analytics. NLTK is a Python toolkit that supports text processing tasks such as tokenizing, stemming, lemmatizing, and part-of-speech tagging.
Uploaded by Ayaan Mukherjee

Elasticsearch: Store, Search, and Analyze

By Ketan Bansal
What is Elasticsearch?

● Elasticsearch is the distributed search and analytics engine at the heart of the Elastic Stack.
● It provides near real-time search and analytics for all types of data (structured, unstructured, numerical, or geospatial).
● It can efficiently store and index data in a way that supports fast searches.
● You can go far beyond simple data retrieval and aggregate information to discover trends and patterns in your data.
What is Elasticsearch?

● Elasticsearch offers the speed and flexibility to handle data in a wide variety of use cases:
* Add a search box to an app or website
* Store and analyze logs, metrics, and security event data
* Use machine learning to automatically model the behaviour of your data in real time
A. Create and Delete an Index ( Elasticsearch using Python)
B. Insert and Get Query ( Elasticsearch using Python)
C. Search Query ( Elasticsearch using Python)
D. Mapping ( Elasticsearch using Python)
D.1. Mapping ( Elasticsearch using Python)
D.2. Custom-Mapping ( Elasticsearch using Python)
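A minimal sketch of steps A–D above, under stated assumptions: the elasticsearch-py v8 client, a cluster reachable at http://localhost:9200, and an illustrative index name and field mapping (none of these are from the original deck):

```python
INDEX = "articles"  # illustrative index name

# D.2: an explicit custom mapping; field names and types are illustrative
MAPPINGS = {
    "properties": {
        "title": {"type": "text"},
        "views": {"type": "integer"},
        "published": {"type": "date"},
    }
}

def run_demo() -> None:
    # Imported here so the sketch can be read without the client installed
    from elasticsearch import Elasticsearch

    es = Elasticsearch("http://localhost:9200")

    # A. Create an index with the custom mapping (deleted again at the end)
    es.indices.create(index=INDEX, mappings=MAPPINGS)

    # B. Insert a document, then get it back by id
    es.index(index=INDEX, id="1", document={
        "title": "Tuning Elasticsearch",
        "views": 42,
        "published": "2023-01-15",
    })
    print(es.get(index=INDEX, id="1")["_source"])

    # C. Full-text search on the "title" field
    es.indices.refresh(index=INDEX)
    resp = es.search(index=INDEX, query={"match": {"title": "elasticsearch"}})
    print(resp["hits"]["total"]["value"])

    # D.1. Inspect the mapping, then clean up
    print(es.indices.get_mapping(index=INDEX))
    es.indices.delete(index=INDEX)

if __name__ == "__main__":
    run_demo()
```

The same calls appear in the practice notebook linked at the end; in the older v7 client the request body is passed via `body=` instead of keyword arguments.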
Kibana: Explore, Visualize, and Share

By Your Name
What is Kibana?

● Kibana enables you to interactively explore, visualize, and share insights into your data, and to manage and monitor the Elastic Stack.

● With Kibana, we can:

* Search, observe, and protect the data - from discovering documents to analyzing logs to finding security vulnerabilities
* Analyze your data - search for hidden insights, visualize what we've found in charts, maps, and more, and combine them in a dashboard
* Manage, monitor, and secure the Elastic Stack - manage your data, monitor the health of Elasticsearch, and manage access to its features
Add Data

● The best way to add data to the Elastic Stack is to use one of the integrations from the Kibana dashboard, such as:

1. Add data with Elastic solutions - website search crawler, Elastic APM, Endpoint Security

2. Add data with programming languages - index any data into Elasticsearch from any programming language, such as JavaScript, Java, Python, or Ruby

3. Add sample data - sample data sets come with sample visualizations, dashboards, and more, letting you explore Kibana before you add your own data

4. Upload a file - if you have a CSV, TSV, or JSON file, you can upload it and optionally import it into Elasticsearch
Kibana Query Language (KQL)

● KQL is a simple syntax for filtering Elasticsearch data using free-text search or field-based search
● It is used only for filtering data, and has no role in sorting or aggregating data
● It can query nested fields and scripted fields, but does not support regular expressions or searching with fuzzy terms
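For illustration, a few representative KQL filters; all field names and values here are made up:

```
response.status: 200
http.response.status_code >= 400 and url.path: "/login"
user.name: "kim" or user.name: "lee"
message: "connection refused"
```

The first is a field-based match, the second combines a range condition with an `and`, and the last is a quoted phrase match on a text field.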
Logstash: Collect, Enrich, and Transport

By Your Name
What is Logstash?

● Logstash is an open-source data collection engine with real-time pipeline capabilities
* The Logstash event processing pipeline has three stages: inputs → filters → outputs
* Inputs generate events, filters modify them, and outputs ship them elsewhere

● It can dynamically unify data from disparate sources and normalize the data into the destinations of our choice
● It cleanses and democratizes all your data for diverse advanced downstream analytics and visualization use cases
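The three-stage pipeline can be sketched as a minimal Logstash config; `stdin`, `grok`, and `stdout` are standard Logstash plugins, but the grok pattern itself is illustrative:

```
# inputs -> filters -> outputs
input {
  stdin { }                 # inputs generate events (here, lines typed on stdin)
}

filter {
  grok {                    # filters modify events (here, parse fields out of the line)
    match => { "message" => "%{IPORHOST:client} %{WORD:verb} %{URIPATHPARAM:request}" }
  }
}

output {
  stdout { codec => rubydebug }  # outputs ship events elsewhere (here, pretty-print them)
}
```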
Natural Language Toolkit (NLTK)

By Your Name
What is NLTK?

● The Natural Language Toolkit (NLTK) is a suite of open-source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing

● A variety of text processing tasks can be performed using NLTK, such as tokenizing, stemming, lemmatizing, and tagging parts of speech
Tokenizing

● By tokenizing, you can easily split up text by word or by sentence

● It converts the whole text into smaller pieces that are still relatively meaningful outside of the main text (turning unstructured data into structured data)

* Tokenizing by words: tokenizing by word allows you to identify words that come up most often

word_tokenize(your_text) is the function used to tokenize your text into words
Tokenizing

* Tokenizing by sentence: when we tokenize by sentence, we can analyze how those words are related to one another and see more context

sent_tokenize(your_text) is the function used to tokenize your text into sentences

NOTE: Before using these functions, you need to first import the relevant parts of NLTK
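A minimal sketch of both tokenizers; it assumes nltk is installed and the "punkt" tokenizer models have been fetched with `nltk.download("punkt")`, and the sample sentence is made up:

```python
def tokenize_demo() -> None:
    # Imported inside the function so the sketch can be read without
    # nltk installed; requires the "punkt" models to be downloaded first.
    from nltk.tokenize import word_tokenize, sent_tokenize

    text = "Elasticsearch stores the data. Kibana makes it visible."
    print(word_tokenize(text))  # word-level tokens, punctuation included
    print(sent_tokenize(text))  # one string per sentence

if __name__ == "__main__":
    tokenize_demo()
```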
Stemming

● Stemming is a text processing task in which you reduce words to their roots, the core part of a word

● For example, "helping" and "helper" share the same root, "help"

● NLTK has more than one stemmer, but we'll use the Porter stemmer
Stemming

Where “words” is a list of tokenized words


Tagging Parts of Speech

● Tagging parts of speech, or POS tagging, is the task of labelling the words in our text according to their parts of speech

● NLTK uses the word "determiner" to refer to articles (like "a" or "the")

● nltk.pos_tag() is the function used for tagging; it returns the output as a list of (word, tag) tuples
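A sketch of POS tagging on an already-tokenized sentence; it assumes the "averaged_perceptron_tagger" data has been fetched with `nltk.download(...)`, and the token list is made up:

```python
def pos_demo() -> None:
    # Imported inside the function; pos_tag needs the
    # "averaged_perceptron_tagger" data to be downloaded first.
    import nltk

    tokens = ["The", "engine", "indexes", "documents", "quickly"]
    # Returns a list of (word, tag) tuples; "The" is tagged DT (determiner)
    print(nltk.pos_tag(tokens))

if __name__ == "__main__":
    pos_demo()
```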
Lemmatizing

● Like stemming, lemmatizing reduces words to their core meaning, but it gives you a complete English word that makes sense on its own, instead of just a fragment of a word like "discoveri"
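A sketch of NLTK's WordNet lemmatizer; it assumes the "wordnet" corpus has been fetched with `nltk.download("wordnet")`, and the example words are illustrative:

```python
def lemmatize_demo() -> None:
    # Imported inside the function; WordNetLemmatizer needs the
    # "wordnet" corpus to be downloaded first.
    from nltk.stem import WordNetLemmatizer

    lemmatizer = WordNetLemmatizer()
    # Unlike a stemmer, the lemmatizer returns a real English word:
    # "discoveries" lemmatizes to "discovery", not "discoveri"
    print(lemmatizer.lemmatize("discoveries"))
    # With an explicit part of speech it can resolve irregular forms
    print(lemmatizer.lemmatize("worst", pos="a"))

if __name__ == "__main__":
    lemmatize_demo()
```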
Elasticsearch practice:
https://github.com/S19CRXPP0098/Practice/blob/main/Elasticsearch_Practice.ipynb

NLTK practice:
https://github.com/S19CRXPP0098/Practice/blob/main/NLTK_Practice.ipynb
THANK YOU
